You are viewing a plain text version of this content. The canonical link for it is here.
Posted to java-user@lucene.apache.org by Jerome Blouin <jb...@expedia.com> on 2013/03/22 17:22:50 UTC

Accent insensitive analyzer

Hello,

I'm looking for an analyzer that allows performing accent insensitive search in latin languages. I'm currently using the StandardAnalyzer but it doesn't fulfill this need. Could you please point me to the one I need to use? I've checked the javadoc for the various analyzer packages but can't find one. Do I need to implement my own analyzer?

Regards,
Jerome


Re: Accent insensitive analyzer

Posted by Erick Erickson <er...@gmail.com>.
ISOLatin1AccentFilter has been deprecated for quite some time,
ASCIIFoldingFilter is preferred....

Best
Erick


On Fri, Mar 22, 2013 at 2:59 PM, Jerome Blouin <jb...@expedia.com> wrote:

> Thanks. I'll check that later.
>
> -----Original Message-----
> From: Sujit Pal [mailto:sujitatgtalk@gmail.com] On Behalf Of SUJIT PAL
> Sent: Friday, March 22, 2013 2:52 PM
> To: java-user@lucene.apache.org
> Subject: Re: Accent insensitive analyzer
>
> Hi Jerome,
>
> How about this one?
>
>
> http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#solr.ISOLatin1AccentFilterFactory
>
> Regards,
> Sujit
>
> On Mar 22, 2013, at 9:22 AM, Jerome Blouin wrote:
>
> > Hello,
> >
> > I'm looking for an analyzer that allows performing accent insensitive
> search in latin languages. I'm currently using the StandardAnalyzer but it
> doesn't fulfill this need. Could you please point me to the one I need to
> use? I've checked the javadoc for the various analyzer packages but can't
> find one. Do I need to implement my own analyzer?
> >
> > Regards,
> > Jerome
> >
>
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
> For additional commands, e-mail: java-user-help@lucene.apache.org
>
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
> For additional commands, e-mail: java-user-help@lucene.apache.org
>
>

RE: Accent insensitive analyzer

Posted by Jerome Blouin <jb...@expedia.com>.
Thanks. I'll check that later.

-----Original Message-----
From: Sujit Pal [mailto:sujitatgtalk@gmail.com] On Behalf Of SUJIT PAL
Sent: Friday, March 22, 2013 2:52 PM
To: java-user@lucene.apache.org
Subject: Re: Accent insensitive analyzer

Hi Jerome,

How about this one?

http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#solr.ISOLatin1AccentFilterFactory

Regards,
Sujit

On Mar 22, 2013, at 9:22 AM, Jerome Blouin wrote:

> Hello,
> 
> I'm looking for an analyzer that allows performing accent insensitive search in latin languages. I'm currently using the StandardAnalyzer but it doesn't fulfill this need. Could you please point me to the one I need to use? I've checked the javadoc for the various analyzer packages but can't find one. Do I need to implement my own analyzer?
> 
> Regards,
> Jerome
> 


---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org


Re: Accent insensitive analyzer

Posted by SUJIT PAL <su...@comcast.net>.
Hi Jerome,

How about this one?

http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#solr.ISOLatin1AccentFilterFactory

Regards,
Sujit

On Mar 22, 2013, at 9:22 AM, Jerome Blouin wrote:

> Hello,
> 
> I'm looking for an analyzer that allows performing accent insensitive search in latin languages. I'm currently using the StandardAnalyzer but it doesn't fulfill this need. Could you please point me to the one I need to use? I've checked the javadoc for the various analyzer packages but can't find one. Do I need to implement my own analyzer?
> 
> Regards,
> Jerome
> 


---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org


Re: Accent insensitive analyzer

Posted by Jack Krupansky <ja...@basetechnology.com>.
Start with the Standard Tokenizer:
https://lucene.apache.org/core/4_2_0/analyzers-common/org/apache/lucene/analysis/standard/StandardTokenizer.html

-- Jack Krupansky

-----Original Message----- 
From: Jerome Blouin
Sent: Friday, March 22, 2013 12:53 PM
To: java-user@lucene.apache.org
Subject: RE: Accent insensitive analyzer

I understand that I can't configure it on an analyzer so on which class can 
I apply it?

Thank,
Jerome

-----Original Message-----
From: Jack Krupansky [mailto:jack@basetechnology.com]
Sent: Friday, March 22, 2013 12:38 PM
To: java-user@lucene.apache.org
Subject: Re: Accent insensitive analyzer

Try the ASCII Folding FIlter:
https://lucene.apache.org/core/4_2_0/analyzers-common/org/apache/lucene/analysis/miscellaneous/ASCIIFoldingFilter.html

-- Jack Krupansky

-----Original Message-----
From: Jerome Blouin
Sent: Friday, March 22, 2013 12:22 PM
To: java-user@lucene.apache.org
Subject: Accent insensitive analyzer

Hello,

I'm looking for an analyzer that allows performing accent insensitive search 
in latin languages. I'm currently using the StandardAnalyzer but it doesn't 
fulfill this need. Could you please point me to the one I need to use? I've 
checked the javadoc for the various analyzer packages but can't find one. Do 
I need to implement my own analyzer?

Regards,
Jerome


---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org 


---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org


RE: Accent insensitive analyzer

Posted by Jerome Blouin <jb...@expedia.com>.
I understand that I can't configure it on an analyzer so on which class can I apply it?

Thank,
Jerome

-----Original Message-----
From: Jack Krupansky [mailto:jack@basetechnology.com] 
Sent: Friday, March 22, 2013 12:38 PM
To: java-user@lucene.apache.org
Subject: Re: Accent insensitive analyzer

Try the ASCII Folding FIlter:
https://lucene.apache.org/core/4_2_0/analyzers-common/org/apache/lucene/analysis/miscellaneous/ASCIIFoldingFilter.html

-- Jack Krupansky

-----Original Message-----
From: Jerome Blouin
Sent: Friday, March 22, 2013 12:22 PM
To: java-user@lucene.apache.org
Subject: Accent insensitive analyzer

Hello,

I'm looking for an analyzer that allows performing accent insensitive search in latin languages. I'm currently using the StandardAnalyzer but it doesn't fulfill this need. Could you please point me to the one I need to use? I've checked the javadoc for the various analyzer packages but can't find one. Do I need to implement my own analyzer?

Regards,
Jerome


---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org


Re: Accent insensitive analyzer

Posted by Jack Krupansky <ja...@basetechnology.com>.
Try the ASCII Folding FIlter:
https://lucene.apache.org/core/4_2_0/analyzers-common/org/apache/lucene/analysis/miscellaneous/ASCIIFoldingFilter.html

-- Jack Krupansky

-----Original Message----- 
From: Jerome Blouin
Sent: Friday, March 22, 2013 12:22 PM
To: java-user@lucene.apache.org
Subject: Accent insensitive analyzer

Hello,

I'm looking for an analyzer that allows performing accent insensitive search 
in latin languages. I'm currently using the StandardAnalyzer but it doesn't 
fulfill this need. Could you please point me to the one I need to use? I've 
checked the javadoc for the various analyzer packages but can't find one. Do 
I need to implement my own analyzer?

Regards,
Jerome


---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org