Posted to java-user@lucene.apache.org by Scott Smith <ss...@mainstreamdata.com> on 2012/11/14 18:47:58 UTC
CJKWidthFilter vs ICUFoldingFilter
Reading the documentation for these two filters seems to imply that CJKWidthFilter is a subset of ICUFoldingFilter. Is that true? I'm basically using the CJKAnalyzer (from Lucene 4.0) but adding ICUFoldingFilter because I need umlauts and accents removed from any German, French, etc. text.
Can I just use the ICUFoldingFilter?
Cheers
Scott
RE: CJKWidthFilter vs ICUFoldingFilter
Posted by Scott Smith <ss...@mainstreamdata.com>.
Thanks
-----Original Message-----
From: Robert Muir [mailto:rcmuir@gmail.com]
Sent: Wednesday, November 14, 2012 12:17 PM
To: java-user@lucene.apache.org
Subject: Re: CJKWidthFilter vs ICUFoldingFilter
On Wed, Nov 14, 2012 at 9:47 AM, Scott Smith <ss...@mainstreamdata.com> wrote:
> Reading the documentation for these two filters seems to imply that CJKWidthFilter is a subset of ICUFoldingFilter. Is that true? I'm basically using the CJKAnalyzer (from Lucene 4.0) but adding ICUFoldingFilter because I need umlauts and accents removed from any German, French, etc. text.
>
> Can I just use the ICUFoldingFilter?
Yes. It's a subset of NFKC, which is in turn a subset of ICUFoldingFilter :)
---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org
Re: CJKWidthFilter vs ICUFoldingFilter
Posted by Robert Muir <rc...@gmail.com>.
On Wed, Nov 14, 2012 at 9:47 AM, Scott Smith <ss...@mainstreamdata.com> wrote:
> Reading the documentation for these two filters seems to imply that CJKWidthFilter is a subset of ICUFoldingFilter. Is that true? I'm basically using the CJKAnalyzer (from Lucene 4.0) but adding ICUFoldingFilter because I need umlauts and accents removed from any German, French, etc. text.
>
> Can I just use the ICUFoldingFilter?
Yes. It's a subset of NFKC, which is in turn a subset of ICUFoldingFilter :)
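To make the "subset of NFKC" point concrete, here is a minimal sketch that uses only the JDK's java.text.Normalizer, not Lucene or ICU. It shows that NFKC already performs the two width foldings CJKWidthFilter is documented to do (fullwidth ASCII to basic Latin, halfwidth Katakana to regular kana), and that NFKC alone leaves umlauts in place, which is why the extra folding in ICUFoldingFilter is still needed for the German and French text. The class name WidthFoldingDemo and the helper stripAccents are hypothetical names for illustration; stripAccents is only a rough stand-in for accent removal and is not how ICUFoldingFilter is implemented.

```java
import java.text.Normalizer;

// Hypothetical demo class; it does not use any Lucene or ICU API.
class WidthFoldingDemo {

    // Apply Unicode NFKC normalization with the JDK's built-in normalizer.
    public static String nfkc(String s) {
        return Normalizer.normalize(s, Normalizer.Form.NFKC);
    }

    // Rough stand-in for accent folding (an assumption, not ICU's algorithm):
    // decompose the text, then drop all combining marks. Intended for Latin
    // text only, since it would also drop kana voicing marks.
    public static String stripAccents(String s) {
        String decomposed = Normalizer.normalize(s, Normalizer.Form.NFKD);
        return decomposed.replaceAll("\\p{M}", "");
    }

    public static void main(String[] args) {
        // Fullwidth Latin letters U+FF21..U+FF23 fold to plain "ABC".
        System.out.println(nfkc("\uFF21\uFF22\uFF23"));

        // Halfwidth Katakana KA (U+FF76) folds to regular Katakana KA (U+30AB).
        System.out.println(nfkc("\uFF76").equals("\u30AB"));

        // NFKC keeps the umlaut: "M\u00FCller" comes back unchanged.
        System.out.println(nfkc("M\u00FCller").equals("M\u00FCller"));

        // Removing the combining marks yields "Muller".
        System.out.println(stripAccents("M\u00FCller"));
    }
}
```

In an actual Lucene analysis chain the equivalent work would be done by the filters themselves; this sketch only illustrates the Unicode behavior the answer relies on.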