Posted to dev@lucene.apache.org by "luciano aparecido dos santos (JIRA)" <ji...@apache.org> on 2013/04/04 20:05:17 UTC

[jira] [Created] (LUCENE-4907) Tokenizer

luciano aparecido dos santos created LUCENE-4907:
----------------------------------------------------

             Summary: Tokenizer
                 Key: LUCENE-4907
                 URL: https://issues.apache.org/jira/browse/LUCENE-4907
             Project: Lucene - Core
          Issue Type: Bug
          Components: modules/analysis
    Affects Versions: 4.2
         Environment: WINDOWS 7, NETBEANS 7.2.1
            Reporter: luciano aparecido dos santos


Hi,

    I'm trying to make searches accent-insensitive, so that words match regardless of special characters such as accents (for example, "ANDRÉ" should match "ANDRE").
I found the ICUTokenizer class, but using it throws: Exception in thread "main" java.lang.NoSuchMethodError: com.ibm.icu.text.UnicodeSet.freeze()Lcom/ibm/icu/text/UnicodeSet;
	at org.apache.lucene.analysis.icu.segmentation.LaoBreakIterator.<clinit>(LaoBreakIterator.java:73)
	at org.apache.lucene.analysis.icu.segmentation.DefaultICUTokenizerConfig.<clinit>(DefaultICUTokenizerConfig.java:73)
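
To make the goal concrete, here is a minimal sketch of the folding I expect, written with only the JDK's java.text.Normalizer rather than any Lucene or ICU class. The class name AccentFolding and the helper stripAccents are hypothetical names chosen for illustration; this is an assumption about the desired result, not a description of how ICUTokenizer works internally.

```java
import java.text.Normalizer;

public class AccentFolding {
    // Hypothetical helper (not part of Lucene or ICU): decomposes each
    // character into base letter + combining marks (NFD), then removes
    // the combining marks so that "É" becomes "E".
    static String stripAccents(String input) {
        String decomposed = Normalizer.normalize(input, Normalizer.Form.NFD);
        // \p{M} matches Unicode combining marks such as U+0301 (acute accent).
        return decomposed.replaceAll("\\p{M}+", "");
    }

    public static void main(String[] args) {
        // Both forms should fold to the same string.
        System.out.println(stripAccents("ANDRÉ").equals(stripAccents("ANDRE"))); // prints "true"
    }
}
```

With this kind of folding applied at both index and query time, a search for "ANDRE" would be expected to find "ANDRÉ".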


      Can somebody help me?

Page ref.: http://lucene.apache.org/core/4_2_0/analyzers-icu/index.html



--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira
