You are viewing a plain text version of this content. The canonical link for it is here.
Posted to java-user@lucene.apache.org by Stefan Bergstrand <st...@polopoly.com> on 2002/01/18 12:16:44 UTC

Searching for words starting with swedish characters.

When I search in an index for a word that starts with an eight-bit
character I get:


Query: åsnan
Exception in thread "main" org.apache.lucene.queryParser.TokenMgrError: Lexical error at line 1, column 1.  Encountered: "\u00e5" (229), after : ""
        at org.apache.lucene.queryParser.QueryParserTokenManager.getNextToken(Unknown Source)
        at org.apache.lucene.queryParser.QueryParser.jj_ntk(Unknown Source)
        at org.apache.lucene.queryParser.QueryParser.Modifiers(Unknown Source)
        at org.apache.lucene.queryParser.QueryParser.Query(Unknown Source)
        at org.apache.lucene.queryParser.QueryParser.parse(Unknown Source)
        at org.apache.lucene.queryParser.QueryParser.parse(Unknown Source)
        at Lucsearch.main(Lucsearch.java:36)



(My class "Lucsearch", is a slightly modified version of the demo
class "SearchFiles.java".)

It works fine when the problematic character is later in the word.

Any ideas?

/Stefan Bergstrand

--
To unsubscribe, e-mail:   <ma...@jakarta.apache.org>
For additional commands, e-mail: <ma...@jakarta.apache.org>


Re: Searching for words starting with swedish characters.

Posted by Stefan Bergstrand <st...@polopoly.com>.
When I tried using the latest CVS-version of Lucene it worked
fine. Sorry to have bothered you.

/Stefan Bergstrand


Stefan Bergstrand <st...@polopoly.com> writes:

> When I search in an index for a word that starts with an eight-bit
> character I get:
> 
> 
> Query: åsnan
> Exception in thread "main" org.apache.lucene.queryParser.TokenMgrError: Lexical error at line 1, column 1.  Encountered: "\u00e5" (229), after : ""
>         at org.apache.lucene.queryParser.QueryParserTokenManager.getNextToken(Unknown Source)
>         at org.apache.lucene.queryParser.QueryParser.jj_ntk(Unknown Source)
>         at org.apache.lucene.queryParser.QueryParser.Modifiers(Unknown Source)
>         at org.apache.lucene.queryParser.QueryParser.Query(Unknown Source)
>         at org.apache.lucene.queryParser.QueryParser.parse(Unknown Source)
>         at org.apache.lucene.queryParser.QueryParser.parse(Unknown Source)
>         at Lucsearch.main(Lucsearch.java:36)
> 
> 
> 
> (My class "Lucsearch", is a slightly modified version of the demo
> class "SearchFiles.java".)
> 
> It works fine when the problematic character is later in the word.
> 
> Any ideas?
> 
> /Stefan Bergstrand
> 
> --
> To unsubscribe, e-mail:   <ma...@jakarta.apache.org>
> For additional commands, e-mail: <ma...@jakarta.apache.org>

--
To unsubscribe, e-mail:   <ma...@jakarta.apache.org>
For additional commands, e-mail: <ma...@jakarta.apache.org>