You are viewing a plain text version of this content. The canonical link for it is here.
Posted to solr-dev@lucene.apache.org by "Doug Treder (JIRA)" <ji...@apache.org> on 2009/06/18 19:58:07 UTC

[jira] Created: (SOLR-1231) query parser fails parsing umlaut character

query parser fails parsing umlaut character
-------------------------------------------

                 Key: SOLR-1231
                 URL: https://issues.apache.org/jira/browse/SOLR-1231
             Project: Solr
          Issue Type: Bug
          Components: search
    Affects Versions: 1.4
         Environment: Linux
            Reporter: Doug Treder


The query URL:

http://hostame:8080/Index_1/select?q=(geh%FCrt)+OR+field_id:0

is correctly parsed and returns a 200 response.

http://hostname:8080/Index_1/select?q=(geh%FCrt)

is incorrectly parsed and returns a 400 response:

org.apache.lucene.queryParser.ParseException: Cannot parse '(geh�': Encountered "<EOF>" at line 1, column 5. Was expecting one of: <AND> ... <OR> ... <NOT> ... "+" ... "-" ... "(" ... ")" ... "*" ... "^" ... <QUOTED> ... <TERM> ... <FUZZY_SLOP> ... <PREFIXTERM> ... <WILDTERM> ... "[" ... "{" ... <NUMBER> ...

Here is the full exception from catalina.out:

SEVERE: org.apache.solr.common.SolrException: org.apache.lucene.queryParser.ParseException: Cannot parse '(geh?': Encountered "<EOF>" at line 1, column 5.
Was expecting one of:
    <AND> ...
    <OR> ...
    <NOT> ...
    "+" ...
    "-" ...
    "(" ...
    ")" ...
    "*" ...
    "^" ...
    <QUOTED> ...
    <TERM> ...
    <FUZZY_SLOP> ...
    <PREFIXTERM> ...
    <WILDTERM> ...
    "[" ...
    "{" ...
    <NUMBER> ...
    
	at org.apache.solr.handler.component.QueryComponent.prepare(QueryComponent.java:110)
	at com.imdb.solr.handler.component.LoggingSearchHandler.handleRequestBody(LoggingSearchHandler.java:134)
	at org.apache.solr.handler.RequestHandlerBase.handleRequest(RequestHandlerBase.java:131)
	at org.apache.solr.core.SolrCore.execute(SolrCore.java:1330)
	at org.apache.solr.servlet.SolrDispatchFilter.execute(SolrDispatchFilter.java:303)
	at org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilter.java:232)
	at org.apache.catalina.core.ApplicationFilterChain.internalDoFilter(ApplicationFilterChain.java:215)
	at org.apache.catalina.core.ApplicationFilterChain.doFilter(ApplicationFilterChain.java:188)
	at org.apache.catalina.core.StandardWrapperValve.invoke(StandardWrapperValve.java:213)
	at org.apache.catalina.core.StandardContextValve.invoke(StandardContextValve.java:172)
	at org.apache.catalina.valves.AccessLogValve.invoke(AccessLogValve.java:548)
	at org.apache.catalina.core.StandardHostValve.invoke(StandardHostValve.java:127)
	at org.apache.catalina.valves.ErrorReportValve.invoke(ErrorReportValve.java:117)
	at org.apache.catalina.core.StandardEngineValve.invoke(StandardEngineValve.java:108)
	at org.apache.catalina.connector.CoyoteAdapter.service(CoyoteAdapter.java:174)
	at org.apache.coyote.http11.Http11Processor.process(Http11Processor.java:875)
	at org.apache.coyote.http11.Http11BaseProtocol$Http11ConnectionHandler.processConnection(Http11BaseProtocol.java:665)
	at org.apache.tomcat.util.net.PoolTcpEndpoint.processSocket(PoolTcpEndpoint.java:528)
	at org.apache.tomcat.util.net.LeaderFollowerWorkerThread.runIt(LeaderFollowerWorkerThread.java:81)
	at org.apache.tomcat.util.threads.ThreadPool$ControlRunnable.run(ThreadPool.java:689)
	at java.lang.Thread.run(Unknown Source)
Caused by: org.apache.lucene.queryParser.ParseException: Cannot parse '(geh?': Encountered "<EOF>" at line 1, column 5.
Was expecting one of:
    <AND> ...
    <OR> ...
    <NOT> ...
    "+" ...
    "-" ...
    "(" ...
    ")" ...
    "*" ...
    "^" ...
    <QUOTED> ...
    <TERM> ...
    <FUZZY_SLOP> ...
    <PREFIXTERM> ...
    <WILDTERM> ...
    "[" ...
    "{" ...
    <NUMBER> ...
    
	at org.apache.lucene.queryParser.QueryParser.parse(QueryParser.java:177)
	at org.apache.solr.search.LuceneQParser.parse(LuceneQParserPlugin.java:78)
	at org.apache.solr.search.QParser.getQuery(QParser.java:126)
	at org.apache.solr.handler.component.QueryComponent.prepare(QueryComponent.java:91)
	... 20 more



-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (SOLR-1231) query parser fails parsing umlaut character

Posted by "Yonik Seeley (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/SOLR-1231?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12721781#action_12721781 ] 

Yonik Seeley commented on SOLR-1231:
------------------------------------

but %FC is only correct under a latin-1 URL encoding (default for tomcat but not for jetty).  The correct encoding per the RFCs (and for Jetty but not Tomcat by default) would be %C3%BC

> query parser fails parsing umlaut character
> -------------------------------------------
>
>                 Key: SOLR-1231
>                 URL: https://issues.apache.org/jira/browse/SOLR-1231
>             Project: Solr
>          Issue Type: Bug
>          Components: search
>    Affects Versions: 1.4
>         Environment: Linux
>            Reporter: Doug Treder
>
> The query URL:
> http://hostame:8080/Index_1/select?q=(geh%FCrt)+OR+field_id:0
> is correctly parsed and returns a 200 response.
> http://hostname:8080/Index_1/select?q=(geh%FCrt)
> is incorrectly parsed and returns a 400 response:
> org.apache.lucene.queryParser.ParseException: Cannot parse '(geh�': Encountered "<EOF>" at line 1, column 5. Was expecting one of: <AND> ... <OR> ... <NOT> ... "+" ... "-" ... "(" ... ")" ... "*" ... "^" ... <QUOTED> ... <TERM> ... <FUZZY_SLOP> ... <PREFIXTERM> ... <WILDTERM> ... "[" ... "{" ... <NUMBER> ...
> Here is the full exception from catalina.out:
> SEVERE: org.apache.solr.common.SolrException: org.apache.lucene.queryParser.ParseException: Cannot parse '(geh?': Encountered "<EOF>" at line 1, column 5.
> Was expecting one of:
>     <AND> ...
>     <OR> ...
>     <NOT> ...
>     "+" ...
>     "-" ...
>     "(" ...
>     ")" ...
>     "*" ...
>     "^" ...
>     <QUOTED> ...
>     <TERM> ...
>     <FUZZY_SLOP> ...
>     <PREFIXTERM> ...
>     <WILDTERM> ...
>     "[" ...
>     "{" ...
>     <NUMBER> ...
>     
> 	at org.apache.solr.handler.component.QueryComponent.prepare(QueryComponent.java:110)
> 	at com.imdb.solr.handler.component.LoggingSearchHandler.handleRequestBody(LoggingSearchHandler.java:134)
> 	at org.apache.solr.handler.RequestHandlerBase.handleRequest(RequestHandlerBase.java:131)
> 	at org.apache.solr.core.SolrCore.execute(SolrCore.java:1330)
> 	at org.apache.solr.servlet.SolrDispatchFilter.execute(SolrDispatchFilter.java:303)
> 	at org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilter.java:232)
> 	at org.apache.catalina.core.ApplicationFilterChain.internalDoFilter(ApplicationFilterChain.java:215)
> 	at org.apache.catalina.core.ApplicationFilterChain.doFilter(ApplicationFilterChain.java:188)
> 	at org.apache.catalina.core.StandardWrapperValve.invoke(StandardWrapperValve.java:213)
> 	at org.apache.catalina.core.StandardContextValve.invoke(StandardContextValve.java:172)
> 	at org.apache.catalina.valves.AccessLogValve.invoke(AccessLogValve.java:548)
> 	at org.apache.catalina.core.StandardHostValve.invoke(StandardHostValve.java:127)
> 	at org.apache.catalina.valves.ErrorReportValve.invoke(ErrorReportValve.java:117)
> 	at org.apache.catalina.core.StandardEngineValve.invoke(StandardEngineValve.java:108)
> 	at org.apache.catalina.connector.CoyoteAdapter.service(CoyoteAdapter.java:174)
> 	at org.apache.coyote.http11.Http11Processor.process(Http11Processor.java:875)
> 	at org.apache.coyote.http11.Http11BaseProtocol$Http11ConnectionHandler.processConnection(Http11BaseProtocol.java:665)
> 	at org.apache.tomcat.util.net.PoolTcpEndpoint.processSocket(PoolTcpEndpoint.java:528)
> 	at org.apache.tomcat.util.net.LeaderFollowerWorkerThread.runIt(LeaderFollowerWorkerThread.java:81)
> 	at org.apache.tomcat.util.threads.ThreadPool$ControlRunnable.run(ThreadPool.java:689)
> 	at java.lang.Thread.run(Unknown Source)
> Caused by: org.apache.lucene.queryParser.ParseException: Cannot parse '(geh?': Encountered "<EOF>" at line 1, column 5.
> Was expecting one of:
>     <AND> ...
>     <OR> ...
>     <NOT> ...
>     "+" ...
>     "-" ...
>     "(" ...
>     ")" ...
>     "*" ...
>     "^" ...
>     <QUOTED> ...
>     <TERM> ...
>     <FUZZY_SLOP> ...
>     <PREFIXTERM> ...
>     <WILDTERM> ...
>     "[" ...
>     "{" ...
>     <NUMBER> ...
>     
> 	at org.apache.lucene.queryParser.QueryParser.parse(QueryParser.java:177)
> 	at org.apache.solr.search.LuceneQParser.parse(LuceneQParserPlugin.java:78)
> 	at org.apache.solr.search.QParser.getQuery(QParser.java:126)
> 	at org.apache.solr.handler.component.QueryComponent.prepare(QueryComponent.java:91)
> 	... 20 more

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (SOLR-1231) query parser fails parsing umlaut character

Posted by "Robert Muir (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/SOLR-1231?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12721814#action_12721814 ] 

Robert Muir commented on SOLR-1231:
-----------------------------------

oops you are right, ignore what i said :)

> query parser fails parsing umlaut character
> -------------------------------------------
>
>                 Key: SOLR-1231
>                 URL: https://issues.apache.org/jira/browse/SOLR-1231
>             Project: Solr
>          Issue Type: Bug
>          Components: search
>    Affects Versions: 1.4
>         Environment: Linux
>            Reporter: Doug Treder
>
> The query URL:
> http://hostame:8080/Index_1/select?q=(geh%FCrt)+OR+field_id:0
> is correctly parsed and returns a 200 response.
> http://hostname:8080/Index_1/select?q=(geh%FCrt)
> is incorrectly parsed and returns a 400 response:
> org.apache.lucene.queryParser.ParseException: Cannot parse '(geh�': Encountered "<EOF>" at line 1, column 5. Was expecting one of: <AND> ... <OR> ... <NOT> ... "+" ... "-" ... "(" ... ")" ... "*" ... "^" ... <QUOTED> ... <TERM> ... <FUZZY_SLOP> ... <PREFIXTERM> ... <WILDTERM> ... "[" ... "{" ... <NUMBER> ...
> Here is the full exception from catalina.out:
> SEVERE: org.apache.solr.common.SolrException: org.apache.lucene.queryParser.ParseException: Cannot parse '(geh?': Encountered "<EOF>" at line 1, column 5.
> Was expecting one of:
>     <AND> ...
>     <OR> ...
>     <NOT> ...
>     "+" ...
>     "-" ...
>     "(" ...
>     ")" ...
>     "*" ...
>     "^" ...
>     <QUOTED> ...
>     <TERM> ...
>     <FUZZY_SLOP> ...
>     <PREFIXTERM> ...
>     <WILDTERM> ...
>     "[" ...
>     "{" ...
>     <NUMBER> ...
>     
> 	at org.apache.solr.handler.component.QueryComponent.prepare(QueryComponent.java:110)
> 	at com.imdb.solr.handler.component.LoggingSearchHandler.handleRequestBody(LoggingSearchHandler.java:134)
> 	at org.apache.solr.handler.RequestHandlerBase.handleRequest(RequestHandlerBase.java:131)
> 	at org.apache.solr.core.SolrCore.execute(SolrCore.java:1330)
> 	at org.apache.solr.servlet.SolrDispatchFilter.execute(SolrDispatchFilter.java:303)
> 	at org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilter.java:232)
> 	at org.apache.catalina.core.ApplicationFilterChain.internalDoFilter(ApplicationFilterChain.java:215)
> 	at org.apache.catalina.core.ApplicationFilterChain.doFilter(ApplicationFilterChain.java:188)
> 	at org.apache.catalina.core.StandardWrapperValve.invoke(StandardWrapperValve.java:213)
> 	at org.apache.catalina.core.StandardContextValve.invoke(StandardContextValve.java:172)
> 	at org.apache.catalina.valves.AccessLogValve.invoke(AccessLogValve.java:548)
> 	at org.apache.catalina.core.StandardHostValve.invoke(StandardHostValve.java:127)
> 	at org.apache.catalina.valves.ErrorReportValve.invoke(ErrorReportValve.java:117)
> 	at org.apache.catalina.core.StandardEngineValve.invoke(StandardEngineValve.java:108)
> 	at org.apache.catalina.connector.CoyoteAdapter.service(CoyoteAdapter.java:174)
> 	at org.apache.coyote.http11.Http11Processor.process(Http11Processor.java:875)
> 	at org.apache.coyote.http11.Http11BaseProtocol$Http11ConnectionHandler.processConnection(Http11BaseProtocol.java:665)
> 	at org.apache.tomcat.util.net.PoolTcpEndpoint.processSocket(PoolTcpEndpoint.java:528)
> 	at org.apache.tomcat.util.net.LeaderFollowerWorkerThread.runIt(LeaderFollowerWorkerThread.java:81)
> 	at org.apache.tomcat.util.threads.ThreadPool$ControlRunnable.run(ThreadPool.java:689)
> 	at java.lang.Thread.run(Unknown Source)
> Caused by: org.apache.lucene.queryParser.ParseException: Cannot parse '(geh?': Encountered "<EOF>" at line 1, column 5.
> Was expecting one of:
>     <AND> ...
>     <OR> ...
>     <NOT> ...
>     "+" ...
>     "-" ...
>     "(" ...
>     ")" ...
>     "*" ...
>     "^" ...
>     <QUOTED> ...
>     <TERM> ...
>     <FUZZY_SLOP> ...
>     <PREFIXTERM> ...
>     <WILDTERM> ...
>     "[" ...
>     "{" ...
>     <NUMBER> ...
>     
> 	at org.apache.lucene.queryParser.QueryParser.parse(QueryParser.java:177)
> 	at org.apache.solr.search.LuceneQParser.parse(LuceneQParserPlugin.java:78)
> 	at org.apache.solr.search.QParser.getQuery(QParser.java:126)
> 	at org.apache.solr.handler.component.QueryComponent.prepare(QueryComponent.java:91)
> 	... 20 more

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (SOLR-1231) query parser fails parsing umlaut character

Posted by "Robert Muir (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/SOLR-1231?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12721398#action_12721398 ] 

Robert Muir commented on SOLR-1231:
-----------------------------------

expanding on what yonik says, looks like the servlet container to me... the url is correctly encoded (00FC = u with umlaut), but incorrectly interpreted as (FFFD = replacement char)


> query parser fails parsing umlaut character
> -------------------------------------------
>
>                 Key: SOLR-1231
>                 URL: https://issues.apache.org/jira/browse/SOLR-1231
>             Project: Solr
>          Issue Type: Bug
>          Components: search
>    Affects Versions: 1.4
>         Environment: Linux
>            Reporter: Doug Treder
>
> The query URL:
> http://hostame:8080/Index_1/select?q=(geh%FCrt)+OR+field_id:0
> is correctly parsed and returns a 200 response.
> http://hostname:8080/Index_1/select?q=(geh%FCrt)
> is incorrectly parsed and returns a 400 response:
> org.apache.lucene.queryParser.ParseException: Cannot parse '(geh�': Encountered "<EOF>" at line 1, column 5. Was expecting one of: <AND> ... <OR> ... <NOT> ... "+" ... "-" ... "(" ... ")" ... "*" ... "^" ... <QUOTED> ... <TERM> ... <FUZZY_SLOP> ... <PREFIXTERM> ... <WILDTERM> ... "[" ... "{" ... <NUMBER> ...
> Here is the full exception from catalina.out:
> SEVERE: org.apache.solr.common.SolrException: org.apache.lucene.queryParser.ParseException: Cannot parse '(geh?': Encountered "<EOF>" at line 1, column 5.
> Was expecting one of:
>     <AND> ...
>     <OR> ...
>     <NOT> ...
>     "+" ...
>     "-" ...
>     "(" ...
>     ")" ...
>     "*" ...
>     "^" ...
>     <QUOTED> ...
>     <TERM> ...
>     <FUZZY_SLOP> ...
>     <PREFIXTERM> ...
>     <WILDTERM> ...
>     "[" ...
>     "{" ...
>     <NUMBER> ...
>     
> 	at org.apache.solr.handler.component.QueryComponent.prepare(QueryComponent.java:110)
> 	at com.imdb.solr.handler.component.LoggingSearchHandler.handleRequestBody(LoggingSearchHandler.java:134)
> 	at org.apache.solr.handler.RequestHandlerBase.handleRequest(RequestHandlerBase.java:131)
> 	at org.apache.solr.core.SolrCore.execute(SolrCore.java:1330)
> 	at org.apache.solr.servlet.SolrDispatchFilter.execute(SolrDispatchFilter.java:303)
> 	at org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilter.java:232)
> 	at org.apache.catalina.core.ApplicationFilterChain.internalDoFilter(ApplicationFilterChain.java:215)
> 	at org.apache.catalina.core.ApplicationFilterChain.doFilter(ApplicationFilterChain.java:188)
> 	at org.apache.catalina.core.StandardWrapperValve.invoke(StandardWrapperValve.java:213)
> 	at org.apache.catalina.core.StandardContextValve.invoke(StandardContextValve.java:172)
> 	at org.apache.catalina.valves.AccessLogValve.invoke(AccessLogValve.java:548)
> 	at org.apache.catalina.core.StandardHostValve.invoke(StandardHostValve.java:127)
> 	at org.apache.catalina.valves.ErrorReportValve.invoke(ErrorReportValve.java:117)
> 	at org.apache.catalina.core.StandardEngineValve.invoke(StandardEngineValve.java:108)
> 	at org.apache.catalina.connector.CoyoteAdapter.service(CoyoteAdapter.java:174)
> 	at org.apache.coyote.http11.Http11Processor.process(Http11Processor.java:875)
> 	at org.apache.coyote.http11.Http11BaseProtocol$Http11ConnectionHandler.processConnection(Http11BaseProtocol.java:665)
> 	at org.apache.tomcat.util.net.PoolTcpEndpoint.processSocket(PoolTcpEndpoint.java:528)
> 	at org.apache.tomcat.util.net.LeaderFollowerWorkerThread.runIt(LeaderFollowerWorkerThread.java:81)
> 	at org.apache.tomcat.util.threads.ThreadPool$ControlRunnable.run(ThreadPool.java:689)
> 	at java.lang.Thread.run(Unknown Source)
> Caused by: org.apache.lucene.queryParser.ParseException: Cannot parse '(geh?': Encountered "<EOF>" at line 1, column 5.
> Was expecting one of:
>     <AND> ...
>     <OR> ...
>     <NOT> ...
>     "+" ...
>     "-" ...
>     "(" ...
>     ")" ...
>     "*" ...
>     "^" ...
>     <QUOTED> ...
>     <TERM> ...
>     <FUZZY_SLOP> ...
>     <PREFIXTERM> ...
>     <WILDTERM> ...
>     "[" ...
>     "{" ...
>     <NUMBER> ...
>     
> 	at org.apache.lucene.queryParser.QueryParser.parse(QueryParser.java:177)
> 	at org.apache.solr.search.LuceneQParser.parse(LuceneQParserPlugin.java:78)
> 	at org.apache.solr.search.QParser.getQuery(QParser.java:126)
> 	at org.apache.solr.handler.component.QueryComponent.prepare(QueryComponent.java:91)
> 	... 20 more

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (SOLR-1231) query parser fails parsing umlaut character

Posted by "Yonik Seeley (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/SOLR-1231?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12721388#action_12721388 ] 

Yonik Seeley commented on SOLR-1231:
------------------------------------

What are you searching for?  It's not clear that the URL is correctly encoded.
If you are running with something other than the included jetty server, you may want to check that your servlet container is correctly set up too - try running example/exampledocs/test_utf8.sh to see.


> query parser fails parsing umlaut character
> -------------------------------------------
>
>                 Key: SOLR-1231
>                 URL: https://issues.apache.org/jira/browse/SOLR-1231
>             Project: Solr
>          Issue Type: Bug
>          Components: search
>    Affects Versions: 1.4
>         Environment: Linux
>            Reporter: Doug Treder
>
> The query URL:
> http://hostame:8080/Index_1/select?q=(geh%FCrt)+OR+field_id:0
> is correctly parsed and returns a 200 response.
> http://hostname:8080/Index_1/select?q=(geh%FCrt)
> is incorrectly parsed and returns a 400 response:
> org.apache.lucene.queryParser.ParseException: Cannot parse '(geh�': Encountered "<EOF>" at line 1, column 5. Was expecting one of: <AND> ... <OR> ... <NOT> ... "+" ... "-" ... "(" ... ")" ... "*" ... "^" ... <QUOTED> ... <TERM> ... <FUZZY_SLOP> ... <PREFIXTERM> ... <WILDTERM> ... "[" ... "{" ... <NUMBER> ...
> Here is the full exception from catalina.out:
> SEVERE: org.apache.solr.common.SolrException: org.apache.lucene.queryParser.ParseException: Cannot parse '(geh?': Encountered "<EOF>" at line 1, column 5.
> Was expecting one of:
>     <AND> ...
>     <OR> ...
>     <NOT> ...
>     "+" ...
>     "-" ...
>     "(" ...
>     ")" ...
>     "*" ...
>     "^" ...
>     <QUOTED> ...
>     <TERM> ...
>     <FUZZY_SLOP> ...
>     <PREFIXTERM> ...
>     <WILDTERM> ...
>     "[" ...
>     "{" ...
>     <NUMBER> ...
>     
> 	at org.apache.solr.handler.component.QueryComponent.prepare(QueryComponent.java:110)
> 	at com.imdb.solr.handler.component.LoggingSearchHandler.handleRequestBody(LoggingSearchHandler.java:134)
> 	at org.apache.solr.handler.RequestHandlerBase.handleRequest(RequestHandlerBase.java:131)
> 	at org.apache.solr.core.SolrCore.execute(SolrCore.java:1330)
> 	at org.apache.solr.servlet.SolrDispatchFilter.execute(SolrDispatchFilter.java:303)
> 	at org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilter.java:232)
> 	at org.apache.catalina.core.ApplicationFilterChain.internalDoFilter(ApplicationFilterChain.java:215)
> 	at org.apache.catalina.core.ApplicationFilterChain.doFilter(ApplicationFilterChain.java:188)
> 	at org.apache.catalina.core.StandardWrapperValve.invoke(StandardWrapperValve.java:213)
> 	at org.apache.catalina.core.StandardContextValve.invoke(StandardContextValve.java:172)
> 	at org.apache.catalina.valves.AccessLogValve.invoke(AccessLogValve.java:548)
> 	at org.apache.catalina.core.StandardHostValve.invoke(StandardHostValve.java:127)
> 	at org.apache.catalina.valves.ErrorReportValve.invoke(ErrorReportValve.java:117)
> 	at org.apache.catalina.core.StandardEngineValve.invoke(StandardEngineValve.java:108)
> 	at org.apache.catalina.connector.CoyoteAdapter.service(CoyoteAdapter.java:174)
> 	at org.apache.coyote.http11.Http11Processor.process(Http11Processor.java:875)
> 	at org.apache.coyote.http11.Http11BaseProtocol$Http11ConnectionHandler.processConnection(Http11BaseProtocol.java:665)
> 	at org.apache.tomcat.util.net.PoolTcpEndpoint.processSocket(PoolTcpEndpoint.java:528)
> 	at org.apache.tomcat.util.net.LeaderFollowerWorkerThread.runIt(LeaderFollowerWorkerThread.java:81)
> 	at org.apache.tomcat.util.threads.ThreadPool$ControlRunnable.run(ThreadPool.java:689)
> 	at java.lang.Thread.run(Unknown Source)
> Caused by: org.apache.lucene.queryParser.ParseException: Cannot parse '(geh?': Encountered "<EOF>" at line 1, column 5.
> Was expecting one of:
>     <AND> ...
>     <OR> ...
>     <NOT> ...
>     "+" ...
>     "-" ...
>     "(" ...
>     ")" ...
>     "*" ...
>     "^" ...
>     <QUOTED> ...
>     <TERM> ...
>     <FUZZY_SLOP> ...
>     <PREFIXTERM> ...
>     <WILDTERM> ...
>     "[" ...
>     "{" ...
>     <NUMBER> ...
>     
> 	at org.apache.lucene.queryParser.QueryParser.parse(QueryParser.java:177)
> 	at org.apache.solr.search.LuceneQParser.parse(LuceneQParserPlugin.java:78)
> 	at org.apache.solr.search.QParser.getQuery(QParser.java:126)
> 	at org.apache.solr.handler.component.QueryComponent.prepare(QueryComponent.java:91)
> 	... 20 more

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Resolved: (SOLR-1231) query parser fails parsing umlaut character

Posted by "Hoss Man (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/SOLR-1231?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Hoss Man resolved SOLR-1231.
----------------------------

    Resolution: Invalid

per comments, appears to be an incompatibility between URL encoding strategy and servlet container configuration ... not a solr bug.

please reopen with more details about your bug ... preferably including the output from example/exampledocs/test_utf8.sh

> query parser fails parsing umlaut character
> -------------------------------------------
>
>                 Key: SOLR-1231
>                 URL: https://issues.apache.org/jira/browse/SOLR-1231
>             Project: Solr
>          Issue Type: Bug
>          Components: search
>    Affects Versions: 1.4
>         Environment: Linux
>            Reporter: Doug Treder
>
> The query URL:
> http://hostame:8080/Index_1/select?q=(geh%FCrt)+OR+field_id:0
> is correctly parsed and returns a 200 response.
> http://hostname:8080/Index_1/select?q=(geh%FCrt)
> is incorrectly parsed and returns a 400 response:
> org.apache.lucene.queryParser.ParseException: Cannot parse '(geh�': Encountered "<EOF>" at line 1, column 5. Was expecting one of: <AND> ... <OR> ... <NOT> ... "+" ... "-" ... "(" ... ")" ... "*" ... "^" ... <QUOTED> ... <TERM> ... <FUZZY_SLOP> ... <PREFIXTERM> ... <WILDTERM> ... "[" ... "{" ... <NUMBER> ...
> Here is the full exception from catalina.out:
> SEVERE: org.apache.solr.common.SolrException: org.apache.lucene.queryParser.ParseException: Cannot parse '(geh?': Encountered "<EOF>" at line 1, column 5.
> Was expecting one of:
>     <AND> ...
>     <OR> ...
>     <NOT> ...
>     "+" ...
>     "-" ...
>     "(" ...
>     ")" ...
>     "*" ...
>     "^" ...
>     <QUOTED> ...
>     <TERM> ...
>     <FUZZY_SLOP> ...
>     <PREFIXTERM> ...
>     <WILDTERM> ...
>     "[" ...
>     "{" ...
>     <NUMBER> ...
>     
> 	at org.apache.solr.handler.component.QueryComponent.prepare(QueryComponent.java:110)
> 	at com.imdb.solr.handler.component.LoggingSearchHandler.handleRequestBody(LoggingSearchHandler.java:134)
> 	at org.apache.solr.handler.RequestHandlerBase.handleRequest(RequestHandlerBase.java:131)
> 	at org.apache.solr.core.SolrCore.execute(SolrCore.java:1330)
> 	at org.apache.solr.servlet.SolrDispatchFilter.execute(SolrDispatchFilter.java:303)
> 	at org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilter.java:232)
> 	at org.apache.catalina.core.ApplicationFilterChain.internalDoFilter(ApplicationFilterChain.java:215)
> 	at org.apache.catalina.core.ApplicationFilterChain.doFilter(ApplicationFilterChain.java:188)
> 	at org.apache.catalina.core.StandardWrapperValve.invoke(StandardWrapperValve.java:213)
> 	at org.apache.catalina.core.StandardContextValve.invoke(StandardContextValve.java:172)
> 	at org.apache.catalina.valves.AccessLogValve.invoke(AccessLogValve.java:548)
> 	at org.apache.catalina.core.StandardHostValve.invoke(StandardHostValve.java:127)
> 	at org.apache.catalina.valves.ErrorReportValve.invoke(ErrorReportValve.java:117)
> 	at org.apache.catalina.core.StandardEngineValve.invoke(StandardEngineValve.java:108)
> 	at org.apache.catalina.connector.CoyoteAdapter.service(CoyoteAdapter.java:174)
> 	at org.apache.coyote.http11.Http11Processor.process(Http11Processor.java:875)
> 	at org.apache.coyote.http11.Http11BaseProtocol$Http11ConnectionHandler.processConnection(Http11BaseProtocol.java:665)
> 	at org.apache.tomcat.util.net.PoolTcpEndpoint.processSocket(PoolTcpEndpoint.java:528)
> 	at org.apache.tomcat.util.net.LeaderFollowerWorkerThread.runIt(LeaderFollowerWorkerThread.java:81)
> 	at org.apache.tomcat.util.threads.ThreadPool$ControlRunnable.run(ThreadPool.java:689)
> 	at java.lang.Thread.run(Unknown Source)
> Caused by: org.apache.lucene.queryParser.ParseException: Cannot parse '(geh?': Encountered "<EOF>" at line 1, column 5.
> Was expecting one of:
>     <AND> ...
>     <OR> ...
>     <NOT> ...
>     "+" ...
>     "-" ...
>     "(" ...
>     ")" ...
>     "*" ...
>     "^" ...
>     <QUOTED> ...
>     <TERM> ...
>     <FUZZY_SLOP> ...
>     <PREFIXTERM> ...
>     <WILDTERM> ...
>     "[" ...
>     "{" ...
>     <NUMBER> ...
>     
> 	at org.apache.lucene.queryParser.QueryParser.parse(QueryParser.java:177)
> 	at org.apache.solr.search.LuceneQParser.parse(LuceneQParserPlugin.java:78)
> 	at org.apache.solr.search.QParser.getQuery(QParser.java:126)
> 	at org.apache.solr.handler.component.QueryComponent.prepare(QueryComponent.java:91)
> 	... 20 more

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.