You are viewing a plain text version of this content. The canonical link for it is here.
Posted to solr-user@lucene.apache.org by Xue-Feng Yang <ju...@yahoo.com> on 2011/09/16 23:10:48 UTC

solr unicode problem in query page

I just started to test solr in Chinese content. In the admin page, I could use *:* and saw the correct result. However, when I tried some specific search, I came to no results.  


I don't know what the problem is. Any thought?


By the way, I have a few cores running in English without encoding problem. And in my testing application server, I have some applications which is OK with Chinese. 

Re: solr unicode problem in query page

Posted by Xue-Feng Yang <ju...@yahoo.com>.
Just a reminder: two returns contains different uni-code while my input doesn't change. You can see this in the following two messages.



________________________________
From: Xue-Feng Yang <ju...@yahoo.com>
To: "solr-user@lucene.apache.org" <so...@lucene.apache.org>
Sent: Saturday, September 17, 2011 9:18:54 AM
Subject: Re: solr unicode problem in query page

Here is the return by use of http://localhost:8080/solr/cuv/select/?q=content%3A%C9%CF%B5%DB&version=2.2&start=0&rows=10&indent=on&debugQuery=true

<response><lst name="responseHeader"><int name="status">0</int><int name="QTime">11</int><lst name="params"><str name="q">content:ÉϵÛ</str><str name="version">2.2</str><str name="start">0</str><str name="rows">10</str><str name="indent">on</str><str name="debugQuery">true</str></lst></lst><result name="response" numFound="0" start="0"/><lst name="debug"><str name="rawquerystring">content:ÉϵÛ</str><str name="querystring">content:ÉϵÛ</str><str name="parsedquery"/><str name="parsedquery_toString"/><lst name="explain"/><str name="QParser">LuceneQParser</str><lst name="timing"><double name="time">6.0</double><lst name="prepare"><double name="time">3.0</double><lst name="org.apache.solr.handler.component.QueryComponent"><double name="time">0.0</double></lst><lst name="org.apache.solr.handler.component.FacetComponent"><double name="time">0.0</double></lst><lst name="org.apache.solr.handler.component.MoreLikeThisComponent"><double
name="time">0.0</double></lst><lst name="org.apache.solr.handler.component.HighlightComponent"><double name="time">0.0</double></lst><lst name="org.apache.solr.handler.component.StatsComponent"><double name="time">0.0</double></lst><lst name="org.apache.solr.handler.component.DebugComponent"><double name="time">0.0</double></lst></lst><lst name="process"><double name="time">1.0</double><lst name="org.apache.solr.handler.component.QueryComponent"><double name="time">1.0</double></lst><lst name="org.apache.solr.handler.component.FacetComponent"><double name="time">0.0</double></lst><lst name="org.apache.solr.handler.component.MoreLikeThisComponent"><double name="time">0.0</double></lst><lst name="org.apache.solr.handler.component.HighlightComponent"><double name="time">0.0</double></lst><lst name="org.apache.solr.handler.component.StatsComponent"><double name="time">0.0</double></lst><lst name="org.apache.solr.handler.component.DebugComponent"><double
name="time">0.0</double></lst></lst></lst></lst></response>

Basically, it doesn't say anything.


________________________________
From: Xue-Feng Yang <ju...@yahoo.com>
To: "solr-user@lucene.apache.org" <so...@lucene.apache.org>
Sent: Saturday, September 17, 2011 9:14:24 AM
Subject: Re: solr unicode problem in query page

Thanks-谢谢-Erik.

Actually, I just played with http://localhost:8080/solr/cuv/admin/

cuv is the Chinese Union version of Bible. I inputted "content:上帝" in Query String without quotations. It then directs to

http://localhost:8080/solr/cuv/select/?q=content%3A%E4%B8%8A%E5%B8%9D&version=2.2&start=0&rows=10&indent=on

The response XML is

<response><lst name="responseHeader"><int name="status">0</int><int name="QTime">1</int><lst name="params"><str name="q">content:ä¸å¸</str><str name="version">2.2</str><str name="start">0</str><str name="rows">10</str><str name="indent">on</str></lst></lst><result name="response" numFound="0" start="0"/></response>

There are no results. 


In addition to this, the query "*:*" returns correct Chinese. On the other hand, I have 26 versions of Bible in English which are OK so far. I also have some other applications in Chinese in the same glassfish domain. They don't have any trouble in language presentation.


Best,

Xue-Feng







________________________________
From: Erik Hatcher <er...@gmail.com>
To: solr-user@lucene.apache.org
Sent: Saturday, September 17, 2011 8:43:15 AM
Subject: Re: solr unicode problem in query page

Xue-Feng - 你好 - 

You'll need to give us some more specifics.  *:* will always work, but searching on other strings is entirely dependent on your configuration and the exact query.  For us to help, you'll need to share your full query (debugQuery=true helps here) and the related field(s)/type(s) that are being queried.

    Erik

On Sep 16, 2011, at 17:10 , Xue-Feng Yang wrote:

> I just started to test solr in Chinese content. In the admin page, I could use *:* and saw the correct result. However, when I tried some specific search, I came to no results.  
> 
> 
> I don't know what the problem is. Any thought?
> 
> 
> By the way, I have a few cores running in English without encoding problem. And in my testing application server, I have some applications which is OK with Chinese. 

Re: solr unicode problem in query page

Posted by Xue-Feng Yang <ju...@yahoo.com>.
Here is the return by use of http://localhost:8080/solr/cuv/select/?q=content%3A%C9%CF%B5%DB&version=2.2&start=0&rows=10&indent=on&debugQuery=true

<response><lst name="responseHeader"><int name="status">0</int><int name="QTime">11</int><lst name="params"><str name="q">content:ÉϵÛ</str><str name="version">2.2</str><str name="start">0</str><str name="rows">10</str><str name="indent">on</str><str name="debugQuery">true</str></lst></lst><result name="response" numFound="0" start="0"/><lst name="debug"><str name="rawquerystring">content:ÉϵÛ</str><str name="querystring">content:ÉϵÛ</str><str name="parsedquery"/><str name="parsedquery_toString"/><lst name="explain"/><str name="QParser">LuceneQParser</str><lst name="timing"><double name="time">6.0</double><lst name="prepare"><double name="time">3.0</double><lst name="org.apache.solr.handler.component.QueryComponent"><double name="time">0.0</double></lst><lst name="org.apache.solr.handler.component.FacetComponent"><double name="time">0.0</double></lst><lst name="org.apache.solr.handler.component.MoreLikeThisComponent"><double
 name="time">0.0</double></lst><lst name="org.apache.solr.handler.component.HighlightComponent"><double name="time">0.0</double></lst><lst name="org.apache.solr.handler.component.StatsComponent"><double name="time">0.0</double></lst><lst name="org.apache.solr.handler.component.DebugComponent"><double name="time">0.0</double></lst></lst><lst name="process"><double name="time">1.0</double><lst name="org.apache.solr.handler.component.QueryComponent"><double name="time">1.0</double></lst><lst name="org.apache.solr.handler.component.FacetComponent"><double name="time">0.0</double></lst><lst name="org.apache.solr.handler.component.MoreLikeThisComponent"><double name="time">0.0</double></lst><lst name="org.apache.solr.handler.component.HighlightComponent"><double name="time">0.0</double></lst><lst name="org.apache.solr.handler.component.StatsComponent"><double name="time">0.0</double></lst><lst name="org.apache.solr.handler.component.DebugComponent"><double
 name="time">0.0</double></lst></lst></lst></lst></response>

Basically, it doesn't say anything.


________________________________
From: Xue-Feng Yang <ju...@yahoo.com>
To: "solr-user@lucene.apache.org" <so...@lucene.apache.org>
Sent: Saturday, September 17, 2011 9:14:24 AM
Subject: Re: solr unicode problem in query page

Thanks-谢谢-Erik.

Actually, I just played with http://localhost:8080/solr/cuv/admin/

cuv is the Chinese Union version of Bible. I inputted "content:上帝" in Query String without quotations. It then directs to

http://localhost:8080/solr/cuv/select/?q=content%3A%E4%B8%8A%E5%B8%9D&version=2.2&start=0&rows=10&indent=on

The response XML is

<response><lst name="responseHeader"><int name="status">0</int><int name="QTime">1</int><lst name="params"><str name="q">content:ä¸å¸</str><str name="version">2.2</str><str name="start">0</str><str name="rows">10</str><str name="indent">on</str></lst></lst><result name="response" numFound="0" start="0"/></response>

There are no results. 


In addition to this, the query "*:*" returns correct Chinese. On the other hand, I have 26 versions of Bible in English which are OK so far. I also have some other applications in Chinese in the same glassfish domain. They don't have any trouble in language presentation.


Best,

Xue-Feng







________________________________
From: Erik Hatcher <er...@gmail.com>
To: solr-user@lucene.apache.org
Sent: Saturday, September 17, 2011 8:43:15 AM
Subject: Re: solr unicode problem in query page

Xue-Feng - 你好 - 

You'll need to give us some more specifics.  *:* will always work, but searching on other strings is entirely dependent on your configuration and the exact query.  For us to help, you'll need to share your full query (debugQuery=true helps here) and the related field(s)/type(s) that are being queried.

    Erik

On Sep 16, 2011, at 17:10 , Xue-Feng Yang wrote:

> I just started to test solr in Chinese content. In the admin page, I could use *:* and saw the correct result. However, when I tried some specific search, I came to no results.  
> 
> 
> I don't know what the problem is. Any thought?
> 
> 
> By the way, I have a few cores running in English without encoding problem. And in my testing application server, I have some applications which is OK with Chinese. 

Re: solr unicode problem in query page

Posted by Xue-Feng Yang <ju...@yahoo.com>.
Thanks-谢谢-Erik.

Actually, I just played with http://localhost:8080/solr/cuv/admin/

cuv is the Chinese Union version of Bible. I inputted "content:上帝" in Query String without quotations. It then directs to

http://localhost:8080/solr/cuv/select/?q=content%3A%E4%B8%8A%E5%B8%9D&version=2.2&start=0&rows=10&indent=on

The response XML is

<response><lst name="responseHeader"><int name="status">0</int><int name="QTime">1</int><lst name="params"><str name="q">content:ä¸å¸</str><str name="version">2.2</str><str name="start">0</str><str name="rows">10</str><str name="indent">on</str></lst></lst><result name="response" numFound="0" start="0"/></response>

There are no results. 


In addition to this, the query "*:*" returns correct Chinese. On the other hand, I have 26 versions of Bible in English which are OK so far. I also have some other applications in Chinese in the same glassfish domain. They don't have any trouble in language presentation.


Best,

Xue-Feng







________________________________
From: Erik Hatcher <er...@gmail.com>
To: solr-user@lucene.apache.org
Sent: Saturday, September 17, 2011 8:43:15 AM
Subject: Re: solr unicode problem in query page

Xue-Feng - 你好 - 

You'll need to give us some more specifics.  *:* will always work, but searching on other strings is entirely dependent on your configuration and the exact query.  For us to help, you'll need to share your full query (debugQuery=true helps here) and the related field(s)/type(s) that are being queried.

    Erik

On Sep 16, 2011, at 17:10 , Xue-Feng Yang wrote:

> I just started to test solr in Chinese content. In the admin page, I could use *:* and saw the correct result. However, when I tried some specific search, I came to no results.  
> 
> 
> I don't know what the problem is. Any thought?
> 
> 
> By the way, I have a few cores running in English without encoding problem. And in my testing application server, I have some applications which is OK with Chinese. 

Re: solr unicode problem in query page

Posted by Erik Hatcher <er...@gmail.com>.
Xue-Feng - 你好 - 

You'll need to give us some more specifics.  *:* will always work, but searching on other strings is entirely dependent on your configuration and the exact query.  For us to help, you'll need to share your full query (debugQuery=true helps here) and the related field(s)/type(s) that are being queried.

	Erik

On Sep 16, 2011, at 17:10 , Xue-Feng Yang wrote:

> I just started to test solr in Chinese content. In the admin page, I could use *:* and saw the correct result. However, when I tried some specific search, I came to no results.  
> 
> 
> I don't know what the problem is. Any thought?
> 
> 
> By the way, I have a few cores running in English without encoding problem. And in my testing application server, I have some applications which is OK with Chinese.