Posted to solr-user@lucene.apache.org by György Frivolt <gy...@gmail.com> on 2009/11/26 14:54:20 UTC

SolrException caused by illegal character

Hi,
    I upgraded to Solr 1.4 and tried to reindex my data. After a few
thousand documents had been reindexed, an exception was thrown. I did not
see this with 1.3 before. Do you have any idea what caused the problem?
Thanks.

SEVERE: org.apache.solr.common.SolrException: Illegal character
((CTRL-CHAR, code 3))
 at [row,col {unknown-source}]: [6495,39]
	at org.apache.solr.handler.XMLLoader.load(XMLLoader.java:72)
	at org.apache.solr.handler.ContentStreamHandlerBase.handleRequestBody(ContentStreamHandlerBase.java:54)
	at org.apache.solr.handler.RequestHandlerBase.handleRequest(RequestHandlerBase.java:131)
	at org.apache.solr.core.SolrCore.execute(SolrCore.java:1316)
	at org.apache.solr.servlet.SolrDispatchFilter.execute(SolrDispatchFilter.java:338)
	at org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilter.java:241)
	at org.mortbay.jetty.servlet.ServletHandler$CachedChain.doFilter(ServletHandler.java:1089)
	at org.mortbay.jetty.servlet.ServletHandler.handle(ServletHandler.java:365)
	at org.mortbay.jetty.security.SecurityHandler.handle(SecurityHandler.java:216)
	at org.mortbay.jetty.servlet.SessionHandler.handle(SessionHandler.java:181)
	at org.mortbay.jetty.handler.ContextHandler.handle(ContextHandler.java:712)
	at org.mortbay.jetty.webapp.WebAppContext.handle(WebAppContext.java:405)
	at org.mortbay.jetty.handler.ContextHandlerCollection.handle(ContextHandlerCollection.java:211)
	at org.mortbay.jetty.handler.HandlerCollection.handle(HandlerCollection.java:114)
	at org.mortbay.jetty.handler.HandlerWrapper.handle(HandlerWrapper.java:139)
	at org.mortbay.jetty.Server.handle(Server.java:285)
	at org.mortbay.jetty.HttpConnection.handleRequest(HttpConnection.java:502)
	at org.mortbay.jetty.HttpConnection$RequestHandler.content(HttpConnection.java:835)
	at org.mortbay.jetty.HttpParser.parseNext(HttpParser.java:641)
	at org.mortbay.jetty.HttpParser.parseAvailable(HttpParser.java:208)
	at org.mortbay.jetty.HttpConnection.handle(HttpConnection.java:378)
	at org.mortbay.jetty.bio.SocketConnector$Connection.run(SocketConnector.java:226)
	at org.mortbay.thread.BoundedThreadPool$PoolThread.run(BoundedThreadPool.java:442)
Caused by: com.ctc.wstx.exc.WstxUnexpectedCharException: Illegal
character ((CTRL-CHAR, code 3))
 at [row,col {unknown-source}]: [6495,39]
	at com.ctc.wstx.sr.StreamScanner.throwInvalidSpace(StreamScanner.java:675)
	at com.ctc.wstx.sr.BasicStreamReader.readTextPrimary(BasicStreamReader.java:4556)
	at com.ctc.wstx.sr.BasicStreamReader.nextFromTree(BasicStreamReader.java:2888)
	at com.ctc.wstx.sr.BasicStreamReader.next(BasicStreamReader.java:1019)
	at org.apache.solr.handler.XMLLoader.readDoc(XMLLoader.java:273)
	at org.apache.solr.handler.XMLLoader.processUpdate(XMLLoader.java:138)
	at org.apache.solr.handler.XMLLoader.load(XMLLoader.java:69)
	... 22 more

Re: SolrException caused by illegal character

Posted by György Frivolt <fi...@gmail.com>.
Thanks, I found that out as well: I had to filter my data. I have now
removed the control chars, and Solr is as happy as I am.
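
For anyone hitting the same error, here is a minimal sketch of this kind of filtering, assuming the field values are built in Java before being posted to Solr. The class and method names (XmlCharFilter, stripInvalidXmlChars) are hypothetical and not part of Solr; the allowed ranges follow the Char production of XML 1.0, which excludes code 3.

```java
// Hypothetical helper, not part of Solr: drops every code point that
// XML 1.0 does not allow, so the update XML can be parsed.
final class XmlCharFilter {
    private XmlCharFilter() {}

    // XML 1.0 Char production: #x9 | #xA | #xD | [#x20-#xD7FF]
    // | [#xE000-#xFFFD] | [#x10000-#x10FFFF]
    static boolean isValidXmlChar(int cp) {
        return cp == 0x9 || cp == 0xA || cp == 0xD
            || (cp >= 0x20 && cp <= 0xD7FF)
            || (cp >= 0xE000 && cp <= 0xFFFD)
            || (cp >= 0x10000 && cp <= 0x10FFFF);
    }

    // Returns a copy of the input without the disallowed code points.
    static String stripInvalidXmlChars(String input) {
        StringBuilder out = new StringBuilder(input.length());
        for (int i = 0; i < input.length(); ) {
            int cp = input.codePointAt(i);
            if (isValidXmlChar(cp)) {
                out.appendCodePoint(cp);
            }
            // Advance by one or two chars depending on the code point.
            i += Character.charCount(cp);
        }
        return out.toString();
    }
}
```

Applying such a filter to each field value before it goes into the <add> document keeps tab, newline and carriage return, and silently drops the rest of the control range. Whether dropping or replacing with a space is the better choice depends on the data.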

On Sat, Nov 28, 2009 at 5:13 AM, Otis Gospodnetic
<ot...@yahoo.com> wrote:
> Could it be that your XML contains a .... control character, code 3? ;)
>
> Check the table on http://en.wikipedia.org/wiki/ASCII
>
> Otis

Re: SolrException caused by illegal character

Posted by Otis Gospodnetic <ot...@yahoo.com>.
Could it be that your XML contains a .... control character, code 3? ;)

Check the table on http://en.wikipedia.org/wiki/ASCII  

Otis
--
Sematext is hiring -- http://sematext.com/about/jobs.html?mls
Lucene, Solr, Nutch, Katta, Hadoop, HBase, UIMA, NLP, NER, IR
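
To check whether a given piece of text contains such a character, and where, a small scan like the following may help. It is only a sketch: the class and method names (ControlCharFinder, findFirst) are made up for illustration, it only looks for control characters below 0x20 other than tab, newline and carriage return, and its row/column counting is a simple approximation that may not match the position reported by the XML parser exactly.

```java
// Hypothetical diagnostic helper, not part of Solr or Woodstox.
final class ControlCharFinder {
    private ControlCharFinder() {}

    // Returns {row, col} (both 1-based) of the first control character
    // below 0x20 other than tab, LF and CR, or null if there is none.
    static int[] findFirst(String text) {
        int row = 1;
        int col = 0;
        for (int i = 0; i < text.length(); i++) {
            char c = text.charAt(i);
            if (c == '\n') {
                // A newline starts a new row and resets the column.
                row++;
                col = 0;
                continue;
            }
            col++;
            if (c < 0x20 && c != '\t' && c != '\r') {
                return new int[] { row, col };
            }
        }
        return null;
    }
}
```

Running this over the text of each document before it is sent would point at the offending field value, which can then be cleaned at the source.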


