Posted to solr-user@lucene.apache.org by alessio crisantemi <al...@gmail.com> on 2012/03/03 12:32:32 UTC

nutch log

This is my Nutch log after configuring it to index into Solr:

2012-03-03 12:20:25,520 INFO  solr.SolrMappingReader - source: content dest: content
2012-03-03 12:20:25,520 INFO  solr.SolrMappingReader - source: site dest: site
2012-03-03 12:20:25,520 INFO  solr.SolrMappingReader - source: title dest: title
2012-03-03 12:20:25,520 INFO  solr.SolrMappingReader - source: host dest: host
2012-03-03 12:20:25,520 INFO  solr.SolrMappingReader - source: segment dest: segment
2012-03-03 12:20:25,520 INFO  solr.SolrMappingReader - source: boost dest: boost
2012-03-03 12:20:25,520 INFO  solr.SolrMappingReader - source: digest dest: digest
2012-03-03 12:20:25,520 INFO  solr.SolrMappingReader - source: tstamp dest: tstamp
2012-03-03 12:20:25,520 INFO  solr.SolrMappingReader - source: url dest: id
2012-03-03 12:20:25,520 INFO  solr.SolrMappingReader - source: url dest: url
2012-03-03 12:20:25,707 INFO  solr.SolrWriter - Adding 11 documents
2012-03-03 12:20:26,519 WARN  mapred.LocalJobRunner - job_local_0019
org.apache.solr.common.SolrException: Internal Server Error
Internal Server Error
request: http://localhost:8983/solr/update?wt=javabin&version=2
 at org.apache.solr.client.solrj.impl.CommonsHttpSolrServer.request(CommonsHttpSolrServer.java:430)
 at org.apache.solr.client.solrj.impl.CommonsHttpSolrServer.request(CommonsHttpSolrServer.java:244)
 at org.apache.solr.client.solrj.request.AbstractUpdateRequest.process(AbstractUpdateRequest.java:105)
 at org.apache.solr.client.solrj.SolrServer.add(SolrServer.java:49)
 at org.apache.nutch.indexer.solr.SolrWriter.close(SolrWriter.java:93)
 at org.apache.nutch.indexer.IndexerOutputFormat$1.close(IndexerOutputFormat.java:48)
 at org.apache.hadoop.mapred.ReduceTask.runOldReducer(ReduceTask.java:474)
 at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:411)
 at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:216)
2012-03-03 12:20:27,377 ERROR solr.SolrIndexer - java.io.IOException: Job failed!
2012-03-03 12:20:27,393 INFO  solr.SolrDeleteDuplicates - SolrDeleteDuplicates: starting at 2012-03-03 12:20:27
2012-03-03 12:20:27,393 INFO  solr.SolrDeleteDuplicates - SolrDeleteDuplicates: Solr url: http://localhost:8983/solr/
Any suggestions?
Thanks,
alessio

Re: nutch log

Posted by alessio crisantemi <al...@gmail.com>.
Thanks Koji, but I don't understand what I should do.

On 4 March 2012 at 06:31, Koji Sekiguchi <ko...@r.email.ne.jp> wrote:

> It is not solr error. Consult nutch/hadoop mailing list.
>
>
> koji
> --
> Query Log Visualizer for Apache Solr
> http://soleami.com/
>
> (12/03/04 2:38), alessio crisantemi wrote:
>
>> now,
>>  I solve the boolean problem.
>>
>> but my indexing don't works now also..
>>
>> But this time, I don't have error in tomcat log and not error in nutch
>> log.
>> I see only this code on cygwin window:
>>
>> Exception in thread "main" org.apache.hadoop.mapred.InvalidInputException:
>> Input path does not exist:
>> file:/C:/temp/apache-nutch-1.4-bin/runtime/local/crawl/segments/20120303171628/parse_data
>>
>> at org.apache.hadoop.mapred.FileInputFormat.listStatus(FileInputFormat.java:190)
>> at org.apache.hadoop.mapred.SequenceFileInputFormat.listStatus(SequenceFileInputFormat.java:44)
>> at org.apache.hadoop.mapred.FileInputFormat.getSplits(FileInputFormat.java:201)
>> at org.apache.hadoop.mapred.JobClient.writeOldSplits(JobClient.java:810)
>> at org.apache.hadoop.mapred.JobClient.submitJobInternal(JobClient.java:781)
>> at org.apache.hadoop.mapred.JobClient.submitJob(JobClient.java:730)
>> at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:1249)
>> at org.apache.nutch.crawl.LinkDb.invert(LinkDb.java:175)
>> at org.apache.nutch.crawl.LinkDb.invert(LinkDb.java:149)
>> at org.apache.nutch.crawl.Crawl.run(Crawl.java:143)
>> at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65)
>> at org.apache.nutch.crawl.Crawl.main(Crawl.java:55)
>>
>>
>> why, in your opinion?
>> thanks again
>> alessio
>> On 3 March 2012 at 16:43, Koji Sekiguchi <ko...@r.email.ne.jp> wrote:
>>
>>  (12/03/04 0:09), alessio crisantemi wrote:
>>>
>>>  is true.
>>>> this is the slr problem:
>>>> mar 03, 2012 12:08:04 PM org.apache.solr.common.SolrException log
>>>> Grave: org.apache.solr.common.SolrException: invalid boolean value:
>>>>
>>>>
>>> Solr said that there was an erroneous boolean value in your
>>> solrconfig.xml.
>>> Check the values of<bool>...</bool>  of your solr plugins in
>>> solrconfig.xml.
>>> Those should be one of true/false/on/off/...
>>>
>>>
>>> koji
>>> --
>>> Query Log Visualizer for Apache Solr
>>> http://soleami.com/
>>>
>>>
>>

Re: nutch log

Posted by Koji Sekiguchi <ko...@r.email.ne.jp>.
This is not a Solr error. Consult the Nutch/Hadoop mailing list.

koji
-- 
Query Log Visualizer for Apache Solr
http://soleami.com/

(12/03/04 2:38), alessio crisantemi wrote:
> now,
>   I solve the boolean problem.
>
> but my indexing don't works now also..
>
> But this time, I don't have error in tomcat log and not error in nutch log.
> I see only this code on cygwin window:
>
> Exception in thread "main" org.apache.hadoop.mapred.InvalidInputException:
> Input path does not exist:
> file:/C:/temp/apache-nutch-1.4-bin/runtime/local/crawl/segments/20120303171628/parse_data
>
> at org.apache.hadoop.mapred.FileInputFormat.listStatus(FileInputFormat.java:190)
> at org.apache.hadoop.mapred.SequenceFileInputFormat.listStatus(SequenceFileInputFormat.java:44)
> at org.apache.hadoop.mapred.FileInputFormat.getSplits(FileInputFormat.java:201)
> at org.apache.hadoop.mapred.JobClient.writeOldSplits(JobClient.java:810)
> at org.apache.hadoop.mapred.JobClient.submitJobInternal(JobClient.java:781)
> at org.apache.hadoop.mapred.JobClient.submitJob(JobClient.java:730)
> at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:1249)
> at org.apache.nutch.crawl.LinkDb.invert(LinkDb.java:175)
> at org.apache.nutch.crawl.LinkDb.invert(LinkDb.java:149)
> at org.apache.nutch.crawl.Crawl.run(Crawl.java:143)
> at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65)
> at org.apache.nutch.crawl.Crawl.main(Crawl.java:55)
>
>
> why, in your opinion?
> thanks again
> alessio
> On 3 March 2012 at 16:43, Koji Sekiguchi <ko...@r.email.ne.jp> wrote:
>
>> (12/03/04 0:09), alessio crisantemi wrote:
>>
>>> is true.
>>> this is the slr problem:
>>> mar 03, 2012 12:08:04 PM org.apache.solr.common.SolrException log
>>> Grave: org.apache.solr.common.SolrException: invalid boolean value:
>>>
>>
>> Solr said that there was an erroneous boolean value in your solrconfig.xml.
>> Check the values of<bool>...</bool>  of your solr plugins in
>> solrconfig.xml.
>> Those should be one of true/false/on/off/...
>>
>>
>> koji
>> --
>> Query Log Visualizer for Apache Solr
>> http://soleami.com/
>>
>

Fwd: nutch log

Posted by alessio crisantemi <al...@gmail.com>.
Hi all,
I use Solr 1.4.1 and Nutch 1.4.
I have no errors in the Tomcat log or in the Nutch log.
I see only this output in the Cygwin window:

Exception in thread "main" org.apache.hadoop.mapred.InvalidInputException:
Input path does not exist:
file:/C:/temp/apache-nutch-1.4-bin/runtime/local/crawl/segments/20120303171628/parse_data

at org.apache.hadoop.mapred.FileInputFormat.listStatus(FileInputFormat.java:190)
at org.apache.hadoop.mapred.SequenceFileInputFormat.listStatus(SequenceFileInputFormat.java:44)
at org.apache.hadoop.mapred.FileInputFormat.getSplits(FileInputFormat.java:201)
at org.apache.hadoop.mapred.JobClient.writeOldSplits(JobClient.java:810)
at org.apache.hadoop.mapred.JobClient.submitJobInternal(JobClient.java:781)
at org.apache.hadoop.mapred.JobClient.submitJob(JobClient.java:730)
at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:1249)
at org.apache.nutch.crawl.LinkDb.invert(LinkDb.java:175)
at org.apache.nutch.crawl.LinkDb.invert(LinkDb.java:149)
at org.apache.nutch.crawl.Crawl.run(Crawl.java:143)
at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65)
at org.apache.nutch.crawl.Crawl.main(Crawl.java:55)


Why do you think this happens?
Thanks again,
alessio
On 3 March 2012 at 16:43, Koji Sekiguchi <ko...@r.email.ne.jp> wrote:

(12/03/04 0:09), alessio crisantemi wrote:
>
>> is true.
>> this is the slr problem:
>> mar 03, 2012 12:08:04 PM org.apache.solr.common.SolrException log
>> Grave: org.apache.solr.common.SolrException: invalid boolean value:
>>
>
> Solr said that there was an erroneous boolean value in your solrconfig.xml.
> Check the values of <bool>...</bool> of your solr plugins in
> solrconfig.xml.
> Those should be one of true/false/on/off/...
>
>
> koji
> --
> Query Log Visualizer for Apache Solr
> http://soleami.com/
>

Re: nutch log

Posted by alessio crisantemi <al...@gmail.com>.
Now I have solved the boolean problem.

But my indexing still doesn't work.

This time, I have no errors in the Tomcat log or in the Nutch log.
I see only this output in the Cygwin window:

Exception in thread "main" org.apache.hadoop.mapred.InvalidInputException:
Input path does not exist:
file:/C:/temp/apache-nutch-1.4-bin/runtime/local/crawl/segments/20120303171628/parse_data

at org.apache.hadoop.mapred.FileInputFormat.listStatus(FileInputFormat.java:190)
at org.apache.hadoop.mapred.SequenceFileInputFormat.listStatus(SequenceFileInputFormat.java:44)
at org.apache.hadoop.mapred.FileInputFormat.getSplits(FileInputFormat.java:201)
at org.apache.hadoop.mapred.JobClient.writeOldSplits(JobClient.java:810)
at org.apache.hadoop.mapred.JobClient.submitJobInternal(JobClient.java:781)
at org.apache.hadoop.mapred.JobClient.submitJob(JobClient.java:730)
at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:1249)
at org.apache.nutch.crawl.LinkDb.invert(LinkDb.java:175)
at org.apache.nutch.crawl.LinkDb.invert(LinkDb.java:149)
at org.apache.nutch.crawl.Crawl.run(Crawl.java:143)
at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65)
at org.apache.nutch.crawl.Crawl.main(Crawl.java:55)


Why do you think this happens?
Thanks again,
alessio
On 3 March 2012 at 16:43, Koji Sekiguchi <ko...@r.email.ne.jp> wrote:

> (12/03/04 0:09), alessio crisantemi wrote:
>
>> is true.
>> this is the slr problem:
>> mar 03, 2012 12:08:04 PM org.apache.solr.common.SolrException log
>> Grave: org.apache.solr.common.SolrException: invalid boolean value:
>>
>
> Solr said that there was an erroneous boolean value in your solrconfig.xml.
> Check the values of <bool>...</bool> of your solr plugins in
> solrconfig.xml.
> Those should be one of true/false/on/off/...
>
>
> koji
> --
> Query Log Visualizer for Apache Solr
> http://soleami.com/
>
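The InvalidInputException in this message says that the segment 20120303171628 has no parse_data directory. A fully fetched and parsed Nutch 1.x segment normally holds six subdirectories, so one way to narrow the problem down is to list which segments are incomplete. The sketch below is only an illustration: the helper names and the second segment name used in the demo are hypothetical, and a missing parse_data directory may have several causes (for example a fetch or parse step that did not finish), which this check does not diagnose.

```python
import os
import tempfile

# Subdirectories of a fully fetched and parsed Nutch 1.x segment.
EXPECTED = ("content", "crawl_fetch", "crawl_generate",
            "crawl_parse", "parse_data", "parse_text")


def incomplete_segments(segments_dir):
    """Map each segment name to the list of expected subdirectories it lacks."""
    report = {}
    for name in sorted(os.listdir(segments_dir)):
        seg = os.path.join(segments_dir, name)
        if not os.path.isdir(seg):
            continue
        missing = [d for d in EXPECTED
                   if not os.path.isdir(os.path.join(seg, d))]
        if missing:
            report[name] = missing
    return report


def demo():
    """Build a throwaway layout: one complete segment, one fetched but unparsed."""
    with tempfile.TemporaryDirectory() as root:
        # Hypothetical complete segment (name invented for the demo).
        for d in EXPECTED:
            os.makedirs(os.path.join(root, "20120303000000", d))
        # Segment named as in the thread, simulated without parse output.
        for d in ("content", "crawl_fetch", "crawl_generate"):
            os.makedirs(os.path.join(root, "20120303171628", d))
        return incomplete_segments(root)


if __name__ == "__main__":
    print(demo())
```

Against a real crawl, the call would be incomplete_segments on the crawl/segments directory from the error message. Any segment it reports is a candidate for re-fetching, re-parsing, or removal before running the link inversion again.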

Re: nutch log

Posted by Koji Sekiguchi <ko...@r.email.ne.jp>.
(12/03/04 0:09), alessio crisantemi wrote:
> is true.
> this is the slr problem:
> mar 03, 2012 12:08:04 PM org.apache.solr.common.SolrException log
> Grave: org.apache.solr.common.SolrException: invalid boolean value:

Solr reports an invalid boolean value in your solrconfig.xml.
Check the <bool>...</bool> values of your Solr plugins in solrconfig.xml.
Each should be one of true/false/on/off/...

koji
-- 
Query Log Visualizer for Apache Solr
http://soleami.com/
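The check suggested in this reply can be done by hand, but a small script makes it quicker on a large solrconfig.xml. The sketch below lists every <bool> element whose text is not one of the four spellings named above. Treat the accepted set as an assumption: the reply ends with "...", so Solr may accept more values than these. The sample config and the function name are hypothetical, not taken from the thread. Note that the log line ends right after "invalid boolean value:", which may point to an empty <bool></bool> element, so the sample includes one.

```python
import xml.etree.ElementTree as ET

# Assumption: spellings taken from the reply ("true/false/on/off/...").
# Solr may accept additional values not listed here.
ACCEPTED = {"true", "false", "on", "off"}


def find_bad_bools(xml_text):
    """Return (name, value) pairs for <bool> elements with unexpected text."""
    root = ET.fromstring(xml_text)
    bad = []
    for elem in root.iter("bool"):
        value = (elem.text or "").strip()
        if value.lower() not in ACCEPTED:
            bad.append((elem.get("name"), value))
    return bad


# Hypothetical config fragment for illustration only.
SAMPLE = """
<config>
  <requestHandler name="/example" class="solr.SearchHandler">
    <lst name="defaults">
      <bool name="good">true</bool>
      <bool name="empty"></bool>
      <bool name="typo">ture</bool>
    </lst>
  </requestHandler>
</config>
"""

if __name__ == "__main__":
    for name, value in find_bad_bools(SAMPLE):
        print(f"suspicious <bool name={name!r}>: {value!r}")
```

To run it on a real file, read solrconfig.xml into a string and pass it to find_bad_bools; the element names it prints show where to look.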

Re: nutch log

Posted by Markus Jelsma <ma...@openindex.io>.
 Looks like you have a bad value where a boolean is expected in your 
 solrconfig.xml.

 On Sat, 3 Mar 2012 16:09:11 +0100, alessio crisantemi 
 <al...@gmail.com> wrote:
> is true.
> this is the slr problem:
> mar 03, 2012 12:08:04 PM org.apache.solr.common.SolrException log
> Grave: org.apache.solr.common.SolrException: invalid boolean value:
>  at org.apache.solr.common.util.StrUtils.parseBool(StrUtils.java:237)
>  at org.apache.solr.common.util.DOMUtil.addToNamedList(DOMUtil.java:140)
>  at org.apache.solr.common.util.DOMUtil.nodesToNamedList(DOMUtil.java:98)
>  at org.apache.solr.common.util.DOMUtil.childNodesToNamedList(DOMUtil.java:88)
>  at org.apache.solr.common.util.DOMUtil.addToNamedList(DOMUtil.java:142)
>  at org.apache.solr.common.util.DOMUtil.nodesToNamedList(DOMUtil.java:98)
>  at org.apache.solr.common.util.DOMUtil.childNodesToNamedList(DOMUtil.java:88)
>  at org.apache.solr.core.PluginInfo.<init>(PluginInfo.java:54)
>  at org.apache.solr.core.SolrConfig.readPluginInfos(SolrConfig.java:220)
>  at org.apache.solr.core.SolrConfig.loadPluginInfo(SolrConfig.java:212)
>  at org.apache.solr.core.SolrConfig.<init>(SolrConfig.java:184)
>  at org.apache.solr.core.CoreContainer$Initializer.initialize(CoreContainer.java:134)
>  at org.apache.solr.servlet.SolrDispatchFilter.init(SolrDispatchFilter.java:83)
>  at org.apache.catalina.core.ApplicationFilterConfig.initFilter(ApplicationFilterConfig.java:277)
>  at org.apache.catalina.core.ApplicationFilterConfig.getFilter(ApplicationFilterConfig.java:258)
>  at org.apache.catalina.core.ApplicationFilterConfig.setFilterDef(ApplicationFilterConfig.java:382)
>  at org.apache.catalina.core.ApplicationFilterConfig.<init>(ApplicationFilterConfig.java:103)
>  at org.apache.catalina.core.StandardContext.filterStart(StandardContext.java:4624)
>  at org.apache.catalina.core.StandardContext.startInternal(StandardContext.java:5281)
>  at org.apache.catalina.util.LifecycleBase.start(LifecycleBase.java:150)
>  at org.apache.catalina.core.ContainerBase.addChildInternal(ContainerBase.java:866)
>  at org.apache.catalina.core.ContainerBase.addChild(ContainerBase.java:842)
>  at org.apache.catalina.core.StandardHost.addChild(StandardHost.java:615)
>  at org.apache.catalina.startup.HostConfig.deployDescriptor(HostConfig.java:649)
>  at org.apache.catalina.startup.HostConfig$DeployDescriptor.run(HostConfig.java:1581)
>  at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:471)
>  at java.util.concurrent.FutureTask$Sync.innerRun(FutureTask.java:334)
>  at java.util.concurrent.FutureTask.run(FutureTask.java:166)
>  at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1110)
>  at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:603)
>  at java.lang.Thread.run(Thread.java:722)
> whats means?
> thanks
> a.
>
> On 3 March 2012 at 14:40, Koji Sekiguchi <ko...@r.email.ne.jp> wrote:
>
>> (12/03/03 20:32), alessio crisantemi wrote:
>>
>>> this is my nutch log after configured it for solr index:
>>>
>>>  :
>>
>>> org.apache.solr.common.SolrException: Internal Server Error
>>> Internal Server Error
>>> request: http://localhost:8983/solr/update?wt=javabin&version=2
>>>  at org.apache.solr.client.solrj.impl.CommonsHttpSolrServer.request(CommonsHttpSolrServer.java:430)
>>>
>> :
>>
>>> suggestions?
>>> thanks
>>> alessio
>>>
>> Hi alessio,
>>
>> I have no ideas for nutch, but I think you can look for the cause of 
>> the
>> internal server
>> error in Solr log, not in nutch log.
>>
>> koji
>> --
>> Query Log Visualizer for Apache Solr
>> http://soleami.com/
>>

Re: nutch log

Posted by alessio crisantemi <al...@gmail.com>.
That's true.
This is the Solr problem:
mar 03, 2012 12:08:04 PM org.apache.solr.common.SolrException log
Grave: org.apache.solr.common.SolrException: invalid boolean value:
 at org.apache.solr.common.util.StrUtils.parseBool(StrUtils.java:237)
 at org.apache.solr.common.util.DOMUtil.addToNamedList(DOMUtil.java:140)
 at org.apache.solr.common.util.DOMUtil.nodesToNamedList(DOMUtil.java:98)
 at org.apache.solr.common.util.DOMUtil.childNodesToNamedList(DOMUtil.java:88)
 at org.apache.solr.common.util.DOMUtil.addToNamedList(DOMUtil.java:142)
 at org.apache.solr.common.util.DOMUtil.nodesToNamedList(DOMUtil.java:98)
 at org.apache.solr.common.util.DOMUtil.childNodesToNamedList(DOMUtil.java:88)
 at org.apache.solr.core.PluginInfo.<init>(PluginInfo.java:54)
 at org.apache.solr.core.SolrConfig.readPluginInfos(SolrConfig.java:220)
 at org.apache.solr.core.SolrConfig.loadPluginInfo(SolrConfig.java:212)
 at org.apache.solr.core.SolrConfig.<init>(SolrConfig.java:184)
 at org.apache.solr.core.CoreContainer$Initializer.initialize(CoreContainer.java:134)
 at org.apache.solr.servlet.SolrDispatchFilter.init(SolrDispatchFilter.java:83)
 at org.apache.catalina.core.ApplicationFilterConfig.initFilter(ApplicationFilterConfig.java:277)
 at org.apache.catalina.core.ApplicationFilterConfig.getFilter(ApplicationFilterConfig.java:258)
 at org.apache.catalina.core.ApplicationFilterConfig.setFilterDef(ApplicationFilterConfig.java:382)
 at org.apache.catalina.core.ApplicationFilterConfig.<init>(ApplicationFilterConfig.java:103)
 at org.apache.catalina.core.StandardContext.filterStart(StandardContext.java:4624)
 at org.apache.catalina.core.StandardContext.startInternal(StandardContext.java:5281)
 at org.apache.catalina.util.LifecycleBase.start(LifecycleBase.java:150)
 at org.apache.catalina.core.ContainerBase.addChildInternal(ContainerBase.java:866)
 at org.apache.catalina.core.ContainerBase.addChild(ContainerBase.java:842)
 at org.apache.catalina.core.StandardHost.addChild(StandardHost.java:615)
 at org.apache.catalina.startup.HostConfig.deployDescriptor(HostConfig.java:649)
 at org.apache.catalina.startup.HostConfig$DeployDescriptor.run(HostConfig.java:1581)
 at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:471)
 at java.util.concurrent.FutureTask$Sync.innerRun(FutureTask.java:334)
 at java.util.concurrent.FutureTask.run(FutureTask.java:166)
 at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1110)
 at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:603)
 at java.lang.Thread.run(Thread.java:722)
What does this mean?
Thanks,
a.

On 3 March 2012 at 14:40, Koji Sekiguchi <ko...@r.email.ne.jp> wrote:

> (12/03/03 20:32), alessio crisantemi wrote:
>
>> this is my nutch log after configured it for solr index:
>>
>>  :
>
>> org.apache.solr.common.SolrException: Internal Server Error
>> Internal Server Error
>> request: http://localhost:8983/solr/update?wt=javabin&version=2
>>  at org.apache.solr.client.solrj.impl.CommonsHttpSolrServer.request(CommonsHttpSolrServer.java:430)
>>
> :
>
>> suggestions?
>> thanks
>> alessio
>>
> Hi alessio,
>
> I have no ideas for nutch, but I think you can look for the cause of the
> internal server
> error in Solr log, not in nutch log.
>
> koji
> --
> Query Log Visualizer for Apache Solr
> http://soleami.com/
>

Re: nutch log

Posted by Koji Sekiguchi <ko...@r.email.ne.jp>.
(12/03/03 20:32), alessio crisantemi wrote:
> this is my nutch log after configured it for solr index:
>
:
> org.apache.solr.common.SolrException: Internal Server Error
> Internal Server Error
> request: http://localhost:8983/solr/update?wt=javabin&version=2
>   at
> org.apache.solr.client.solrj.impl.CommonsHttpSolrServer.request(CommonsHttpSolrServer.java:430)
:
> suggestions?
> thanks
> alessio
Hi alessio,

I have no idea about Nutch, but I think you should look for the cause of the
Internal Server Error in the Solr log, not in the Nutch log.

koji
-- 
Query Log Visualizer for Apache Solr
http://soleami.com/
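Following the advice in this reply, the server-side cause of an Internal Server Error usually shows up as a severe entry in the Solr (here Tomcat) log. The sketch below pulls such entries and their first few stack frames out of a log text. It is a rough illustration with assumptions: the function name is hypothetical, the level pattern covers only SEVERE, ERROR and "Grave" (which appears to be the localized SEVERE level in the log posted later in this thread), and real log formats vary.

```python
import re

# Assumption: severe entries start with one of these level names.
# "Grave" is the level name seen in the Tomcat log posted in this thread.
LEVEL = re.compile(r"^(SEVERE|Grave|ERROR):?\s", re.IGNORECASE)


def severe_entries(log_text, max_frames=3):
    """Return each severe message together with its first few stack frames."""
    entries = []
    current = None
    for line in log_text.splitlines():
        if LEVEL.match(line):
            current = {"message": line.strip(), "frames": []}
            entries.append(current)
        elif current is not None and line.lstrip().startswith("at "):
            if len(current["frames"]) < max_frames:
                current["frames"].append(line.strip())
        else:
            # Any other line ends the current entry.
            current = None
    return entries


# Excerpt copied from the Tomcat log quoted in this thread.
SAMPLE = """\
mar 03, 2012 12:08:04 PM org.apache.solr.common.SolrException log
Grave: org.apache.solr.common.SolrException: invalid boolean value:
 at org.apache.solr.common.util.StrUtils.parseBool(StrUtils.java:237)
 at org.apache.solr.common.util.DOMUtil.addToNamedList(DOMUtil.java:140)
"""

if __name__ == "__main__":
    for entry in severe_entries(SAMPLE):
        print(entry["message"])
        for frame in entry["frames"]:
            print("   ", frame)
```

The topmost frames are often the most useful part: in the excerpt they point at boolean parsing while the configuration is being read.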