You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@nutch.apache.org by Muwonge Ronald <ss...@gmail.com> on 2017/04/11 21:51:09 UTC

Nutch 2 and Cassandra 2 Problem!

Greetings,
any hint on why Iam failling at this level?
*bin/crawl urls/ 1 2*
No SOLRURL specified. Skipping indexing.
Injecting seed URLs
/usr/local/nutch/runtime/local/bin/nutch inject urls/ -crawlId 1
InjectorJob: starting at 2017-04-11 17:43:18
InjectorJob: Injecting urlDir: urls
InjectorJob: Using class org.apache.gora.cassandra.store.CassandraStore as
the Gora storage class.
InjectorJob: total number of urls rejected by filters: 0
InjectorJob: total number of urls injected after normalization and
filtering: 1
Injector: finished at 2017-04-11 17:43:24, elapsed: 00:00:05
Tue Apr 11 17:43:24 EDT 2017 : Iteration 1 of 2
Generating batchId
Generating a new fetchlist
/usr/local/nutch/runtime/local/bin/nutch generate -D mapred.reduce.tasks=2
-D mapred.child.java.opts=-Xmx1000m -D
mapred.reduce.tasks.speculative.execution=false -D
mapred.map.tasks.speculative.execution=false -D
mapred.compress.map.output=true -topN 50000 -noNorm -noFilter -adddays 0
-crawlId 1 -batchId 1491947004-17002
GeneratorJob: starting at 2017-04-11 17:43:26
GeneratorJob: Selecting best-scoring urls due for fetch.
GeneratorJob: starting
GeneratorJob: filtering: false
GeneratorJob: normalizing: false
GeneratorJob: topN: 50000
GeneratorJob: finished at 2017-04-11 17:43:32, time elapsed: 00:00:06
GeneratorJob: generated batch id: 1491947004-17002 containing 1 URLs
.
.
.
.
.
.

0/0 spinwaiting/active, 0 pages, 0 errors, 0.0 0 pages/s, 0 0 kb/s, 0 URLs
in 0 queues
-activeThreads=0
Exception in thread "main" java.lang.RuntimeException: job failed:
name=[1]fetch, jobid=job_local1649073953_0001
    at org.apache.nutch.util.NutchJob.waitForCompletion(NutchJob.java:120)
    at org.apache.nutch.fetcher.FetcherJob.run(FetcherJob.java:205)
    at org.apache.nutch.fetcher.FetcherJob.fetch(FetcherJob.java:251)
    at org.apache.nutch.fetcher.FetcherJob.run(FetcherJob.java:314)
    at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:70)
    at org.apache.nutch.fetcher.FetcherJob.main(FetcherJob.java:321)
Error running:
  /usr/local/nutch/runtime/local/bin/nutch fetch -D mapred.reduce.tasks=2
-D mapred.child.java.opts=-Xmx1000m -D
mapred.reduce.tasks.speculative.execution=false -D
mapred.map.tasks.speculative.execution=false -D
mapred.compress.map.output=true -D fetcher.timelimit.mins=180
1491947147-19941 -crawlId 1 -threads 50
Failed with exit value 1.



-- 

*For in much wisdom is much grief: and he that increases knowledge
increases sorrow.*
King Solomon