You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@nutch.apache.org by Trần Anh Tuấn <tk...@gmail.com> on 2012/08/07 06:35:37 UTC

Nutch 2.x with Cloudera CDH 4 get Error: Found interface org.apache.hadoop.mapreduce.TaskAttemptContext, but class was expected

I'm trying to crawl using Nutch 2.

I check out source from
http://svn.apache.org/repos/asf/nutch/branches/2.x/ and config with
mysql.

I get error but when run nutch 1.5 everything okie :(

mkdir urls
echo nutch.apache.org > urls/seed.txt
runtime/deploy/bin/nutch inject urls
12/08/07 11:25:38 INFO crawl.InjectorJob: InjectorJob: starting
12/08/07 11:25:38 INFO crawl.InjectorJob: InjectorJob: urlDir: urls
12/08/07 11:25:41 WARN mapred.JobClient: Use GenericOptionsParser for
parsing the arguments. Applications should implement Tool for the
same.
12/08/07 11:25:44 INFO input.FileInputFormat: Total input paths to process : 1
12/08/07 11:25:45 INFO util.NativeCodeLoader: Loaded the native-hadoop library
12/08/07 11:25:45 WARN snappy.LoadSnappy: Snappy native library is available
12/08/07 11:25:45 INFO snappy.LoadSnappy: Snappy native
12/08/07 11:25:47 INFO mapred.JobClient:  map 0% reduce 0%
12/08/07 11:26:01 INFO mapred.JobClient: Task Id :
attempt_201208071123_0001_m_000000_0, Status : FAILED
Error: Found interface org.apache.hadoop.mapreduce.TaskAttemptContext,
but class was expected
attempt_201208071123_0001_m_000000_0: SLF4J: Class path contains
multiple SLF4J bindings.
attempt_201208071123_0001_m_000000_0: SLF4J: Found binding in
[jar:file:/usr/lib/zookeeper/lib/slf4j-log4j12-1.6.1.jar!/org/slf4j/impl/StaticLoggerBinder.class]
attempt_201208071123_0001_m_000000_0: SLF4J: Found binding in
[jar:file:/var/lib/hadoop-hdfs/cache/mapred/mapred/local/taskTracker/root/jobcache/job_201208071123_0001/jars/lib/slf4j-log4j12-1.6.1.jar!/org/slf4j/impl/StaticLoggerBinder.class]
attempt_201208071123_0001_m_000000_0: SLF4J: See
http://www.slf4j.org/codes.html#multiple_bindings for an explanation.
12/08/07 11:26:05 INFO mapred.JobClient: Task Id :
attempt_201208071123_0001_m_000000_1, Status : FAILED
Error: Found interface org.apache.hadoop.mapreduce.TaskAttemptContext,
but class was expected
attempt_201208071123_0001_m_000000_1: SLF4J: Class path contains
multiple SLF4J bindings.
attempt_201208071123_0001_m_000000_1: SLF4J: Found binding in
[jar:file:/usr/lib/zookeeper/lib/slf4j-log4j12-1.6.1.jar!/org/slf4j/impl/StaticLoggerBinder.class]
attempt_201208071123_0001_m_000000_1: SLF4J: Found binding in
[jar:file:/var/lib/hadoop-hdfs/cache/mapred/mapred/local/taskTracker/root/jobcache/job_201208071123_0001/jars/lib/slf4j-log4j12-1.6.1.jar!/org/slf4j/impl/StaticLoggerBinder.class]
attempt_201208071123_0001_m_000000_1: SLF4J: See
http://www.slf4j.org/codes.html#multiple_bindings for an explanation.
12/08/07 11:26:10 INFO mapred.JobClient: Task Id :
attempt_201208071123_0001_m_000000_2, Status : FAILED
Error: Found interface org.apache.hadoop.mapreduce.TaskAttemptContext,
but class was expected
attempt_201208071123_0001_m_000000_2: SLF4J: Class path contains
multiple SLF4J bindings.
attempt_201208071123_0001_m_000000_2: SLF4J: Found binding in
[jar:file:/usr/lib/zookeeper/lib/slf4j-log4j12-1.6.1.jar!/org/slf4j/impl/StaticLoggerBinder.class]
attempt_201208071123_0001_m_000000_2: SLF4J: Found binding in
[jar:file:/var/lib/hadoop-hdfs/cache/mapred/mapred/local/taskTracker/root/jobcache/job_201208071123_0001/jars/lib/slf4j-log4j12-1.6.1.jar!/org/slf4j/impl/StaticLoggerBinder.class]
attempt_201208071123_0001_m_000000_2: SLF4J: See
http://www.slf4j.org/codes.html#multiple_bindings for an explanation.
12/08/07 11:26:19 INFO mapred.JobClient: Job complete: job_201208071123_0001
12/08/07 11:26:19 INFO mapred.JobClient: Counters: 7
12/08/07 11:26:19 INFO mapred.JobClient:   Job Counters
12/08/07 11:26:19 INFO mapred.JobClient:     Failed map tasks=1
12/08/07 11:26:19 INFO mapred.JobClient:     Launched map tasks=4
12/08/07 11:26:19 INFO mapred.JobClient:     Data-local map tasks=4
12/08/07 11:26:19 INFO mapred.JobClient:     Total time spent by all
maps in occupied slots (ms)=18003
12/08/07 11:26:19 INFO mapred.JobClient:     Total time spent by all
reduces in occupied slots (ms)=0
12/08/07 11:26:19 INFO mapred.JobClient:     Total time spent by all
maps waiting after reserving slots (ms)=0
12/08/07 11:26:19 INFO mapred.JobClient:     Total time spent by all
reduces waiting after reserving slots (ms)=0
12/08/07 11:26:19 ERROR crawl.InjectorJob: InjectorJob:
java.lang.RuntimeException: job failed: name=inject-p1 urls,
jobid=job_201208071123_0001
	at org.apache.nutch.util.NutchJob.waitForCompletion(NutchJob.java:47)
	at org.apache.nutch.crawl.InjectorJob.run(InjectorJob.java:248)
	at org.apache.nutch.crawl.InjectorJob.inject(InjectorJob.java:268)
	at org.apache.nutch.crawl.InjectorJob.run(InjectorJob.java:288)
	at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:70)
	at org.apache.nutch.crawl.InjectorJob.main(InjectorJob.java:298)
	at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
	at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
	at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
	at java.lang.reflect.Method.invoke(Method.java:597)
	at org.apache.hadoop.util.RunJar.main(RunJar.java:208)

Thanks

Trần Anh Tuấn.

Phone:  0989896118
Yahoo:   tk1cntt
Skype:   tk1cntt

Re: Nutch 2.x with Cloudera CDH 4 get Error: Found interface org.apache.hadoop.mapreduce.TaskAttemptContext, but class was expected

Posted by Ferdy Galema <fe...@kalooga.com>.
Hi,

CDH4 is not yet officially supported, but CDH3 is. If you get CDH4 working,
it would be great if you would post a patch or howto.

Ferdy.

On Tue, Aug 7, 2012 at 6:35 AM, Trần Anh Tuấn <tk...@gmail.com> wrote:

> I'm trying to crawl using Nutch 2.
>
> I check out source from
> http://svn.apache.org/repos/asf/nutch/branches/2.x/ and config with
> mysql.
>
> I get error but when run nutch 1.5 everything okie :(
>
> mkdir urls
> echo nutch.apache.org > urls/seed.txt
> runtime/deploy/bin/nutch inject urls
> 12/08/07 11:25:38 INFO crawl.InjectorJob: InjectorJob: starting
> 12/08/07 11:25:38 INFO crawl.InjectorJob: InjectorJob: urlDir: urls
> 12/08/07 11:25:41 WARN mapred.JobClient: Use GenericOptionsParser for
> parsing the arguments. Applications should implement Tool for the
> same.
> 12/08/07 11:25:44 INFO input.FileInputFormat: Total input paths to process
> : 1
> 12/08/07 11:25:45 INFO util.NativeCodeLoader: Loaded the native-hadoop
> library
> 12/08/07 11:25:45 WARN snappy.LoadSnappy: Snappy native library is
> available
> 12/08/07 11:25:45 INFO snappy.LoadSnappy: Snappy native
> 12/08/07 11:25:47 INFO mapred.JobClient:  map 0% reduce 0%
> 12/08/07 11:26:01 INFO mapred.JobClient: Task Id :
> attempt_201208071123_0001_m_000000_0, Status : FAILED
> Error: Found interface org.apache.hadoop.mapreduce.TaskAttemptContext,
> but class was expected
> attempt_201208071123_0001_m_000000_0: SLF4J: Class path contains
> multiple SLF4J bindings.
> attempt_201208071123_0001_m_000000_0: SLF4J: Found binding in
>
> [jar:file:/usr/lib/zookeeper/lib/slf4j-log4j12-1.6.1.jar!/org/slf4j/impl/StaticLoggerBinder.class]
> attempt_201208071123_0001_m_000000_0: SLF4J: Found binding in
>
> [jar:file:/var/lib/hadoop-hdfs/cache/mapred/mapred/local/taskTracker/root/jobcache/job_201208071123_0001/jars/lib/slf4j-log4j12-1.6.1.jar!/org/slf4j/impl/StaticLoggerBinder.class]
> attempt_201208071123_0001_m_000000_0: SLF4J: See
> http://www.slf4j.org/codes.html#multiple_bindings for an explanation.
> 12/08/07 11:26:05 INFO mapred.JobClient: Task Id :
> attempt_201208071123_0001_m_000000_1, Status : FAILED
> Error: Found interface org.apache.hadoop.mapreduce.TaskAttemptContext,
> but class was expected
> attempt_201208071123_0001_m_000000_1: SLF4J: Class path contains
> multiple SLF4J bindings.
> attempt_201208071123_0001_m_000000_1: SLF4J: Found binding in
>
> [jar:file:/usr/lib/zookeeper/lib/slf4j-log4j12-1.6.1.jar!/org/slf4j/impl/StaticLoggerBinder.class]
> attempt_201208071123_0001_m_000000_1: SLF4J: Found binding in
>
> [jar:file:/var/lib/hadoop-hdfs/cache/mapred/mapred/local/taskTracker/root/jobcache/job_201208071123_0001/jars/lib/slf4j-log4j12-1.6.1.jar!/org/slf4j/impl/StaticLoggerBinder.class]
> attempt_201208071123_0001_m_000000_1: SLF4J: See
> http://www.slf4j.org/codes.html#multiple_bindings for an explanation.
> 12/08/07 11:26:10 INFO mapred.JobClient: Task Id :
> attempt_201208071123_0001_m_000000_2, Status : FAILED
> Error: Found interface org.apache.hadoop.mapreduce.TaskAttemptContext,
> but class was expected
> attempt_201208071123_0001_m_000000_2: SLF4J: Class path contains
> multiple SLF4J bindings.
> attempt_201208071123_0001_m_000000_2: SLF4J: Found binding in
>
> [jar:file:/usr/lib/zookeeper/lib/slf4j-log4j12-1.6.1.jar!/org/slf4j/impl/StaticLoggerBinder.class]
> attempt_201208071123_0001_m_000000_2: SLF4J: Found binding in
>
> [jar:file:/var/lib/hadoop-hdfs/cache/mapred/mapred/local/taskTracker/root/jobcache/job_201208071123_0001/jars/lib/slf4j-log4j12-1.6.1.jar!/org/slf4j/impl/StaticLoggerBinder.class]
> attempt_201208071123_0001_m_000000_2: SLF4J: See
> http://www.slf4j.org/codes.html#multiple_bindings for an explanation.
> 12/08/07 11:26:19 INFO mapred.JobClient: Job complete:
> job_201208071123_0001
> 12/08/07 11:26:19 INFO mapred.JobClient: Counters: 7
> 12/08/07 11:26:19 INFO mapred.JobClient:   Job Counters
> 12/08/07 11:26:19 INFO mapred.JobClient:     Failed map tasks=1
> 12/08/07 11:26:19 INFO mapred.JobClient:     Launched map tasks=4
> 12/08/07 11:26:19 INFO mapred.JobClient:     Data-local map tasks=4
> 12/08/07 11:26:19 INFO mapred.JobClient:     Total time spent by all
> maps in occupied slots (ms)=18003
> 12/08/07 11:26:19 INFO mapred.JobClient:     Total time spent by all
> reduces in occupied slots (ms)=0
> 12/08/07 11:26:19 INFO mapred.JobClient:     Total time spent by all
> maps waiting after reserving slots (ms)=0
> 12/08/07 11:26:19 INFO mapred.JobClient:     Total time spent by all
> reduces waiting after reserving slots (ms)=0
> 12/08/07 11:26:19 ERROR crawl.InjectorJob: InjectorJob:
> java.lang.RuntimeException: job failed: name=inject-p1 urls,
> jobid=job_201208071123_0001
>         at
> org.apache.nutch.util.NutchJob.waitForCompletion(NutchJob.java:47)
>         at org.apache.nutch.crawl.InjectorJob.run(InjectorJob.java:248)
>         at org.apache.nutch.crawl.InjectorJob.inject(InjectorJob.java:268)
>         at org.apache.nutch.crawl.InjectorJob.run(InjectorJob.java:288)
>         at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:70)
>         at org.apache.nutch.crawl.InjectorJob.main(InjectorJob.java:298)
>         at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>         at
> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
>         at
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
>         at java.lang.reflect.Method.invoke(Method.java:597)
>         at org.apache.hadoop.util.RunJar.main(RunJar.java:208)
>
> Thanks
>
> Trần Anh Tuấn.
>
> Phone:  0989896118
> Yahoo:   tk1cntt
> Skype:   tk1cntt
>