You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@nutch.apache.org by jeffersonzhou <je...@gmail.com> on 2011/08/16 11:59:27 UTC
Reducer failed when nutch and hadoop work togather
Hi,
I need help to figure out why reducer failed. I am using nutch 1.2 and the
hadoop shipped with nutch 1.2. I was using
http://wiki.apache.org/nutch/NutchHadoopTutorial to configure the two.
Below is the information:
Hadoop Map/Reduce History Viewer
_____
Available History
Available Jobs
Job tracker Host Name
Job tracker Start time
Job Id
Name
User
localhost
Tue Aug 16 15:59:38 GMT 2011
job_201108161559_0001
<http://192.168.1.116:50030/jobdetailshistory.jsp?jobid=job_201108161559_000
1&logFile=file:/nutch/search/logs/history/localhost_1313510378870_job_201108
161559_0001_nutch_inject%2Burls>
inject urls
nutch
localhost
Tue Aug 16 15:59:38 GMT 2011
job_201108161559_0002
<http://192.168.1.116:50030/jobdetailshistory.jsp?jobid=job_201108161559_000
2&logFile=file:/nutch/search/logs/history/localhost_1313510378870_job_201108
161559_0002_nutch_crawldb%2Bmit%252Fcrawldb>
crawldb mit/crawldb
nutch
localhost
Tue Aug 16 15:59:38 GMT 2011
job_201108161559_0003
<http://192.168.1.116:50030/jobdetailshistory.jsp?jobid=job_201108161559_000
3&logFile=file:/nutch/search/logs/history/localhost_1313510378870_job_201108
161559_0003_nutch_generate%253A%2Bselect%2Bfrom%2Bmit%252Fcrawldb>
generate: select from mit/crawldb
nutch
localhost
Tue Aug 16 15:59:38 GMT 2011
job_201108161559_0004
<http://192.168.1.116:50030/jobdetailshistory.jsp?jobid=job_201108161559_000
4&logFile=file:/nutch/search/logs/history/localhost_1313510378870_job_201108
161559_0004_nutch_generate%253A%2Bpartition%2Bmit%252Fsegments%252F201108161
60509>
generate: partition mit/segments/20110816160509
nutch
localhost
Tue Aug 16 15:59:38 GMT 2011
job_201108161559_0005
<http://192.168.1.116:50030/jobdetailshistory.jsp?jobid=job_201108161559_000
5&logFile=file:/nutch/search/logs/history/localhost_1313510378870_job_201108
161559_0005_nutch_fetch%2Bmit%252Fsegments%252F20110816160509>
fetch mit/segments/20110816160509
nutch
localhost
Tue Aug 16 15:59:38 GMT 2011
job_201108161559_0006
<http://192.168.1.116:50030/jobdetailshistory.jsp?jobid=job_201108161559_000
6&logFile=file:/nutch/search/logs/history/localhost_1313510378870_job_201108
161559_0006_nutch_crawldb%2Bmit%252Fcrawldb>
crawldb mit/crawldb
nutch
localhost
Tue Aug 16 15:59:38 GMT 2011
job_201108161559_0007
<http://192.168.1.116:50030/jobdetailshistory.jsp?jobid=job_201108161559_000
7&logFile=file:/nutch/search/logs/history/localhost_1313510378870_job_201108
161559_0007_nutch_linkdb%2Bmit%252Flinkdb>
linkdb mit/linkdb
nutch
localhost
Tue Aug 16 15:59:38 GMT 2011
job_201108161559_0008
<http://192.168.1.116:50030/jobdetailshistory.jsp?jobid=job_201108161559_000
8&logFile=file:/nutch/search/logs/history/localhost_1313510378870_job_201108
161559_0008_nutch_index-lucene%2Bmit%252Findexes>
index-lucene mit/indexes
nutch
localhost
Tue Aug 16 15:59:38 GMT 2011
job_201108161559_0009
<http://192.168.1.116:50030/jobdetailshistory.jsp?jobid=job_201108161559_000
9&logFile=file:/nutch/search/logs/history/localhost_1313510378870_job_201108
161559_0009_nutch_dedup%2B1%253A%2Burls%2Bby%2Btime>
dedup 1: urls by time
nutch
localhost
Tue Aug 16 15:59:38 GMT 2011
job_201108161559_0010
<http://192.168.1.116:50030/jobdetailshistory.jsp?jobid=job_201108161559_001
0&logFile=file:/nutch/search/logs/history/localhost_1313510378870_job_201108
161559_0010_nutch_dedup%2B2%253A%2Bcontent%2Bby%2Bhash>
dedup 2: content by hash
nutch
localhost
Tue Aug 16 15:59:38 GMT 2011
job_201108161559_0011
<http://192.168.1.116:50030/jobdetailshistory.jsp?jobid=job_201108161559_001
1&logFile=file:/nutch/search/logs/history/localhost_1313510378870_job_201108
161559_0011_nutch_dedup%2B3%253A%2Bdelete%2Bfrom%2Bindex%2528es%2529>
dedup 3: delete from index(es)
nutch
localhost
Tue Aug 16 15:59:38 GMT 2011
job_201108161559_0012
<http://192.168.1.116:50030/jobdetailshistory.jsp?jobid=job_201108161559_001
2&logFile=file:/nutch/search/logs/history/localhost_1313510378870_job_201108
161559_0012_nutch_inject%2Burls>
inject urls
nutch
localhost
Tue Aug 16 15:59:38 GMT 2011
job_201108161559_0013
<http://192.168.1.116:50030/jobdetailshistory.jsp?jobid=job_201108161559_001
3&logFile=file:/nutch/search/logs/history/localhost_1313510378870_job_201108
161559_0013_nutch_crawldb%2Bmit%252Fcrawldb>
crawldb mit/crawldb
nutch
localhost
Tue Aug 16 15:59:38 GMT 2011
job_201108161559_0014
<http://192.168.1.116:50030/jobdetailshistory.jsp?jobid=job_201108161559_001
4&logFile=file:/nutch/search/logs/history/localhost_1313510378870_job_201108
161559_0014_nutch_generate%253A%2Bselect%2Bfrom%2Bmit%252Fcrawldb>
generate: select from mit/crawldb
nutch
localhost
Tue Aug 16 15:59:38 GMT 2011
job_201108161559_0015
<http://192.168.1.116:50030/jobdetailshistory.jsp?jobid=job_201108161559_001
5&logFile=file:/nutch/search/logs/history/localhost_1313510378870_job_201108
161559_0015_nutch_generate%253A%2Bpartition%2Bmit%252Fsegments%252F201108161
62211>
generate: partition mit/segments/20110816162211
nutch
localhost
Tue Aug 16 15:59:38 GMT 2011
job_201108161559_0016
<http://192.168.1.116:50030/jobdetailshistory.jsp?jobid=job_201108161559_001
6&logFile=file:/nutch/search/logs/history/localhost_1313510378870_job_201108
161559_0016_nutch_fetch%2Bmit%252Fsegments%252F20110816162211>
fetch mit/segments/20110816162211
nutch
localhost
Tue Aug 16 15:59:38 GMT 2011
job_201108161559_0017
<http://192.168.1.116:50030/jobdetailshistory.jsp?jobid=job_201108161559_001
7&logFile=file:/nutch/search/logs/history/localhost_1313510378870_job_201108
161559_0017_nutch_crawldb%2Bmit%252Fcrawldb>
crawldb mit/crawldb
nutch
localhost
Tue Aug 16 15:59:38 GMT 2011
job_201108161559_0018
<http://192.168.1.116:50030/jobdetailshistory.jsp?jobid=job_201108161559_001
8&logFile=file:/nutch/search/logs/history/localhost_1313510378870_job_201108
161559_0018_nutch_dump%2Bmit%252Fcrawldb>
dump mit/crawldb
nutch
master
Tue Aug 16 16:43:27 GMT 2011
job_201108161643_0001
<http://192.168.1.116:50030/jobdetailshistory.jsp?jobid=job_201108161643_000
1&logFile=file:/nutch/search/logs/history/master_1313513007144_job_201108161
643_0001_nutch_inject%2Burls>
inject urls
nutch
The error came from the last row: inject urls. The inject details are:
Hadoop Job job_201108161643_0001 on History Viewer
<http://192.168.1.116:50030/jobhistory.jsp>
User: nutch
JobName: inject urls
JobConf:
hdfs://master:9000/nutch/filesystem/mapreduce/system/job_201108161643_0001/j
ob.xml
<http://192.168.1.116:50030/jobconf_history.jsp?jobid=job_201108161643_0001&
jobLogDir=file:/nutch/search/logs/history&jobUniqueString=master_13135130071
44_job_201108161643_0001>
Submitted At: 16-Aug-2011 16:45:11
Launched At: 16-Aug-2011 16:45:15 (4sec)
Finished At: 16-Aug-2011 16:46:22 (1mins, 7sec)
Status: FAILED
Analyse This Job
<http://192.168.1.116:50030/analysejobhistory.jsp?jobid=job_201108161643_000
1&logFile=file:/nutch/search/logs/history/master_1313513007144_job_201108161
643_0001_nutch_inject%2Burls>
_____
Kind
Total Tasks(successful+failed+killed)
Successful tasks
Failed tasks
Killed tasks
Start Time
Finish Time
Setup
1
<http://192.168.1.116:50030/jobtaskshistory.jsp?jobid=job_201108161643_0001&
logFile=file:/nutch/search/logs/history/master_1313513007144_job_20110816164
3_0001_nutch_inject%2Burls&taskType=SETUP&status=all>
1
<http://192.168.1.116:50030/jobtaskshistory.jsp?jobid=job_201108161643_0001&
logFile=file:/nutch/search/logs/history/master_1313513007144_job_20110816164
3_0001_nutch_inject%2Burls&taskType=SETUP&status=SUCCESS>
0
<http://192.168.1.116:50030/jobtaskshistory.jsp?jobid=job_201108161643_0001&
logFile=file:/nutch/search/logs/history/master_1313513007144_job_20110816164
3_0001_nutch_inject%2Burls&taskType=SETUP&status=FAILED>
0
<http://192.168.1.116:50030/jobtaskshistory.jsp?jobid=job_201108161643_0001&
logFile=file:/nutch/search/logs/history/master_1313513007144_job_20110816164
3_0001_nutch_inject%2Burls&taskType=SETUP&status=KILLED>
16-Aug-2011 16:45:39
16-Aug-2011 16:45:41 (1sec)
Map
3
<http://192.168.1.116:50030/jobtaskshistory.jsp?jobid=job_201108161643_0001&
logFile=file:/nutch/search/logs/history/master_1313513007144_job_20110816164
3_0001_nutch_inject%2Burls&taskType=MAP&status=all>
3
<http://192.168.1.116:50030/jobtaskshistory.jsp?jobid=job_201108161643_0001&
logFile=file:/nutch/search/logs/history/master_1313513007144_job_20110816164
3_0001_nutch_inject%2Burls&taskType=MAP&status=SUCCESS>
0
<http://192.168.1.116:50030/jobtaskshistory.jsp?jobid=job_201108161643_0001&
logFile=file:/nutch/search/logs/history/master_1313513007144_job_20110816164
3_0001_nutch_inject%2Burls&taskType=MAP&status=FAILED>
0
<http://192.168.1.116:50030/jobtaskshistory.jsp?jobid=job_201108161643_0001&
logFile=file:/nutch/search/logs/history/master_1313513007144_job_20110816164
3_0001_nutch_inject%2Burls&taskType=MAP&status=KILLED>
16-Aug-2011 16:45:42
16-Aug-2011 16:46:17 (34sec)
Reduce
8
<http://192.168.1.116:50030/jobtaskshistory.jsp?jobid=job_201108161643_0001&
logFile=file:/nutch/search/logs/history/master_1313513007144_job_20110816164
3_0001_nutch_inject%2Burls&taskType=REDUCE&status=all>
0
<http://192.168.1.116:50030/jobtaskshistory.jsp?jobid=job_201108161643_0001&
logFile=file:/nutch/search/logs/history/master_1313513007144_job_20110816164
3_0001_nutch_inject%2Burls&taskType=REDUCE&status=SUCCESS>
8
<http://192.168.1.116:50030/jobtaskshistory.jsp?jobid=job_201108161643_0001&
logFile=file:/nutch/search/logs/history/master_1313513007144_job_20110816164
3_0001_nutch_inject%2Burls&taskType=REDUCE&status=FAILED>
0
<http://192.168.1.116:50030/jobtaskshistory.jsp?jobid=job_201108161643_0001&
logFile=file:/nutch/search/logs/history/master_1313513007144_job_20110816164
3_0001_nutch_inject%2Burls&taskType=REDUCE&status=KILLED>
16-Aug-2011 16:45:58
16-Aug-2011 16:46:36 (37sec)
Cleanup
1
<http://192.168.1.116:50030/jobtaskshistory.jsp?jobid=job_201108161643_0001&
logFile=file:/nutch/search/logs/history/master_1313513007144_job_20110816164
3_0001_nutch_inject%2Burls&taskType=CLEANUP&status=all>
1
<http://192.168.1.116:50030/jobtaskshistory.jsp?jobid=job_201108161643_0001&
logFile=file:/nutch/search/logs/history/master_1313513007144_job_20110816164
3_0001_nutch_inject%2Burls&taskType=CLEANUP&status=SUCCESS>
0
<http://192.168.1.116:50030/jobtaskshistory.jsp?jobid=job_201108161643_0001&
logFile=file:/nutch/search/logs/history/master_1313513007144_job_20110816164
3_0001_nutch_inject%2Burls&taskType=CLEANUP&status=FAILED>
0
<http://192.168.1.116:50030/jobtaskshistory.jsp?jobid=job_201108161643_0001&
logFile=file:/nutch/search/logs/history/master_1313513007144_job_20110816164
3_0001_nutch_inject%2Burls&taskType=CLEANUP&status=KILLED>
16-Aug-2011 16:46:37
16-Aug-2011 16:46:39 (1sec)
Failed tasks attempts by nodes
Hostname
Failed Tasks
slave_1
task_201108161643_0001_r_000000
<http://192.168.1.116:50030/taskdetailshistory.jsp?jobid=job_201108161643_00
01&logFile=file:/nutch/search/logs/history/master_1313513007144_job_20110816
1643_0001_nutch_inject%2Burls&taskid=task_201108161643_0001_r_000000> ,
task_201108161643_0001_r_000001
<http://192.168.1.116:50030/taskdetailshistory.jsp?jobid=job_201108161643_00
01&logFile=file:/nutch/search/logs/history/master_1313513007144_job_20110816
1643_0001_nutch_inject%2Burls&taskid=task_201108161643_0001_r_000001> ,
slave_2
task_201108161643_0001_r_000000
<http://192.168.1.116:50030/taskdetailshistory.jsp?jobid=job_201108161643_00
01&logFile=file:/nutch/search/logs/history/master_1313513007144_job_20110816
1643_0001_nutch_inject%2Burls&taskid=task_201108161643_0001_r_000000> ,
task_201108161643_0001_r_000001
<http://192.168.1.116:50030/taskdetailshistory.jsp?jobid=job_201108161643_00
01&logFile=file:/nutch/search/logs/history/master_1313513007144_job_20110816
1643_0001_nutch_inject%2Burls&taskid=task_201108161643_0001_r_000001> ,
The eight failed tasks are:
FAILED REDUCE task list for job_201108161643_0001
<http://192.168.1.116:50030/jobdetailshistory.jsp?jobid=job_201108161643_000
1&&logFile=file:/nutch/search/logs/history/master_1313513007144_job_20110816
1643_0001_nutch_inject%2Burls>
Task Id
Start Time
Finish Time
Error
task_201108161643_0001_r_000000
<http://192.168.1.116:50030/taskdetailshistory.jsp?jobid=job_201108161643_00
01&logFile=file:/nutch/search/logs/history/master_1313513007144_job_20110816
1643_0001_nutch_inject%2Burls&taskid=task_201108161643_0001_r_000000>
16/08 16:45:58
16/08 16:46:06 (7sec)
Error: java.lang.NullPointerException at
java.util.concurrent.ConcurrentHashMap.get(ConcurrentHashMap.java:922) at
org.apache.hadoop.mapred.ReduceTask$ReduceCopier$GetMapEventsThread.getMapCo
mpletionEvents(ReduceTask.java:2683) at
org.apache.hadoop.mapred.ReduceTask$ReduceCopier$GetMapEventsThread.run(Redu
ceTask.java:2605)
task_201108161643_0001_r_000000
<http://192.168.1.116:50030/taskdetailshistory.jsp?jobid=job_201108161643_00
01&logFile=file:/nutch/search/logs/history/master_1313513007144_job_20110816
1643_0001_nutch_inject%2Burls&taskid=task_201108161643_0001_r_000000>
16/08 16:46:19
16/08 16:46:24 (4sec)
Error: java.lang.NullPointerException at
java.util.concurrent.ConcurrentHashMap.get(ConcurrentHashMap.java:922) at
org.apache.hadoop.mapred.ReduceTask$ReduceCopier$GetMapEventsThread.getMapCo
mpletionEvents(ReduceTask.java:2683) at
org.apache.hadoop.mapred.ReduceTask$ReduceCopier$GetMapEventsThread.run(Redu
ceTask.java:2605)
task_201108161643_0001_r_000000
<http://192.168.1.116:50030/taskdetailshistory.jsp?jobid=job_201108161643_00
01&logFile=file:/nutch/search/logs/history/master_1313513007144_job_20110816
1643_0001_nutch_inject%2Burls&taskid=task_201108161643_0001_r_000000>
16/08 16:46:25
16/08 16:46:30 (4sec)
Error: java.lang.NullPointerException at
java.util.concurrent.ConcurrentHashMap.get(ConcurrentHashMap.java:922) at
org.apache.hadoop.mapred.ReduceTask$ReduceCopier$GetMapEventsThread.getMapCo
mpletionEvents(ReduceTask.java:2683) at
org.apache.hadoop.mapred.ReduceTask$ReduceCopier$GetMapEventsThread.run(Redu
ceTask.java:2605)
task_201108161643_0001_r_000000
<http://192.168.1.116:50030/taskdetailshistory.jsp?jobid=job_201108161643_00
01&logFile=file:/nutch/search/logs/history/master_1313513007144_job_20110816
1643_0001_nutch_inject%2Burls&taskid=task_201108161643_0001_r_000000>
16/08 16:46:31
16/08 16:46:36 (4sec)
Error: java.lang.NullPointerException at
java.util.concurrent.ConcurrentHashMap.get(ConcurrentHashMap.java:922) at
org.apache.hadoop.mapred.ReduceTask$ReduceCopier$GetMapEventsThread.getMapCo
mpletionEvents(ReduceTask.java:2683) at
org.apache.hadoop.mapred.ReduceTask$ReduceCopier$GetMapEventsThread.run(Redu
ceTask.java:2605)
task_201108161643_0001_r_000001
<http://192.168.1.116:50030/taskdetailshistory.jsp?jobid=job_201108161643_00
01&logFile=file:/nutch/search/logs/history/master_1313513007144_job_20110816
1643_0001_nutch_inject%2Burls&taskid=task_201108161643_0001_r_000001>
16/08 16:46:08
16/08 16:46:16 (7sec)
Error: java.lang.NullPointerException at
java.util.concurrent.ConcurrentHashMap.get(ConcurrentHashMap.java:922) at
org.apache.hadoop.mapred.ReduceTask$ReduceCopier$GetMapEventsThread.getMapCo
mpletionEvents(ReduceTask.java:2683) at
org.apache.hadoop.mapred.ReduceTask$ReduceCopier$GetMapEventsThread.run(Redu
ceTask.java:2605)
task_201108161643_0001_r_000001
<http://192.168.1.116:50030/taskdetailshistory.jsp?jobid=job_201108161643_00
01&logFile=file:/nutch/search/logs/history/master_1313513007144_job_20110816
1643_0001_nutch_inject%2Burls&taskid=task_201108161643_0001_r_000001>
16/08 16:46:19
16/08 16:46:24 (5sec)
Error: java.lang.NullPointerException at
java.util.concurrent.ConcurrentHashMap.get(ConcurrentHashMap.java:922) at
org.apache.hadoop.mapred.ReduceTask$ReduceCopier$GetMapEventsThread.getMapCo
mpletionEvents(ReduceTask.java:2683) at
org.apache.hadoop.mapred.ReduceTask$ReduceCopier$GetMapEventsThread.run(Redu
ceTask.java:2605)
task_201108161643_0001_r_000001
<http://192.168.1.116:50030/taskdetailshistory.jsp?jobid=job_201108161643_00
01&logFile=file:/nutch/search/logs/history/master_1313513007144_job_20110816
1643_0001_nutch_inject%2Burls&taskid=task_201108161643_0001_r_000001>
16/08 16:46:25
16/08 16:46:30 (4sec)
Error: java.lang.NullPointerException at
java.util.concurrent.ConcurrentHashMap.get(ConcurrentHashMap.java:922) at
org.apache.hadoop.mapred.ReduceTask$ReduceCopier$GetMapEventsThread.getMapCo
mpletionEvents(ReduceTask.java:2683) at
org.apache.hadoop.mapred.ReduceTask$ReduceCopier$GetMapEventsThread.run(Redu
ceTask.java:2605)
task_201108161643_0001_r_000001
<http://192.168.1.116:50030/taskdetailshistory.jsp?jobid=job_201108161643_00
01&logFile=file:/nutch/search/logs/history/master_1313513007144_job_20110816
1643_0001_nutch_inject%2Burls&taskid=task_201108161643_0001_r_000001>
16/08 16:46:31
16/08 16:46:35 (4sec)
Error: java.lang.NullPointerException at
java.util.concurrent.ConcurrentHashMap.get(ConcurrentHashMap.java:922) at
org.apache.hadoop.mapred.ReduceTask$ReduceCopier$GetMapEventsThread.getMapCo
mpletionEvents(ReduceTask.java:2683) at
org.apache.hadoop.mapred.ReduceTask$ReduceCopier$GetMapEventsThread.run(Redu
ceTask.java:2605)
Re: Reducer failed when nutch and hadoop work togather
Posted by Markus Jelsma <ma...@openindex.io>.
Can you search the internet for :
Error: java.lang.NullPointerException at
java.util.concurrent.ConcurrentHashMap.get
There are several pages on this subject of which at least one has a solution.
But i don't know if that only applies to an older Hadoop version.
On Tuesday 16 August 2011 11:59:27 jeffersonzhou wrote:
> Hi,
>
>
>
> I need help to figure out why reducer failed. I am using nutch 1.2 and the
> hadoop shipped with nutch 1.2. I was using
> http://wiki.apache.org/nutch/NutchHadoopTutorial to configure the two.
>
>
>
> Below is the information:
>
>
>
>
> Hadoop Map/Reduce History Viewer
>
> _____
>
>
> Available History
>
>
> Available Jobs
>
>
> Job tracker Host Name
>
> Job tracker Start time
>
> Job Id
>
> Name
>
> User
>
>
>
> localhost
>
> Tue Aug 16 15:59:38 GMT 2011
>
> job_201108161559_0001
> <http://192.168.1.116:50030/jobdetailshistory.jsp?jobid=job_201108161559_00
> 0
> 1&logFile=file:/nutch/search/logs/history/localhost_1313510378870_job_2011
> 08 161559_0001_nutch_inject%2Burls>
>
> inject urls
>
> nutch
>
>
>
> localhost
>
> Tue Aug 16 15:59:38 GMT 2011
>
> job_201108161559_0002
> <http://192.168.1.116:50030/jobdetailshistory.jsp?jobid=job_201108161559_00
> 0
> 2&logFile=file:/nutch/search/logs/history/localhost_1313510378870_job_2011
> 08 161559_0002_nutch_crawldb%2Bmit%252Fcrawldb>
>
> crawldb mit/crawldb
>
> nutch
>
>
>
> localhost
>
> Tue Aug 16 15:59:38 GMT 2011
>
> job_201108161559_0003
> <http://192.168.1.116:50030/jobdetailshistory.jsp?jobid=job_201108161559_00
> 0
> 3&logFile=file:/nutch/search/logs/history/localhost_1313510378870_job_2011
> 08 161559_0003_nutch_generate%253A%2Bselect%2Bfrom%2Bmit%252Fcrawldb>
>
> generate: select from mit/crawldb
>
> nutch
>
>
>
> localhost
>
> Tue Aug 16 15:59:38 GMT 2011
>
> job_201108161559_0004
> <http://192.168.1.116:50030/jobdetailshistory.jsp?jobid=job_201108161559_00
> 0
> 4&logFile=file:/nutch/search/logs/history/localhost_1313510378870_job_2011
> 08
> 161559_0004_nutch_generate%253A%2Bpartition%2Bmit%252Fsegments%252F2011081
> 61 60509>
>
> generate: partition mit/segments/20110816160509
>
> nutch
>
>
>
> localhost
>
> Tue Aug 16 15:59:38 GMT 2011
>
> job_201108161559_0005
> <http://192.168.1.116:50030/jobdetailshistory.jsp?jobid=job_201108161559_00
> 0
> 5&logFile=file:/nutch/search/logs/history/localhost_1313510378870_job_2011
> 08 161559_0005_nutch_fetch%2Bmit%252Fsegments%252F20110816160509>
>
> fetch mit/segments/20110816160509
>
> nutch
>
>
>
> localhost
>
> Tue Aug 16 15:59:38 GMT 2011
>
> job_201108161559_0006
> <http://192.168.1.116:50030/jobdetailshistory.jsp?jobid=job_201108161559_00
> 0
> 6&logFile=file:/nutch/search/logs/history/localhost_1313510378870_job_2011
> 08 161559_0006_nutch_crawldb%2Bmit%252Fcrawldb>
>
> crawldb mit/crawldb
>
> nutch
>
>
>
> localhost
>
> Tue Aug 16 15:59:38 GMT 2011
>
> job_201108161559_0007
> <http://192.168.1.116:50030/jobdetailshistory.jsp?jobid=job_201108161559_00
> 0
> 7&logFile=file:/nutch/search/logs/history/localhost_1313510378870_job_2011
> 08 161559_0007_nutch_linkdb%2Bmit%252Flinkdb>
>
> linkdb mit/linkdb
>
> nutch
>
>
>
> localhost
>
> Tue Aug 16 15:59:38 GMT 2011
>
> job_201108161559_0008
> <http://192.168.1.116:50030/jobdetailshistory.jsp?jobid=job_201108161559_00
> 0
> 8&logFile=file:/nutch/search/logs/history/localhost_1313510378870_job_2011
> 08 161559_0008_nutch_index-lucene%2Bmit%252Findexes>
>
> index-lucene mit/indexes
>
> nutch
>
>
>
> localhost
>
> Tue Aug 16 15:59:38 GMT 2011
>
> job_201108161559_0009
> <http://192.168.1.116:50030/jobdetailshistory.jsp?jobid=job_201108161559_00
> 0
> 9&logFile=file:/nutch/search/logs/history/localhost_1313510378870_job_2011
> 08 161559_0009_nutch_dedup%2B1%253A%2Burls%2Bby%2Btime>
>
> dedup 1: urls by time
>
> nutch
>
>
>
> localhost
>
> Tue Aug 16 15:59:38 GMT 2011
>
> job_201108161559_0010
> <http://192.168.1.116:50030/jobdetailshistory.jsp?jobid=job_201108161559_00
> 1
> 0&logFile=file:/nutch/search/logs/history/localhost_1313510378870_job_2011
> 08 161559_0010_nutch_dedup%2B2%253A%2Bcontent%2Bby%2Bhash>
>
> dedup 2: content by hash
>
> nutch
>
>
>
> localhost
>
> Tue Aug 16 15:59:38 GMT 2011
>
> job_201108161559_0011
> <http://192.168.1.116:50030/jobdetailshistory.jsp?jobid=job_201108161559_00
> 1
> 1&logFile=file:/nutch/search/logs/history/localhost_1313510378870_job_2011
> 08 161559_0011_nutch_dedup%2B3%253A%2Bdelete%2Bfrom%2Bindex%2528es%2529>
>
> dedup 3: delete from index(es)
>
> nutch
>
>
>
> localhost
>
> Tue Aug 16 15:59:38 GMT 2011
>
> job_201108161559_0012
> <http://192.168.1.116:50030/jobdetailshistory.jsp?jobid=job_201108161559_00
> 1
> 2&logFile=file:/nutch/search/logs/history/localhost_1313510378870_job_2011
> 08 161559_0012_nutch_inject%2Burls>
>
> inject urls
>
> nutch
>
>
>
> localhost
>
> Tue Aug 16 15:59:38 GMT 2011
>
> job_201108161559_0013
> <http://192.168.1.116:50030/jobdetailshistory.jsp?jobid=job_201108161559_00
> 1
> 3&logFile=file:/nutch/search/logs/history/localhost_1313510378870_job_2011
> 08 161559_0013_nutch_crawldb%2Bmit%252Fcrawldb>
>
> crawldb mit/crawldb
>
> nutch
>
>
>
> localhost
>
> Tue Aug 16 15:59:38 GMT 2011
>
> job_201108161559_0014
> <http://192.168.1.116:50030/jobdetailshistory.jsp?jobid=job_201108161559_00
> 1
> 4&logFile=file:/nutch/search/logs/history/localhost_1313510378870_job_2011
> 08 161559_0014_nutch_generate%253A%2Bselect%2Bfrom%2Bmit%252Fcrawldb>
>
> generate: select from mit/crawldb
>
> nutch
>
>
>
> localhost
>
> Tue Aug 16 15:59:38 GMT 2011
>
> job_201108161559_0015
> <http://192.168.1.116:50030/jobdetailshistory.jsp?jobid=job_201108161559_00
> 1
> 5&logFile=file:/nutch/search/logs/history/localhost_1313510378870_job_2011
> 08
> 161559_0015_nutch_generate%253A%2Bpartition%2Bmit%252Fsegments%252F2011081
> 61 62211>
>
> generate: partition mit/segments/20110816162211
>
> nutch
>
>
>
> localhost
>
> Tue Aug 16 15:59:38 GMT 2011
>
> job_201108161559_0016
> <http://192.168.1.116:50030/jobdetailshistory.jsp?jobid=job_201108161559_00
> 1
> 6&logFile=file:/nutch/search/logs/history/localhost_1313510378870_job_2011
> 08 161559_0016_nutch_fetch%2Bmit%252Fsegments%252F20110816162211>
>
> fetch mit/segments/20110816162211
>
> nutch
>
>
>
> localhost
>
> Tue Aug 16 15:59:38 GMT 2011
>
> job_201108161559_0017
> <http://192.168.1.116:50030/jobdetailshistory.jsp?jobid=job_201108161559_00
> 1
> 7&logFile=file:/nutch/search/logs/history/localhost_1313510378870_job_2011
> 08 161559_0017_nutch_crawldb%2Bmit%252Fcrawldb>
>
> crawldb mit/crawldb
>
> nutch
>
>
>
> localhost
>
> Tue Aug 16 15:59:38 GMT 2011
>
> job_201108161559_0018
> <http://192.168.1.116:50030/jobdetailshistory.jsp?jobid=job_201108161559_00
> 1
> 8&logFile=file:/nutch/search/logs/history/localhost_1313510378870_job_2011
> 08 161559_0018_nutch_dump%2Bmit%252Fcrawldb>
>
> dump mit/crawldb
>
> nutch
>
>
>
> master
>
> Tue Aug 16 16:43:27 GMT 2011
>
> job_201108161643_0001
> <http://192.168.1.116:50030/jobdetailshistory.jsp?jobid=job_201108161643_00
> 0
> 1&logFile=file:/nutch/search/logs/history/master_1313513007144_job_2011081
> 61 643_0001_nutch_inject%2Burls>
>
> inject urls
>
> nutch
>
>
>
>
>
>
>
> The error came from the last row: inject urls. The inject details are:
>
>
>
>
> Hadoop Job job_201108161643_0001 on History Viewer
> <http://192.168.1.116:50030/jobhistory.jsp>
>
>
> User: nutch
> JobName: inject urls
> JobConf:
> hdfs://master:9000/nutch/filesystem/mapreduce/system/job_201108161643_0001/
> j ob.xml
> <http://192.168.1.116:50030/jobconf_history.jsp?jobid=job_201108161643_0001
> &
> jobLogDir=file:/nutch/search/logs/history&jobUniqueString=master_131351300
> 71 44_job_201108161643_0001>
> Submitted At: 16-Aug-2011 16:45:11
> Launched At: 16-Aug-2011 16:45:15 (4sec)
> Finished At: 16-Aug-2011 16:46:22 (1mins, 7sec)
> Status: FAILED
> Analyse This Job
> <http://192.168.1.116:50030/analysejobhistory.jsp?jobid=job_201108161643_00
> 0
> 1&logFile=file:/nutch/search/logs/history/master_1313513007144_job_2011081
> 61 643_0001_nutch_inject%2Burls>
>
> _____
>
>
> Kind
>
> Total Tasks(successful+failed+killed)
>
> Successful tasks
>
> Failed tasks
>
> Killed tasks
>
> Start Time
>
> Finish Time
>
>
> Setup
>
> 1
> <http://192.168.1.116:50030/jobtaskshistory.jsp?jobid=job_201108161643_0001
> &
> logFile=file:/nutch/search/logs/history/master_1313513007144_job_201108161
> 64 3_0001_nutch_inject%2Burls&taskType=SETUP&status=all>
>
> 1
> <http://192.168.1.116:50030/jobtaskshistory.jsp?jobid=job_201108161643_0001
> &
> logFile=file:/nutch/search/logs/history/master_1313513007144_job_201108161
> 64 3_0001_nutch_inject%2Burls&taskType=SETUP&status=SUCCESS>
>
> 0
> <http://192.168.1.116:50030/jobtaskshistory.jsp?jobid=job_201108161643_0001
> &
> logFile=file:/nutch/search/logs/history/master_1313513007144_job_201108161
> 64 3_0001_nutch_inject%2Burls&taskType=SETUP&status=FAILED>
>
> 0
> <http://192.168.1.116:50030/jobtaskshistory.jsp?jobid=job_201108161643_0001
> &
> logFile=file:/nutch/search/logs/history/master_1313513007144_job_201108161
> 64 3_0001_nutch_inject%2Burls&taskType=SETUP&status=KILLED>
>
> 16-Aug-2011 16:45:39
>
> 16-Aug-2011 16:45:41 (1sec)
>
>
> Map
>
> 3
> <http://192.168.1.116:50030/jobtaskshistory.jsp?jobid=job_201108161643_0001
> &
> logFile=file:/nutch/search/logs/history/master_1313513007144_job_201108161
> 64 3_0001_nutch_inject%2Burls&taskType=MAP&status=all>
>
> 3
> <http://192.168.1.116:50030/jobtaskshistory.jsp?jobid=job_201108161643_0001
> &
> logFile=file:/nutch/search/logs/history/master_1313513007144_job_201108161
> 64 3_0001_nutch_inject%2Burls&taskType=MAP&status=SUCCESS>
>
> 0
> <http://192.168.1.116:50030/jobtaskshistory.jsp?jobid=job_201108161643_0001
> &
> logFile=file:/nutch/search/logs/history/master_1313513007144_job_201108161
> 64 3_0001_nutch_inject%2Burls&taskType=MAP&status=FAILED>
>
> 0
> <http://192.168.1.116:50030/jobtaskshistory.jsp?jobid=job_201108161643_0001
> &
> logFile=file:/nutch/search/logs/history/master_1313513007144_job_201108161
> 64 3_0001_nutch_inject%2Burls&taskType=MAP&status=KILLED>
>
> 16-Aug-2011 16:45:42
>
> 16-Aug-2011 16:46:17 (34sec)
>
>
> Reduce
>
> 8
> <http://192.168.1.116:50030/jobtaskshistory.jsp?jobid=job_201108161643_0001
> &
> logFile=file:/nutch/search/logs/history/master_1313513007144_job_201108161
> 64 3_0001_nutch_inject%2Burls&taskType=REDUCE&status=all>
>
> 0
> <http://192.168.1.116:50030/jobtaskshistory.jsp?jobid=job_201108161643_0001
> &
> logFile=file:/nutch/search/logs/history/master_1313513007144_job_201108161
> 64 3_0001_nutch_inject%2Burls&taskType=REDUCE&status=SUCCESS>
>
> 8
> <http://192.168.1.116:50030/jobtaskshistory.jsp?jobid=job_201108161643_0001
> &
> logFile=file:/nutch/search/logs/history/master_1313513007144_job_201108161
> 64 3_0001_nutch_inject%2Burls&taskType=REDUCE&status=FAILED>
>
> 0
> <http://192.168.1.116:50030/jobtaskshistory.jsp?jobid=job_201108161643_0001
> &
> logFile=file:/nutch/search/logs/history/master_1313513007144_job_201108161
> 64 3_0001_nutch_inject%2Burls&taskType=REDUCE&status=KILLED>
>
> 16-Aug-2011 16:45:58
>
> 16-Aug-2011 16:46:36 (37sec)
>
>
> Cleanup
>
> 1
> <http://192.168.1.116:50030/jobtaskshistory.jsp?jobid=job_201108161643_0001
> &
> logFile=file:/nutch/search/logs/history/master_1313513007144_job_201108161
> 64 3_0001_nutch_inject%2Burls&taskType=CLEANUP&status=all>
>
> 1
> <http://192.168.1.116:50030/jobtaskshistory.jsp?jobid=job_201108161643_0001
> &
> logFile=file:/nutch/search/logs/history/master_1313513007144_job_201108161
> 64 3_0001_nutch_inject%2Burls&taskType=CLEANUP&status=SUCCESS>
>
> 0
> <http://192.168.1.116:50030/jobtaskshistory.jsp?jobid=job_201108161643_0001
> &
> logFile=file:/nutch/search/logs/history/master_1313513007144_job_201108161
> 64 3_0001_nutch_inject%2Burls&taskType=CLEANUP&status=FAILED>
>
> 0
> <http://192.168.1.116:50030/jobtaskshistory.jsp?jobid=job_201108161643_0001
> &
> logFile=file:/nutch/search/logs/history/master_1313513007144_job_201108161
> 64 3_0001_nutch_inject%2Burls&taskType=CLEANUP&status=KILLED>
>
> 16-Aug-2011 16:46:37
>
> 16-Aug-2011 16:46:39 (1sec)
>
>
>
>
> Failed tasks attempts by nodes
>
>
> Hostname
>
> Failed Tasks
>
>
> slave_1
>
> task_201108161643_0001_r_000000
> <http://192.168.1.116:50030/taskdetailshistory.jsp?jobid=job_201108161643_0
> 0
> 01&logFile=file:/nutch/search/logs/history/master_1313513007144_job_201108
> 16 1643_0001_nutch_inject%2Burls&taskid=task_201108161643_0001_r_000000> ,
> task_201108161643_0001_r_000001
> <http://192.168.1.116:50030/taskdetailshistory.jsp?jobid=job_201108161643_0
> 0
> 01&logFile=file:/nutch/search/logs/history/master_1313513007144_job_201108
> 16 1643_0001_nutch_inject%2Burls&taskid=task_201108161643_0001_r_000001> ,
>
>
> slave_2
>
> task_201108161643_0001_r_000000
> <http://192.168.1.116:50030/taskdetailshistory.jsp?jobid=job_201108161643_0
> 0
> 01&logFile=file:/nutch/search/logs/history/master_1313513007144_job_201108
> 16 1643_0001_nutch_inject%2Burls&taskid=task_201108161643_0001_r_000000> ,
> task_201108161643_0001_r_000001
> <http://192.168.1.116:50030/taskdetailshistory.jsp?jobid=job_201108161643_0
> 0
> 01&logFile=file:/nutch/search/logs/history/master_1313513007144_job_201108
> 16 1643_0001_nutch_inject%2Burls&taskid=task_201108161643_0001_r_000001> ,
>
>
>
>
>
> The eight failed tasks are:
>
>
>
>
> FAILED REDUCE task list for job_201108161643_0001
> <http://192.168.1.116:50030/jobdetailshistory.jsp?jobid=job_201108161643_00
> 0
> 1&&logFile=file:/nutch/search/logs/history/master_1313513007144_job_201108
> 16 1643_0001_nutch_inject%2Burls>
>
>
> Task Id
>
> Start Time
>
> Finish Time
>
> Error
>
>
> task_201108161643_0001_r_000000
> <http://192.168.1.116:50030/taskdetailshistory.jsp?jobid=job_201108161643_0
> 0
> 01&logFile=file:/nutch/search/logs/history/master_1313513007144_job_201108
> 16 1643_0001_nutch_inject%2Burls&taskid=task_201108161643_0001_r_000000>
>
> 16/08 16:45:58
>
> 16/08 16:46:06 (7sec)
>
> Error: java.lang.NullPointerException at
> java.util.concurrent.ConcurrentHashMap.get(ConcurrentHashMap.java:922) at
> org.apache.hadoop.mapred.ReduceTask$ReduceCopier$GetMapEventsThread.getMapC
> o mpletionEvents(ReduceTask.java:2683) at
> org.apache.hadoop.mapred.ReduceTask$ReduceCopier$GetMapEventsThread.run(Red
> u ceTask.java:2605)
>
>
> task_201108161643_0001_r_000000
> <http://192.168.1.116:50030/taskdetailshistory.jsp?jobid=job_201108161643_0
> 0
> 01&logFile=file:/nutch/search/logs/history/master_1313513007144_job_201108
> 16 1643_0001_nutch_inject%2Burls&taskid=task_201108161643_0001_r_000000>
>
> 16/08 16:46:19
>
> 16/08 16:46:24 (4sec)
>
> Error: java.lang.NullPointerException at
> java.util.concurrent.ConcurrentHashMap.get(ConcurrentHashMap.java:922) at
> org.apache.hadoop.mapred.ReduceTask$ReduceCopier$GetMapEventsThread.getMapC
> o mpletionEvents(ReduceTask.java:2683) at
> org.apache.hadoop.mapred.ReduceTask$ReduceCopier$GetMapEventsThread.run(Red
> u ceTask.java:2605)
>
>
> task_201108161643_0001_r_000000
> <http://192.168.1.116:50030/taskdetailshistory.jsp?jobid=job_201108161643_0
> 0
> 01&logFile=file:/nutch/search/logs/history/master_1313513007144_job_201108
> 16 1643_0001_nutch_inject%2Burls&taskid=task_201108161643_0001_r_000000>
>
> 16/08 16:46:25
>
> 16/08 16:46:30 (4sec)
>
> Error: java.lang.NullPointerException at
> java.util.concurrent.ConcurrentHashMap.get(ConcurrentHashMap.java:922) at
> org.apache.hadoop.mapred.ReduceTask$ReduceCopier$GetMapEventsThread.getMapC
> o mpletionEvents(ReduceTask.java:2683) at
> org.apache.hadoop.mapred.ReduceTask$ReduceCopier$GetMapEventsThread.run(Red
> u ceTask.java:2605)
>
>
> task_201108161643_0001_r_000000
> <http://192.168.1.116:50030/taskdetailshistory.jsp?jobid=job_201108161643_0
> 0
> 01&logFile=file:/nutch/search/logs/history/master_1313513007144_job_201108
> 16 1643_0001_nutch_inject%2Burls&taskid=task_201108161643_0001_r_000000>
>
> 16/08 16:46:31
>
> 16/08 16:46:36 (4sec)
>
> Error: java.lang.NullPointerException at
> java.util.concurrent.ConcurrentHashMap.get(ConcurrentHashMap.java:922) at
> org.apache.hadoop.mapred.ReduceTask$ReduceCopier$GetMapEventsThread.getMapC
> o mpletionEvents(ReduceTask.java:2683) at
> org.apache.hadoop.mapred.ReduceTask$ReduceCopier$GetMapEventsThread.run(Red
> u ceTask.java:2605)
>
>
> task_201108161643_0001_r_000001
> <http://192.168.1.116:50030/taskdetailshistory.jsp?jobid=job_201108161643_0
> 0
> 01&logFile=file:/nutch/search/logs/history/master_1313513007144_job_201108
> 16 1643_0001_nutch_inject%2Burls&taskid=task_201108161643_0001_r_000001>
>
> 16/08 16:46:08
>
> 16/08 16:46:16 (7sec)
>
> Error: java.lang.NullPointerException at
> java.util.concurrent.ConcurrentHashMap.get(ConcurrentHashMap.java:922) at
> org.apache.hadoop.mapred.ReduceTask$ReduceCopier$GetMapEventsThread.getMapC
> o mpletionEvents(ReduceTask.java:2683) at
> org.apache.hadoop.mapred.ReduceTask$ReduceCopier$GetMapEventsThread.run(Red
> u ceTask.java:2605)
>
>
> task_201108161643_0001_r_000001
> <http://192.168.1.116:50030/taskdetailshistory.jsp?jobid=job_201108161643_0
> 0
> 01&logFile=file:/nutch/search/logs/history/master_1313513007144_job_201108
> 16 1643_0001_nutch_inject%2Burls&taskid=task_201108161643_0001_r_000001>
>
> 16/08 16:46:19
>
> 16/08 16:46:24 (5sec)
>
> Error: java.lang.NullPointerException at
> java.util.concurrent.ConcurrentHashMap.get(ConcurrentHashMap.java:922) at
> org.apache.hadoop.mapred.ReduceTask$ReduceCopier$GetMapEventsThread.getMapC
> o mpletionEvents(ReduceTask.java:2683) at
> org.apache.hadoop.mapred.ReduceTask$ReduceCopier$GetMapEventsThread.run(Red
> u ceTask.java:2605)
>
>
> task_201108161643_0001_r_000001
> <http://192.168.1.116:50030/taskdetailshistory.jsp?jobid=job_201108161643_0
> 0
> 01&logFile=file:/nutch/search/logs/history/master_1313513007144_job_201108
> 16 1643_0001_nutch_inject%2Burls&taskid=task_201108161643_0001_r_000001>
>
> 16/08 16:46:25
>
> 16/08 16:46:30 (4sec)
>
> Error: java.lang.NullPointerException at
> java.util.concurrent.ConcurrentHashMap.get(ConcurrentHashMap.java:922) at
> org.apache.hadoop.mapred.ReduceTask$ReduceCopier$GetMapEventsThread.getMapC
> o mpletionEvents(ReduceTask.java:2683) at
> org.apache.hadoop.mapred.ReduceTask$ReduceCopier$GetMapEventsThread.run(Red
> u ceTask.java:2605)
>
>
> task_201108161643_0001_r_000001
> <http://192.168.1.116:50030/taskdetailshistory.jsp?jobid=job_201108161643_0
> 0
> 01&logFile=file:/nutch/search/logs/history/master_1313513007144_job_201108
> 16 1643_0001_nutch_inject%2Burls&taskid=task_201108161643_0001_r_000001>
>
> 16/08 16:46:31
>
> 16/08 16:46:35 (4sec)
>
> Error: java.lang.NullPointerException at
> java.util.concurrent.ConcurrentHashMap.get(ConcurrentHashMap.java:922) at
> org.apache.hadoop.mapred.ReduceTask$ReduceCopier$GetMapEventsThread.getMapC
> o mpletionEvents(ReduceTask.java:2683) at
> org.apache.hadoop.mapred.ReduceTask$ReduceCopier$GetMapEventsThread.run(Red
> u ceTask.java:2605)
--
Markus Jelsma - CTO - Openindex
http://www.linkedin.com/in/markus17
050-8536620 / 06-50258350