You are viewing a plain text version of this content. The canonical link for it is here.

Posted to user@nutch.apache.org by Jason Culverhouse <ja...@mischievous.org> on 2007/02/02 01:27:40 UTC

Nutch 0.9-dev trunk generate task failing/not completing

I have a 2 node testing cluster, I can  no longer run the generate  
task with out a log filled with errors (after about 19 successful  
iterations)

running
./bin/nutch generate crawl/crawldb testsegments -topN 10

generates the following
----
2007-02-01 15:28:37,510 WARN  fs.FSNamesystem - Replication requested  
of 10 is larger than cluster size (2). Using cluster size.
2007-02-01 15:28:37,510 DEBUG dfs.StateChange - DIR*  
NameSystem.startFile: add /home/nutch/nutch/filesystem/mapred/system/ 
submit_l03eux/.job.xml.crc to pendingCreates for DFSClient_-980095089
2007-02-01 15:28:37,511 DEBUG dfs.StateChange - BLOCK*  
NameSystem.allocateBlock: /home/nutch/nutch/filesystem/mapred/system/ 
submit_l03eux/.job.xml.crc. blk_4008532917963069756 is created and  
added to pendingCreates and pendingCreateBlocks
2007-02-01 15:28:37,594 DEBUG dfs.DataNode - Number of active  
connections is: 1
2007-02-01 15:28:37,627 INFO  dfs.DataNode - Received block  
blk_4008532917963069756 from /192.168.10.95

----
After this the log is filled with endless groups of 8 fs.FSNamesystem  
- Could not find any nodes with sufficient capacity
(10 replicas requested - 2 slaves?)

This seems to put the server in a loop of eating up all the CPU as  
the tasks run leaving very little for the actual jobs
----

2007-02-01 15:28:40,786 INFO  mapred.JobClient - Running job: job_0001
2007-02-01 15:28:41,328 WARN  fs.FSNamesystem - Could not find any  
nodes with sufficient capacity
2007-02-01 15:28:41,328 WARN  fs.FSNamesystem - Could not find any  
nodes with sufficient capacity
2007-02-01 15:28:41,328 WARN  fs.FSNamesystem - Could not find any  
nodes with sufficient capacity
2007-02-01 15:28:41,328 WARN  fs.FSNamesystem - Could not find any  
nodes with sufficient capacity
2007-02-01 15:28:41,678 WARN  fs.FSNamesystem - Could not find any  
nodes with sufficient capacity
2007-02-01 15:28:41,678 WARN  fs.FSNamesystem - Could not find any  
nodes with sufficient capacity
2007-02-01 15:28:41,678 WARN  fs.FSNamesystem - Could not find any  
nodes with sufficient capacity
2007-02-01 15:28:41,678 WARN  fs.FSNamesystem - Could not find any  
nodes with sufficient capacity
2007-02-01 15:28:41,794 INFO  mapred.JobClient -  map 0% reduce 0%
2007-02-01 15:28:44,320 WARN  fs.FSNamesystem - Could not find any  
nodes with sufficient capacity
2007-02-01 15:28:44,320 WARN  fs.FSNamesystem - Could not find any  
nodes with sufficient capacity
2007-02-01 15:28:44,320 WARN  fs.FSNamesystem - Could not find any  
nodes with sufficient capacity
2007-02-01 15:28:44,320 WARN  fs.FSNamesystem - Could not find any  
nodes with sufficient capacity
2007-02-01 15:28:44,699 WARN  fs.FSNamesystem - Could not find any  
nodes with sufficient capacity
2007-02-01 15:28:44,699 WARN  fs.FSNamesystem - Could not find any  
nodes with sufficient capacity
2007-02-01 15:28:44,699 WARN  fs.FSNamesystem - Could not find any  
nodes with sufficient capacity
2007-02-01 15:28:44,699 WARN  fs.FSNamesystem - Could not find any  
nodes with sufficient capacity


Can anyone provide any help with this problem?

These continue even after I kill the generate task

Jason

Re: Nutch 0.9-dev trunk generate task failing/not completing

Posted by Jason Culverhouse <ja...@mischievous.org>.

I have the dfs.replication value set to 2 in my hadoop-site.xml
Note that in hadoop-defaults.xml the value is 3, so it doesn't  
explain the 10 requested that I see in the logs.
again, this only happens on the generate task.

The generate eventually finishes ~30 to 40 minutes to select out 5000  
URL's since it's competing with the loop that generates these messages.

I noticed that the hadoop.log  rolled this morning and the logging of
fs.FSNamesystem - Could not find any nodes with sufficient capacity

seems to have stopped.

Jason

On Feb 2, 2007, at 5:49 AM, djames wrote:

>
> Hello
>
> I'm not sure, but there is a parameter in hadoop-site.xml conf file  
> that
> could be a solution to your probleme:
>
>
> <property>
>   <name>dfs.replication</name>
>   <value>2</value>
> </property>
>

Re: Nutch 0.9-dev trunk generate task failing/not completing

Posted by djames <dj...@supinfo.com>.

Hello

I'm not sure, but there is a parameter in hadoop-site.xml conf file that
could be a solution to your probleme:


<property>
  <name>dfs.replication</name>
  <value>2</value>
</property>

you can find the explication for this parameter in the nutchhadoop tutorial:
The dfs.replication property states how many servers a single file should be
replicated to before it becomes available. Because we are using only a
single server for right now we have this at 1. If you set this value higher
than the number of data nodes that you have available then you will start
seeing alot of (Zero targets found, forbidden1.size=1) type errors in the
logs. We will increase this value as we add more nodes.
-- 
View this message in context: http://www.nabble.com/Nutch-0.9-dev-trunk-generate-task-failing-not-completing-tf3158347.html#a8767104
Sent from the Nutch - User mailing list archive at Nabble.com.

Re: Nutch 0.9-dev trunk generate task failing/not completing

Posted by Reddeppa Naidu <pa...@gmail.com>.

hi
i am also getting same problem.
let me know if any body explains.

Thanks In advance



On 2/2/07, Jason Culverhouse <ja...@mischievous.org> wrote:
>
> I have a 2 node testing cluster, I can  no longer run the generate
> task with out a log filled with errors (after about 19 successful
> iterations)
>
> running
> ./bin/nutch generate crawl/crawldb testsegments -topN 10
>
> generates the following
> ----
> 2007-02-01 15:28:37,510 WARN  fs.FSNamesystem - Replication requested
> of 10 is larger than cluster size (2). Using cluster size.
> 2007-02-01 15:28:37,510 DEBUG dfs.StateChange - DIR*
> NameSystem.startFile: add /home/nutch/nutch/filesystem/mapred/system/
> submit_l03eux/.job.xml.crc to pendingCreates for DFSClient_-980095089
> 2007-02-01 15:28:37,511 DEBUG dfs.StateChange - BLOCK*
> NameSystem.allocateBlock: /home/nutch/nutch/filesystem/mapred/system/
> submit_l03eux/.job.xml.crc. blk_4008532917963069756 is created and
> added to pendingCreates and pendingCreateBlocks
> 2007-02-01 15:28:37,594 DEBUG dfs.DataNode - Number of active
> connections is: 1
> 2007-02-01 15:28:37,627 INFO  dfs.DataNode - Received block
> blk_4008532917963069756 from /192.168.10.95
>
> ----
> After this the log is filled with endless groups of 8 fs.FSNamesystem
> - Could not find any nodes with sufficient capacity
> (10 replicas requested - 2 slaves?)
>
> This seems to put the server in a loop of eating up all the CPU as
> the tasks run leaving very little for the actual jobs
> ----
>
> 2007-02-01 15:28:40,786 INFO  mapred.JobClient - Running job: job_0001
> 2007-02-01 15:28:41,328 WARN  fs.FSNamesystem - Could not find any
> nodes with sufficient capacity
> 2007-02-01 15:28:41,328 WARN  fs.FSNamesystem - Could not find any
> nodes with sufficient capacity
> 2007-02-01 15:28:41,328 WARN  fs.FSNamesystem - Could not find any
> nodes with sufficient capacity
> 2007-02-01 15:28:41,328 WARN  fs.FSNamesystem - Could not find any
> nodes with sufficient capacity
> 2007-02-01 15:28:41,678 WARN  fs.FSNamesystem - Could not find any
> nodes with sufficient capacity
> 2007-02-01 15:28:41,678 WARN  fs.FSNamesystem - Could not find any
> nodes with sufficient capacity
> 2007-02-01 15:28:41,678 WARN  fs.FSNamesystem - Could not find any
> nodes with sufficient capacity
> 2007-02-01 15:28:41,678 WARN  fs.FSNamesystem - Could not find any
> nodes with sufficient capacity
> 2007-02-01 15:28:41,794 INFO  mapred.JobClient -  map 0% reduce 0%
> 2007-02-01 15:28:44,320 WARN  fs.FSNamesystem - Could not find any
> nodes with sufficient capacity
> 2007-02-01 15:28:44,320 WARN  fs.FSNamesystem - Could not find any
> nodes with sufficient capacity
> 2007-02-01 15:28:44,320 WARN  fs.FSNamesystem - Could not find any
> nodes with sufficient capacity
> 2007-02-01 15:28:44,320 WARN  fs.FSNamesystem - Could not find any
> nodes with sufficient capacity
> 2007-02-01 15:28:44,699 WARN  fs.FSNamesystem - Could not find any
> nodes with sufficient capacity
> 2007-02-01 15:28:44,699 WARN  fs.FSNamesystem - Could not find any
> nodes with sufficient capacity
> 2007-02-01 15:28:44,699 WARN  fs.FSNamesystem - Could not find any
> nodes with sufficient capacity
> 2007-02-01 15:28:44,699 WARN  fs.FSNamesystem - Could not find any
> nodes with sufficient capacity
>
>
> Can anyone provide any help with this problem?
>
> These continue even after I kill the generate task
>
> Jason
>
>

RE: Nutch 0.9-dev trunk generate task failing/not completing

Posted by Gal Nitzan <gn...@usa.net>.

Hi,

The messages you show are taken from the namenode logs and it seems like the
settings of the replication is too high.

Since you are using 2 machines the reasonable replication for dfs block is
two.

So you need to add the following property to your conf/hadoop-site.xml

<property>
  <name>dfs.replication</name>
  <value>2</value>
</property>

This usually got nothing to do with the generate so please specify what is
the message you get from the generate process.

HTH,

Gal

-----Original Message-----
From: Jason Culverhouse [mailto:jason@mischievous.org] 
Sent: Friday, February 02, 2007 2:28 AM
To: nutch-user@lucene.apache.org
Subject: Nutch 0.9-dev trunk generate task failing/not completing

I have a 2 node testing cluster, I can  no longer run the generate  
task with out a log filled with errors (after about 19 successful  
iterations)

running
./bin/nutch generate crawl/crawldb testsegments -topN 10

generates the following
----
2007-02-01 15:28:37,510 WARN  fs.FSNamesystem - Replication requested  
of 10 is larger than cluster size (2). Using cluster size.
2007-02-01 15:28:37,510 DEBUG dfs.StateChange - DIR*  
NameSystem.startFile: add /home/nutch/nutch/filesystem/mapred/system/ 
submit_l03eux/.job.xml.crc to pendingCreates for DFSClient_-980095089
2007-02-01 15:28:37,511 DEBUG dfs.StateChange - BLOCK*  
NameSystem.allocateBlock: /home/nutch/nutch/filesystem/mapred/system/ 
submit_l03eux/.job.xml.crc. blk_4008532917963069756 is created and  
added to pendingCreates and pendingCreateBlocks
2007-02-01 15:28:37,594 DEBUG dfs.DataNode - Number of active  
connections is: 1
2007-02-01 15:28:37,627 INFO  dfs.DataNode - Received block  
blk_4008532917963069756 from /192.168.10.95

----
After this the log is filled with endless groups of 8 fs.FSNamesystem  
- Could not find any nodes with sufficient capacity
(10 replicas requested - 2 slaves?)

This seems to put the server in a loop of eating up all the CPU as  
the tasks run leaving very little for the actual jobs
----

2007-02-01 15:28:40,786 INFO  mapred.JobClient - Running job: job_0001
2007-02-01 15:28:41,328 WARN  fs.FSNamesystem - Could not find any  
nodes with sufficient capacity
2007-02-01 15:28:41,328 WARN  fs.FSNamesystem - Could not find any  
nodes with sufficient capacity
2007-02-01 15:28:41,328 WARN  fs.FSNamesystem - Could not find any  
nodes with sufficient capacity
2007-02-01 15:28:41,328 WARN  fs.FSNamesystem - Could not find any  
nodes with sufficient capacity
2007-02-01 15:28:41,678 WARN  fs.FSNamesystem - Could not find any  
nodes with sufficient capacity
2007-02-01 15:28:41,678 WARN  fs.FSNamesystem - Could not find any  
nodes with sufficient capacity
2007-02-01 15:28:41,678 WARN  fs.FSNamesystem - Could not find any  
nodes with sufficient capacity
2007-02-01 15:28:41,678 WARN  fs.FSNamesystem - Could not find any  
nodes with sufficient capacity
2007-02-01 15:28:41,794 INFO  mapred.JobClient -  map 0% reduce 0%
2007-02-01 15:28:44,320 WARN  fs.FSNamesystem - Could not find any  
nodes with sufficient capacity
2007-02-01 15:28:44,320 WARN  fs.FSNamesystem - Could not find any  
nodes with sufficient capacity
2007-02-01 15:28:44,320 WARN  fs.FSNamesystem - Could not find any  
nodes with sufficient capacity
2007-02-01 15:28:44,320 WARN  fs.FSNamesystem - Could not find any  
nodes with sufficient capacity
2007-02-01 15:28:44,699 WARN  fs.FSNamesystem - Could not find any  
nodes with sufficient capacity
2007-02-01 15:28:44,699 WARN  fs.FSNamesystem - Could not find any  
nodes with sufficient capacity
2007-02-01 15:28:44,699 WARN  fs.FSNamesystem - Could not find any  
nodes with sufficient capacity
2007-02-01 15:28:44,699 WARN  fs.FSNamesystem - Could not find any  
nodes with sufficient capacity


Can anyone provide any help with this problem?

These continue even after I kill the generate task

Jason