Posted to user@nutch.apache.org by MING-Yuan JIANG <cn...@gmail.com> on 2009/12/17 09:00:28 UTC
Converting ARC files to segments with ArcSegmentCreator runs very slowly
Hi everyone,
When I convert ARC files to segments with the ArcSegmentCreator tool, I find
that it runs very slowly.
I modified the following settings in my hadoop-site.xml:
dfs.block.size 50MB
mapred.reduce.tasks 160
mapred.tasktracker.map.tasks.maximum 8
mapred.tasktracker.reduce.tasks.maximum 8
I have 2 tasktrackers, and each node has 2 cores.
The data in HDFS consists of 100 ARC files of 100MB each.
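To put those numbers together, here is a rough back-of-the-envelope sketch in Python. It assumes one map task per 50MB block, which may not hold if the ARC input is not split at block boundaries; the variable names are only illustrative and are not Hadoop or Nutch identifiers.

```python
import math

# Values taken from the settings and cluster description above.
nodes = 2
cores_per_node = 2
map_slots_per_node = 8       # mapred.tasktracker.map.tasks.maximum
reduce_slots_per_node = 8    # mapred.tasktracker.reduce.tasks.maximum
reduce_tasks = 160           # mapred.reduce.tasks
files = 100
file_mb = 100
block_mb = 50                # dfs.block.size

total_cores = nodes * cores_per_node
map_slots = nodes * map_slots_per_node
reduce_slots = nodes * reduce_slots_per_node

# Assumption: each file yields one map task per block.
map_tasks = files * math.ceil(file_mb / block_mb)

# Number of rounds needed if every slot runs one task at a time.
map_waves = math.ceil(map_tasks / map_slots)
reduce_waves = math.ceil(reduce_tasks / reduce_slots)

# Concurrent map tasks allowed per physical core.
map_slots_per_core = map_slots / total_cores

print("total cores:", total_cores)
print("map slots:", map_slots, "reduce slots:", reduce_slots)
print("map tasks (assumed):", map_tasks, "in", map_waves, "waves")
print("reduce tasks:", reduce_tasks, "in", reduce_waves, "waves")
print("map slots per core:", map_slots_per_core)
```

Under that assumption, the cluster allows 16 concurrent map tasks and 16 concurrent reduce tasks on only 4 cores, and the 160 reduce tasks need 10 rounds to finish.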
How should I set the parameters above to make the job run faster?
Thanks!