Posted to common-user@hadoop.apache.org by Tonci Buljan <to...@gmail.com> on 2010/07/09 11:32:30 UTC

Terasort problem

Hello everyone,

I have a cluster of 8 datanodes and a namenode.
When I run the teragen program everything works OK and the data is generated. But
when I run the terasort program, it seems that only 2 datanodes do the work,
and everything is very slow. I tried with only 10 records and the cluster
finished the sort in a few seconds, but with a larger number it gets stuck at around 15%.

Do you have any idea why this is so?

I'm using Hadoop 0.20.2 and Ubuntu 8.10.


Thank you.

Re: Terasort problem

Posted by Owen O'Malley <om...@apache.org>.

On Jul 10, 2010, at 4:29 AM, Tonci Buljan wrote:

> mapred.tasktracker.reduce.tasks.maximum <- Is this configured on every
> datanode separately? What number shall I put here?
>
> mapred.tasktracker.map.tasks.maximum <- same question as
> mapred.tasktracker.reduce.tasks.maximum

Generally, RAM is the scarce resource. Decide how you want to divide
your worker's RAM between tasks. So with 6 GB of RAM, I'd probably
make 4 map slots of 0.75 GB each and 2 reduce slots of 1.5 GB each.
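
The arithmetic behind that split can be checked with a short sketch. The function name is made up for illustration, and the sketch assumes no memory is set aside for the DataNode and TaskTracker daemons or the OS, which a real worker would also need.

```python
def slot_memory_gb(map_slots, map_gb, reduce_slots, reduce_gb):
    """Total RAM (in GB) committed to task slots on one worker.

    Hypothetical helper: it only adds up the per-slot budgets and
    ignores daemon and operating-system overhead.
    """
    return map_slots * map_gb + reduce_slots * reduce_gb


# Figures from the suggestion above:
# 4 map slots of 0.75 GB each and 2 reduce slots of 1.5 GB each.
total = slot_memory_gb(map_slots=4, map_gb=0.75, reduce_slots=2, reduce_gb=1.5)
print(f"RAM committed to task slots: {total} GB")
```

Changing the slot counts or per-slot sizes shows quickly whether a different split still fits in the worker's RAM.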

> mapred.reduce.tasks <- Is this configured ONLY on Namenode and what value
> should it have for my 8 node cluster?

You should set it to your reduce task capacity of 2 * 8 = 16.
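
As a rough sketch of that calculation and of one way the value might be passed to the job: the helper names and the HDFS paths below are hypothetical, the examples jar path is left as a placeholder, and the sketch assumes terasort accepts Hadoop's generic `-D property=value` option.

```python
def reduce_capacity(nodes, reduce_slots_per_node):
    """Cluster-wide reduce capacity: reduce slots per worker times workers."""
    return nodes * reduce_slots_per_node


def terasort_command(examples_jar, reducers, input_dir, output_dir):
    """Build the argument list for a terasort run (hypothetical helper).

    Assumes the generic -D option is accepted right after the program
    name, before the input and output directories.
    """
    return [
        "hadoop", "jar", examples_jar, "terasort",
        "-D", f"mapred.reduce.tasks={reducers}",
        input_dir, output_dir,
    ]


reducers = reduce_capacity(nodes=8, reduce_slots_per_node=2)
# The jar path and the HDFS directories are placeholders, not real locations.
cmd = terasort_command("<path-to-examples-jar>", reducers,
                       "/terasort/input", "/terasort/output")
print(" ".join(cmd))
```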

> mapred.map.tasks <- same question as mapred.reduce.tasks

It matters less, but go ahead and set it to the map capacity of 4 * 8
= 32. More important is to set your JVM and buffer sizes for the tasks.
You also want to set your HDFS block size to be 0.5 GB to 2 GB. That will
make your map inputs the right size.
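
To make the block-size advice concrete, here is a small sketch that converts the suggested sizes to bytes (the unit the block size setting, `dfs.block.size` in this Hadoop line as far as I know, is expressed in) and estimates how many map inputs a given data set would produce. The helper names and the 10^12-byte input size are assumptions for illustration, and the estimate treats the input as one contiguous file, which real multi-file input only approximates.

```python
GIB = 1024 ** 3  # bytes in one binary gigabyte


def block_size_bytes(size_gib):
    """Convert a block size given in GiB to bytes."""
    return int(size_gib * GIB)


def estimated_map_inputs(input_bytes, block_bytes):
    """Rough number of block-sized map inputs (ceiling division)."""
    return (input_bytes + block_bytes - 1) // block_bytes


input_bytes = 10 ** 12  # assumed input size for illustration
for size_gib in (0.5, 2):
    block = block_size_bytes(size_gib)
    maps = estimated_map_inputs(input_bytes, block)
    print(f"{size_gib} GiB block = {block} bytes -> about {maps} map inputs")
```

The block size takes effect when a file is written, so the input data would have to be generated (or rewritten) with the new setting for it to change the map input sizes.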

-- Owen

Re: Terasort problem

Posted by Ted Yu <yu...@gmail.com>.

mapred.tasktracker.reduce.tasks.maximum and
mapred.tasktracker.map.tasks.maximum are configured in mapred-site.xml.
They're cluster-wide.

Hadoop would sync configuration from the name node to the data nodes upon startup,
so you don't need to configure each datanode individually.
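
For reference, a minimal sketch of how those two properties might look in mapred-site.xml, generated here with Python's standard `xml.etree.ElementTree`. The helper name is hypothetical, and the values 4 and 2 are taken from Owen's suggestion elsewhere in this thread purely as an illustration.

```python
import xml.etree.ElementTree as ET


def render_site_config(properties):
    """Render a dict of property names and values as Hadoop-style
    <configuration><property>...</property></configuration> XML."""
    root = ET.Element("configuration")
    for name, value in properties.items():
        prop = ET.SubElement(root, "property")
        ET.SubElement(prop, "name").text = name
        ET.SubElement(prop, "value").text = str(value)
    return ET.tostring(root, encoding="unicode")


# Illustrative values only: 4 map slots and 2 reduce slots per worker.
xml_text = render_site_config({
    "mapred.tasktracker.map.tasks.maximum": 4,
    "mapred.tasktracker.reduce.tasks.maximum": 2,
})
print(xml_text)
```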

The "Too many fetch-failures..." error has appeared in previous discussions, and I
don't see a definitive cause in them.

On Sat, Jul 10, 2010 at 4:29 AM, Tonci Buljan <to...@gmail.com> wrote:

> Thank you for your response, Owen. It is true, I hadn't done that; I figured
> that out a few hours after posting here.
>
> I'm having problems with understanding these variables:
>
> mapred.tasktracker.reduce.tasks.maximum <- Is this configured on every
> datanode separately? What number shall I put here?
>
> mapred.tasktracker.map.tasks.maximum <- same question as
> mapred.tasktracker.reduce.tasks.maximum
>
> mapred.reduce.tasks <- Is this configured ONLY on Namenode and what value
> should it have for my 8 node cluster?
>
> mapred.map.tasks <- same question as mapred.reduce.tasks
>
>
> I've tried playing with these variables, but I keep getting the error "Too many
> fetch-failures..."
>
> Please let me know if anyone has an idea how to set this up the right way.
>
> Thank you.
>
> On 9 July 2010 15:33, Owen O'Malley <om...@apache.org> wrote:
>
> > I would guess that you didn't set the number of reducers for the job,
> > and it defaulted to 2.
> >
> > -- Owen
> >
>

Re: Terasort problem

Posted by Tonci Buljan <to...@gmail.com>.

Thank you for your response, Owen. It is true, I hadn't done that; I figured
that out a few hours after posting here.

I'm having problems with understanding these variables:

mapred.tasktracker.reduce.tasks.maximum <- Is this configured on every
datanode separately? What number shall I put here?

mapred.tasktracker.map.tasks.maximum <- same question as
mapred.tasktracker.reduce.tasks.maximum

mapred.reduce.tasks <- Is this configured ONLY on Namenode and what value
should it have for my 8 node cluster?

mapred.map.tasks <- same question as mapred.reduce.tasks

I've tried playing with these variables, but I keep getting the error "Too many
fetch-failures..."

Please let me know if anyone has an idea how to set this up the right way.

Thank you.

On 9 July 2010 15:33, Owen O'Malley <om...@apache.org> wrote:

> I would guess that you didn't set the number of reducers for the job,
> and it defaulted to 2.
>
> -- Owen
>

Re: Terasort problem

Posted by Owen O'Malley <om...@apache.org>.

I would guess that you didn't set the number of reducers for the job,
and it defaulted to 2.

-- Owen