You are viewing a plain text version of this content. The canonical link for it is here.
Posted to common-user@hadoop.apache.org by "Bible, Landy" <la...@utulsa.edu> on 2012/01/04 19:03:37 UTC

Balancer exiting immediately despite having work to do.

Hi all,

I'm running Hadoop 0.20.2.  The balancer has suddenly stopped working.  I'm attempting to balance the cluster with a threshold of 1, using the following command:

./hadoop balancer -threshold 1

This has been working fine, but suddenly it isn't.  It skips though 5 iterations without actually doing any work:

Time Stamp               Iteration#  Bytes Already Moved  Bytes Left To Move  Bytes Being Moved
Jan 4, 2012 11:47:56 AM           0                 0 KB             1.87 GB            6.68 GB
Jan 4, 2012 11:47:56 AM           1                 0 KB             1.87 GB            6.68 GB
Jan 4, 2012 11:47:56 AM           2                 0 KB             1.87 GB            6.68 GB
Jan 4, 2012 11:47:57 AM           3                 0 KB             1.87 GB            6.68 GB
Jan 4, 2012 11:47:57 AM           4                 0 KB             1.87 GB            6.68 GB
No block has been moved for 5 iterations. Exiting...
Balancing took 524.0 milliseconds

I've attached the full log, but I can't see any errors indicating why it is failing.  Any ideas?  I'd really like to get balancing working again.  My use case isn't the norm, and it is important that the cluster stay as close to completely balanced as possible.

--
Landy Bible

Simulation and Computer Specialist
School of Nursing - Collins College of Business
The University of Tulsa


RE: Balancer exiting immediately despite having work to do.

Posted by "Bible, Landy" <la...@utulsa.edu>.
James,

http://pastebin.com/mYBRKDew

Tomorrow I'll run the balancer again and grab a copy of the namenode logs as well.  Didn't think of that today.

-Landy

-----Original Message-----
From: jameswarren3@gmail.com [mailto:jameswarren3@gmail.com] On Behalf Of James Warren
Sent: Wednesday, January 04, 2012 7:49 PM
To: common-user@hadoop.apache.org
Subject: Re: Balancer exiting immediately despite having work to do.

Hi Landy -

Attachments are stripped from e-mails sent to the mailing list.  Could you publish your logs on pastebin and forward the url?

cheers,
-James

On Wed, Jan 4, 2012 at 10:03 AM, Bible, Landy <la...@utulsa.edu>wrote:

> Hi all,****
>
> ** **
>
> I'm running Hadoop 0.20.2.  The balancer has suddenly stopped working.
> I'm attempting to balance the cluster with a threshold of 1, using the 
> following command:****
>
> ** **
>
> ./hadoop balancer -threshold 1****
>
> ** **
>
> This has been working fine, but suddenly it isn't.  It skips though 5 
> iterations without actually doing any work:****
>
> ** **
>
> Time Stamp               Iteration#  Bytes Already Moved  Bytes Left To
> Move  Bytes Being Moved****
>
> Jan 4, 2012 11:47:56 AM           0                 0 KB             1.87
> GB            6.68 GB****
>
> Jan 4, 2012 11:47:56 AM           1                 0 KB             1.87
> GB            6.68 GB****
>
> Jan 4, 2012 11:47:56 AM           2                 0 KB             1.87
> GB            6.68 GB****
>
> Jan 4, 2012 11:47:57 AM           3                 0 KB             1.87
> GB            6.68 GB****
>
> Jan 4, 2012 11:47:57 AM           4                 0 KB             1.87
> GB            6.68 GB****
>
> No block has been moved for 5 iterations. Exiting...****
>
> Balancing took 524.0 milliseconds****
>
> ** **
>
> I've attached the full log, but I can't see any errors indicating why 
> it is failing.  Any ideas?  I'd really like to get balancing working again.
> My use case isn't the norm, and it is important that the cluster stay 
> as close to completely balanced as possible.****
>
> ** **
>
> --****
>
> Landy Bible****
>
> ** **
>
> Simulation and Computer Specialist****
>
> School of Nursing - Collins College of Business****
>
> The University of Tulsa****
>
> ** **
>

Re: Balancer exiting immediately despite having work to do.

Posted by James Warren <ja...@stanfordalumni.org>.
Hi Landy -

Attachments are stripped from e-mails sent to the mailing list.  Could you
publish your logs on pastebin and forward the url?

cheers,
-James

On Wed, Jan 4, 2012 at 10:03 AM, Bible, Landy <la...@utulsa.edu>wrote:

> Hi all,****
>
> ** **
>
> I’m running Hadoop 0.20.2.  The balancer has suddenly stopped working.
> I’m attempting to balance the cluster with a threshold of 1, using the
> following command:****
>
> ** **
>
> ./hadoop balancer –threshold 1****
>
> ** **
>
> This has been working fine, but suddenly it isn’t.  It skips though 5
> iterations without actually doing any work:****
>
> ** **
>
> Time Stamp               Iteration#  Bytes Already Moved  Bytes Left To
> Move  Bytes Being Moved****
>
> Jan 4, 2012 11:47:56 AM           0                 0 KB             1.87
> GB            6.68 GB****
>
> Jan 4, 2012 11:47:56 AM           1                 0 KB             1.87
> GB            6.68 GB****
>
> Jan 4, 2012 11:47:56 AM           2                 0 KB             1.87
> GB            6.68 GB****
>
> Jan 4, 2012 11:47:57 AM           3                 0 KB             1.87
> GB            6.68 GB****
>
> Jan 4, 2012 11:47:57 AM           4                 0 KB             1.87
> GB            6.68 GB****
>
> No block has been moved for 5 iterations. Exiting...****
>
> Balancing took 524.0 milliseconds****
>
> ** **
>
> I’ve attached the full log, but I can’t see any errors indicating why it
> is failing.  Any ideas?  I’d really like to get balancing working again.
> My use case isn’t the norm, and it is important that the cluster stay as
> close to completely balanced as possible.****
>
> ** **
>
> --****
>
> Landy Bible****
>
> ** **
>
> Simulation and Computer Specialist****
>
> School of Nursing – Collins College of Business****
>
> The University of Tulsa****
>
> ** **
>