You are viewing a plain text version of this content. The canonical link for it is here.
Posted to common-user@hadoop.apache.org by Saptarshi Guha <sa...@gmail.com> on 2008/05/20 18:03:48 UTC

Meaning of "Data-local map tasks" in the web status gui to MapReduce

Hello,
	Does the "Data-local map tasks" counter mean the number of tasks that  
the had the input data already present on the machine on they are  
running on? i.e the wasn't a need to ship the data to them.
	Thanks
	Saptarsh

Saptarshi Guha | saptarshi.guha@gmail.com | http://www.stat.purdue.edu/~sguha


Re: Meaning of "Data-local map tasks" in the web status gui to MapReduce

Posted by Arun C Murthy <ar...@yahoo-inc.com>.
On May 20, 2008, at 9:03 AM, Saptarshi Guha wrote:

> Hello,
> 	Does the "Data-local map tasks" counter mean the number of tasks  
> that the had the input data already present on the machine on they  
> are running on? i.e the wasn't a need to ship the data to them.

Yes.  Your understanding is correct.

More specifically it means that the map-task got scheduled on a  
machine on which one of the replicas of it's input-split-block was  
present and was served by the datanode running on that machine. *smile*

Arun