You are viewing a plain text version of this content. The canonical link for it is here.
Posted to common-dev@hadoop.apache.org by "Runping Qi (JIRA)" <ji...@apache.org> on 2006/04/15 02:27:00 UTC

[jira] Commented: (HADOOP-137) Different TaskTrackers may get the same task tracker id, thus cause many problems.

    [ http://issues.apache.org/jira/browse/HADOOP-137?page=comments#action_12374587 ] 

Runping Qi commented on HADOOP-137:
-----------------------------------


A simple fix for the issue is to add hostname as part of task track names. 
I believe Owen has actually done this in trying to fix another issue, but the patch
is not committed yet.



> Different TaskTrackers may get the same task tracker id, thus cause many problems.
> ----------------------------------------------------------------------------------
>
>          Key: HADOOP-137
>          URL: http://issues.apache.org/jira/browse/HADOOP-137
>      Project: Hadoop
>         Type: Bug

>   Components: mapred
>     Reporter: Runping Qi
>     Priority: Critical

>
> In the TaskTracker#Initialize method, the following line assigns task tracker name (id):
> this.taskTrackerName = "tracker_" + (Math.abs(r.nextInt()) % 100000);
> For a fair size cluster, it is possible that different task trackers to get the same id, causing name conflict.
> I encountered this problem with a cluster of 274 nodes. Once such conflict happens, a lot of strange things may happen.
> For example, a reducer task tried to copy from a machine (task tracker) a map output file that was actually produced 
> on another machine.

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators:
   http://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see:
   http://www.atlassian.com/software/jira