You are viewing a plain text version of this content. The canonical link for it is here.
Posted to common-dev@hadoop.apache.org by "Runping Qi (JIRA)" <ji...@apache.org> on 2006/04/15 02:27:00 UTC
[jira] Commented: (HADOOP-137) Different TaskTrackers may get the
same task tracker id, thus cause many problems.
[ http://issues.apache.org/jira/browse/HADOOP-137?page=comments#action_12374587 ]
Runping Qi commented on HADOOP-137:
-----------------------------------
A simple fix for the issue is to add hostname as part of task track names.
I believe Owen has actually done this in trying to fix another issue, but the patch
is not committed yet.
> Different TaskTrackers may get the same task tracker id, thus cause many problems.
> ----------------------------------------------------------------------------------
>
> Key: HADOOP-137
> URL: http://issues.apache.org/jira/browse/HADOOP-137
> Project: Hadoop
> Type: Bug
> Components: mapred
> Reporter: Runping Qi
> Priority: Critical
>
> In the TaskTracker#Initialize method, the following line assigns task tracker name (id):
> this.taskTrackerName = "tracker_" + (Math.abs(r.nextInt()) % 100000);
> For a fair size cluster, it is possible that different task trackers to get the same id, causing name conflict.
> I encountered this problem with a cluster of 274 nodes. Once such conflict happens, a lot of strange things may happen.
> For example, a reducer task tried to copy from a machine (task tracker) a map output file that was actually produced
> on another machine.
--
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators:
http://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see:
http://www.atlassian.com/software/jira