You are viewing a plain text version of this content. The canonical link for it is here.
Posted to common-dev@hadoop.apache.org by "Amareshwari Sriramadasu (JIRA)" <ji...@apache.org> on 2008/08/14 12:37:44 UTC
[jira] Commented: (HADOOP-3807) task attempt failing to report
status just after the intialization
[ https://issues.apache.org/jira/browse/HADOOP-3807?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12622507#action_12622507 ]
Amareshwari Sriramadasu commented on HADOOP-3807:
-------------------------------------------------
Saw this exception many times.
The lines corresponding to the exception are
{code}
2139 TaskUmbilicalProtocol umbilical =
2140 (TaskUmbilicalProtocol)RPC.getProxy(TaskUmbilicalProtocol.class,
2141 TaskUmbilicalProtocol.versionID,
2142 address,
2143 defaultConf);
2144
2145 Task task = umbilical.getTask(taskid);
2146 JobConf job = new JobConf(task.getJobFile());
2147 TaskLog.cleanup(job.getInt("mapred.userlog.retain.hours", 24));
{code}
Looks like exception is because RPC.getProxy is returning a *null* .
> task attempt failing to report status just after the intialization
> ------------------------------------------------------------------
>
> Key: HADOOP-3807
> URL: https://issues.apache.org/jira/browse/HADOOP-3807
> Project: Hadoop Core
> Issue Type: Bug
> Components: mapred
> Reporter: Amareshwari Sriramadasu
> Fix For: 0.19.0
>
>
> In sort500 runs, I noticed task attempts failing to report status : saying
> Task attempt_200807220707_0002_r_000336_1 failed to report status for 605 seconds. Killing!
> And the task logs has NullPointerException saying:
> *stderr logs*
> Exception in thread "main" java.lang.NullPointerException
> at org.apache.hadoop.mapred.TaskTracker$Child.main(TaskTracker.java:2146)
> Task tracker logs for the same attempt are:
> 2008-07-22 08:07:37,110 INFO org.apache.hadoop.mapred.TaskTracker: LaunchTaskAction: attempt_200807220707_0002_r_000336_1
> 2008-07-22 08:17:42,144 INFO org.apache.hadoop.mapred.TaskTracker: attempt_200807220707_0002_r_000336_1: Task attempt_200807220707_0002_r_000336_1 failed toreport status for 605 seconds. Killing!
> 2008-07-22 08:17:42,162 INFO org.apache.hadoop.mapred.TaskTracker: About to purge task: attempt_200807220707_0002_r_000336_1
> 2008-07-22 08:17:42,163 INFO org.apache.hadoop.mapred.TaskRunner: attempt_200807220707_0002_r_000336_1 done; removing files.
> 2008-07-22 08:18:15,481 WARN org.apache.hadoop.mapred.TaskTracker: Unknown child task finshed: attempt_200807220707_0002_r_000336_1. Ignored.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.