Posted to common-dev@hadoop.apache.org by "Sandy Ryza (JIRA)" <ji...@apache.org> on 2013/12/04 19:28:36 UTC

[jira] [Resolved] (HADOOP-10145) Reduce task stuck on 0.16666667%

     [ https://issues.apache.org/jira/browse/HADOOP-10145?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Sandy Ryza resolved HADOOP-10145.
---------------------------------

    Resolution: Invalid

Please ask for assistance on the Hadoop user list.  JIRA is for reporting bugs.

> Reduce task stuck on 0.16666667%
> --------------------------------
>
>                 Key: HADOOP-10145
>                 URL: https://issues.apache.org/jira/browse/HADOOP-10145
>             Project: Hadoop Common
>          Issue Type: Bug
>          Components: conf
>    Affects Versions: 0.20.2
>         Environment: OS:  RHEL 6.4
> Hadoop version:  0.20.2-cdh3u6
>            Reporter: vikash kumar
>
> All of a sudden, one of our Hadoop jobs is stuck: the reduce phase takes forever to complete (we have waited 30 hours; it usually takes an hour).
> In the TaskTracker logs I see tons of the following messages. At times, however, resubmitting the same job works fine.
> 2013-12-04 00:00:00,381 INFO org.apache.hadoop.mapred.TaskTracker: attempt_201310070546_159167_r_000041_0 0.16666667% reduce > copy (1 of 2 at 0.01 MB/s) >
> 2013-12-04 00:00:00,750 INFO org.apache.hadoop.mapred.TaskTracker: attempt_201310070546_159167_r_000048_0 0.16666667% reduce > copy (1 of 2 at 0.01 MB/s) >
> 2013-12-04 00:00:01,729 INFO org.apache.hadoop.mapred.TaskTracker: attempt_201310070546_159262_r_000046_0 0.16666667% reduce > copy (1 of 2 at 0.03 MB/s) >
> 2013-12-04 00:00:01,918 INFO org.apache.hadoop.mapred.TaskTracker: attempt_201310070546_159262_r_000055_0 0.16666667% reduce > copy (1 of 2 at 0.03 MB/s) >
> 2013-12-04 00:00:01,919 INFO org.apache.hadoop.mapred.TaskTracker: attempt_201310070546_159262_r_000021_0 0.16666667% reduce > copy (1 of 2 at 0.03 MB/s) >
> 2013-12-04 00:00:01,922 INFO org.apache.hadoop.mapred.TaskTracker: attempt_201310070546_159262_r_000031_0 0.16666667% reduce > copy (1 of 2 at 0.03 MB/s) >
> 2013-12-04 00:00:01,940 INFO org.apache.hadoop.mapred.TaskTracker: attempt_201310070546_159262_r_000057_0 0.16666667% reduce > copy (1 of 2 at 0.03 MB/s) >
> 2013-12-04 00:00:02,443 INFO org.apache.hadoop.mapred.TaskTracker: attempt_201310070546_159167_r_000047_0 0.16666667% reduce > copy (1 of 2 at 0.01 MB/s) >
> There are no other reasonable clues in the log to point me in a direction or tell me what to look for. With my setup, upgrading to a newer version is not an option.
> Please help!
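
For anyone reading the quoted log lines: the repeated 0.16666667 looks consistent with a reduce attempt that has fetched 1 of 2 map outputs, assuming MRv1 weights the copy, sort, and reduce phases at one third each (0.5 * 1/3 = 0.1667, which appears to be printed as a fraction despite the "%" suffix). The sketch below is only an illustration under that assumption. The regular expression is written against the lines quoted above, and the helper names (parse_line, expected_progress, summarize) are hypothetical, not part of Hadoop.

```python
import re
from collections import defaultdict

# Pattern derived from the TaskTracker lines quoted in this message only.
LINE_RE = re.compile(
    r"(?P<ts>\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2},\d{3}) INFO "
    r"org\.apache\.hadoop\.mapred\.TaskTracker: "
    r"(?P<attempt>attempt_(?P<job>\d+_\d+)_r_\d+_\d+) "
    r"(?P<progress>[\d.]+)% reduce > copy "
    r"\((?P<copied>\d+) of (?P<total>\d+) at (?P<rate>[\d.]+) MB/s\)"
)


def parse_line(line):
    """Return a dict for a 'reduce > copy' status line, or None if no match."""
    m = LINE_RE.search(line)  # search() tolerates a leading "> " quote marker
    if m is None:
        return None
    return {
        "timestamp": m.group("ts"),
        "attempt": m.group("attempt"),
        "job": "job_" + m.group("job"),
        "progress": float(m.group("progress")),
        "copied": int(m.group("copied")),
        "total": int(m.group("total")),
        "rate_mb_s": float(m.group("rate")),
    }


def expected_progress(copied, total):
    """Overall reduce progress while copying.

    ASSUMPTION: copy, sort, and reduce each contribute one third.
    """
    return (copied / total) / 3.0


def summarize(lines):
    """Group copy-phase lines by job: distinct attempts and min/max copy rate."""
    attempts = defaultdict(set)
    rates = defaultdict(list)
    for line in lines:
        rec = parse_line(line)
        if rec is None:
            continue
        attempts[rec["job"]].add(rec["attempt"])
        rates[rec["job"]].append(rec["rate_mb_s"])
    return {
        job: {
            "attempts": len(attempts[job]),
            "min_rate_mb_s": min(rates[job]),
            "max_rate_mb_s": max(rates[job]),
        }
        for job in attempts
    }
```

Run over a full TaskTracker log, summarize() would show which jobs have reducers sitting in the copy phase and at what transfer rates. A job whose attempts all stay at the same "N of M" count with near-zero rates suggests the reducers are waiting on a map output they cannot fetch, which is the kind of detail worth including when asking on the user list.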



--
This message was sent by Atlassian JIRA
(v6.1#6144)