You are viewing a plain text version of this content. The canonical link for it is here.
Posted to mapreduce-issues@hadoop.apache.org by "Steve Loughran (JIRA)" <ji...@apache.org> on 2013/11/05 18:32:17 UTC

[jira] [Commented] (MAPREDUCE-5606) JobTracker blocked for DFSClient: Failed recovery attempt

    [ https://issues.apache.org/jira/browse/MAPREDUCE-5606?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13814061#comment-13814061 ] 

Steve Loughran commented on MAPREDUCE-5606:
-------------------------------------------

Does this still happen when you upgrade to the most recent version of Hadoop 1.x (or better yet, Hadoop 2.2?)

> JobTracker blocked for DFSClient: Failed recovery attempt
> ---------------------------------------------------------
>
>                 Key: MAPREDUCE-5606
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5606
>             Project: Hadoop Map/Reduce
>          Issue Type: Bug
>          Components: jobtracker
>    Affects Versions: 1.0.3
>         Environment: centos 5.8  jdk 1.7 
>            Reporter: firegun
>            Priority: Critical
>
> when a  datanode was crash,the server can  ping ok,but can not  call rpc ,and also can not ssh login. and then jobTracker may be request a block on this datanode.
> it will happened ,the  JobTracker can not work,the webUI is also unwork,hadoop job -list also unwork,the jobTracker logs no other info .
> and then we need to restart the datanode.
> then jobTraker can work too,but the taskTracker num come to zero,
> we need run : hadoop mradmin -refreshNodes
> then the JobTracker begin to add taskTraker ,but is very slowly.
> this problem occur 5time  in 2weeks.



--
This message was sent by Atlassian JIRA
(v6.1#6144)