Posted to mapreduce-issues@hadoop.apache.org by "firegun (JIRA)" <ji...@apache.org> on 2013/11/05 03:48:18 UTC

[jira] [Created] (MAPREDUCE-5606) JobTracker blocked for DFSClient: Failed recovery attempt

firegun created MAPREDUCE-5606:
----------------------------------

             Summary: JobTracker blocked for DFSClient: Failed recovery attempt
                 Key: MAPREDUCE-5606
                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5606
             Project: Hadoop Map/Reduce
          Issue Type: Bug
          Components: jobtracker
    Affects Versions: 1.0.3
         Environment: CentOS 5.8, JDK 1.7
            Reporter: firegun
            Priority: Critical


When a DataNode crashes, the server still responds to ping, but RPC calls to it fail and SSH login is not possible. The JobTracker may then request a block on this DataNode.
When that happens, the JobTracker stops working: the web UI does not respond, hadoop job -list does not respond either, and the JobTracker log shows no other information.
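
As a rough illustration of the "ping works but RPC does not" symptom, the following is a minimal sketch (not part of the original report) of a TCP connect probe with a timeout, which could be pointed at a DataNode's IPC port. The class name PortProbe, the method canConnect, the 3000 ms timeout, and the default port 50020 are assumptions made for this example; 50020 is the usual default in dfs.datanode.ipc.address, so check your own configuration. A successful connect only shows that the kernel accepted the connection, so a hung DataNode process may still pass this check while failing real RPC calls.

```java
import java.io.IOException;
import java.net.InetSocketAddress;
import java.net.Socket;

public class PortProbe {

    /**
     * Returns true if a TCP connection to host:port completes within timeoutMs.
     * Any I/O failure (refused, timed out, unknown host) is reported as false.
     */
    static boolean canConnect(String host, int port, int timeoutMs) {
        try (Socket socket = new Socket()) {
            socket.connect(new InetSocketAddress(host, port), timeoutMs);
            return true;
        } catch (IOException e) {
            return false;
        }
    }

    public static void main(String[] args) {
        // Hypothetical defaults: localhost and the assumed DataNode IPC port.
        String host = args.length > 0 ? args[0] : "localhost";
        int port = args.length > 1 ? Integer.parseInt(args[1]) : 50020;
        boolean ok = canConnect(host, port, 3000);
        System.out.println(host + ":" + port + " reachable=" + ok);
    }
}
```

The bounded connect timeout is the point of the sketch: a client that waits without a timeout on an unresponsive node can block indefinitely, which matches the behavior described here.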

We then need to restart the DataNode.
After that the JobTracker works again, but the number of TaskTrackers drops to zero,
so we need to run: hadoop mradmin -refreshNodes
The JobTracker then begins to add TaskTrackers again, but very slowly.

This problem occurred 5 times in 2 weeks.




--
This message was sent by Atlassian JIRA
(v6.1#6144)