You are viewing a plain text version of this content. The canonical link for it is here.
Posted to mapreduce-issues@hadoop.apache.org by "Ramgopal N (Created) (JIRA)" <ji...@apache.org> on 2011/10/18 13:06:10 UTC

[jira] [Created] (MAPREDUCE-3201) Even though jobs are getting failed on particular NM, it is not getting blacklisted

Even though jobs are getting failed on particular NM, it is not getting blacklisted
-----------------------------------------------------------------------------------

                 Key: MAPREDUCE-3201
                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3201
             Project: Hadoop Map/Reduce
          Issue Type: Bug
          Components: mrv2
    Affects Versions: 0.23.0
            Reporter: Ramgopal N
            Priority: Minor


{code:xml}
The yarnchild process on a particular NM are getting killed continuosly. 
Still the NM is not getting blacklisted
{code}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Commented] (MAPREDUCE-3201) Even though jobs are getting failed on particular NM, it is not getting blacklisted

Posted by "Vinod Kumar Vavilapalli (Commented) (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/MAPREDUCE-3201?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13129673#comment-13129673 ] 

Vinod Kumar Vavilapalli commented on MAPREDUCE-3201:
----------------------------------------------------

Ramgopal, we do have job-level blacklist today. Can you look at your AM logs and grep for the following log messsages? Thanks.
{quote}
nodeBlacklistingEnabled:
maxTaskFailuresPerNode is 
 failures on node 
Blacklisted host 
{quote}

There is a known bug related to NM blacklisting by MR jobs - MAPREDUCE-2693, but doesn't look like you are running into that.
                
> Even though jobs are getting failed on particular NM, it is not getting blacklisted
> -----------------------------------------------------------------------------------
>
>                 Key: MAPREDUCE-3201
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3201
>             Project: Hadoop Map/Reduce
>          Issue Type: Bug
>          Components: mrv2
>    Affects Versions: 0.23.0
>            Reporter: Ramgopal N
>            Priority: Minor
>
> {code:xml}
> The yarnchild process on a particular NM are getting killed continuosly. 
> Still the NM is not getting blacklisted
> {code}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira