You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@mesos.apache.org by "Vinod Kone (JIRA)" <ji...@apache.org> on 2013/02/07 22:55:13 UTC

[jira] [Resolved] (MESOS-17) Hadoop executors killed while tasks in COMMIT_PENDING

     [ https://issues.apache.org/jira/browse/MESOS-17?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Vinod Kone resolved MESOS-17.
-----------------------------

    Resolution: Fixed

This shouldn't be a problem anymore with the latest Hadoop refactor.
                
> Hadoop executors killed while tasks in COMMIT_PENDING
> -----------------------------------------------------
>
>                 Key: MESOS-17
>                 URL: https://issues.apache.org/jira/browse/MESOS-17
>             Project: Mesos
>          Issue Type: Bug
>          Components: isolation
>         Environment: LXC isolation module, Hadoop framework
>            Reporter: Charles Reiss
>            Priority: Minor
>              Labels: hadoop, lxc
>
> The Hadoop framework considers tasks finished when they are in the COMMIT_PENDING state. When using the LXC isolation module, this can cause the Hadoop executor's memory allocation to be reduced before the task actually commits. When this happens, the Hadoop executor is sometimes killed for exceeding its memory allocation, leaving the tasks stalled until the master detects the lost task tracker by timeout.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira