You are viewing a plain text version of this content. The canonical link for it is here.
Posted to mapreduce-dev@hadoop.apache.org by "Matt Foley (JIRA)" <ji...@apache.org> on 2011/09/23 03:19:26 UTC

[jira] [Created] (MAPREDUCE-3077) re-enable faulty TaskTracker storage without restarting TT, when appropriate

re-enable faulty TaskTracker storage without restarting TT, when appropriate
----------------------------------------------------------------------------

                 Key: MAPREDUCE-3077
                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3077
             Project: Hadoop Map/Reduce
          Issue Type: Improvement
          Components: tasktracker
    Affects Versions: 0.20.205.0
            Reporter: Matt Foley


In MAPREDUCE-2928, Ravi Gummadi proposed:
bq. we can add LocalStorage.checkBadLocalDirs() call to TT.initialize() that can do disk-health-check of bad local dirs and add dirs to the good local dirs list if they become good.
and Eli Collins added:
bq. Sounds good. Since transient disk failures may cause a file system to become read-only (causing permanent failures) sometimes re-mounting is sufficient to recover in which case it makes sense to re-enable faulty disks w/o TT restart.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Re: [jira] [Created] (MAPREDUCE-3077) re-enable faulty TaskTracker storage without restarting TT, when appropriate

Posted by Koji Noguchi <kn...@yahoo-inc.com>.


On 9/22/11 6:19 PM, "Matt Foley (JIRA)" <ji...@apache.org> wrote:

> re-enable faulty TaskTracker storage without restarting TT, when appropriate
> ----------------------------------------------------------------------------
> 
>                  Key: MAPREDUCE-3077
>                  URL: https://issues.apache.org/jira/browse/MAPREDUCE-3077
>              Project: Hadoop Map/Reduce
>           Issue Type: Improvement
>           Components: tasktracker
>     Affects Versions: 0.20.205.0
>             Reporter: Matt Foley
> 
> 
> In MAPREDUCE-2928, Ravi Gummadi proposed:
> bq. we can add LocalStorage.checkBadLocalDirs() call to TT.initialize() that
> can do disk-health-check of bad local dirs and add dirs to the good local dirs
> list if they become good.
> and Eli Collins added:
> bq. Sounds good. Since transient disk failures may cause a file system to
> become read-only (causing permanent failures) sometimes re-mounting is
> sufficient to recover in which case it makes sense to re-enable faulty disks
> w/o TT restart.
> 
> --
> This message is automatically generated by JIRA.
> For more information on JIRA, see: http://www.atlassian.com/software/jira
> 
>