Posted to mapreduce-dev@hadoop.apache.org by "Matt Foley (JIRA)" <ji...@apache.org> on 2011/09/23 03:19:26 UTC
[jira] [Created] (MAPREDUCE-3077) re-enable faulty TaskTracker
storage without restarting TT, when appropriate
re-enable faulty TaskTracker storage without restarting TT, when appropriate
----------------------------------------------------------------------------
Key: MAPREDUCE-3077
URL: https://issues.apache.org/jira/browse/MAPREDUCE-3077
Project: Hadoop Map/Reduce
Issue Type: Improvement
Components: tasktracker
Affects Versions: 0.20.205.0
Reporter: Matt Foley
In MAPREDUCE-2928, Ravi Gummadi proposed:
bq. we can add LocalStorage.checkBadLocalDirs() call to TT.initialize() that can do disk-health-check of bad local dirs and add dirs to the good local dirs list if they become good.
and Eli Collins added:
bq. Sounds good. Since transient disk failures may cause a file system to become read-only (causing permanent failures) sometimes re-mounting is sufficient to recover in which case it makes sense to re-enable faulty disks w/o TT restart.
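
For illustration only, here is a rough sketch of the kind of re-check being proposed above. It is not the actual Hadoop code: the class name LocalDirHealth, its fields, and the isUsable() probe are all hypothetical, and only the method name checkBadLocalDirs() is taken from the proposal. It assumes a bad dir can be called "good again" once a probe file can be created and deleted in it, which would catch the read-only-until-remounted case mentioned above.

```java
import java.io.File;
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.ArrayList;
import java.util.Iterator;
import java.util.List;

/** Hypothetical stand-in for the TT's local storage bookkeeping; not Hadoop's LocalStorage. */
class LocalDirHealth {
    private final List<File> goodDirs = new ArrayList<>();
    private final List<File> badDirs = new ArrayList<>();

    LocalDirHealth(List<File> good, List<File> bad) {
        goodDirs.addAll(good);
        badDirs.addAll(bad);
    }

    /** Assumed health criterion: a probe file can be created and deleted in the dir. */
    static boolean isUsable(File dir) {
        if (!dir.isDirectory() || !dir.canRead() || !dir.canExecute()) {
            return false;
        }
        try {
            Path probe = Files.createTempFile(dir.toPath(), "health", ".probe");
            Files.delete(probe);
            return true;
        } catch (IOException | SecurityException e) {
            return false; // e.g. the file system is still mounted read-only
        }
    }

    /** Re-probe every known-bad dir and move recovered ones to the good list. */
    synchronized int checkBadLocalDirs() {
        int recovered = 0;
        for (Iterator<File> it = badDirs.iterator(); it.hasNext();) {
            File dir = it.next();
            if (isUsable(dir)) {
                it.remove();
                goodDirs.add(dir);
                recovered++;
            }
        }
        return recovered;
    }

    synchronized List<File> getGoodDirs() { return new ArrayList<>(goodDirs); }
    synchronized List<File> getBadDirs() { return new ArrayList<>(badDirs); }
}
```

Calling checkBadLocalDirs() from an initialization path, or from a periodic health-check thread, is a design choice this sketch leaves open; the second option is what would allow recovery without a TT restart.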
--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira
Re: [jira] [Created] (MAPREDUCE-3077) re-enable faulty TaskTracker
storage without restarting TT, when appropriate
Posted by Koji Noguchi <kn...@yahoo-inc.com>.