You are viewing a plain text version of this content. The canonical link for it is here.
Posted to common-dev@hadoop.apache.org by "Amareshwari Sriramadasu (JIRA)" <ji...@apache.org> on 2008/05/26 11:45:56 UTC

[jira] Updated: (HADOOP-2427) Cleanup of mapred.local.dir after maptask is complete

     [ https://issues.apache.org/jira/browse/HADOOP-2427?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Amareshwari Sriramadasu updated HADOOP-2427:
--------------------------------------------

    Attachment: patch-2427.txt

Currently the failed and killed tasks are cleanedup as soon as they report as finished. But successful tasks are cleanedup at the end of the job, so that the map outputs will be available for the reducers.  But now, since we have <taskid>/work as current working directory for the task, and <taskid>/output directory for intermediate map output files. So, <taskid>/work can be cleanedup for successful tasks as soon as the task is finished.

Here is a patch doing cleanup of workdir for successful tasks.

> Cleanup of mapred.local.dir after maptask is complete
> -----------------------------------------------------
>
>                 Key: HADOOP-2427
>                 URL: https://issues.apache.org/jira/browse/HADOOP-2427
>             Project: Hadoop Core
>          Issue Type: Bug
>          Components: mapred
>    Affects Versions: 0.15.1
>            Reporter: lohit vijayarenu
>             Fix For: 0.18.0
>
>         Attachments: patch-2427.txt
>
>
> I see that after a map task is complete, its working directory (mapred.local.dir)/taskTracker/jobcache/<jobid>/<task_dir> is not deleted untill the job is complete. If map out files are stored in there, could this be created in different directory and the working directory cleaned up after map task is complete. One problem we are seeing is, if a map task creates files temporary files, they get accumulated and we may run out of disk space thus failing the job. Relying on the user to cleanup all temp files created is be error prone.  

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.