Posted to hdfs-dev@hadoop.apache.org by "Todd Lipcon (JIRA)" <ji...@apache.org> on 2009/12/10 01:05:18 UTC

[jira] Created: (HDFS-821) Garbage collect datanode tmp dirs

Garbage collect datanode tmp dirs
---------------------------------

                 Key: HDFS-821
                 URL: https://issues.apache.org/jira/browse/HDFS-821
             Project: Hadoop HDFS
          Issue Type: Improvement
          Components: data-node
    Affects Versions: 0.22.0
            Reporter: Todd Lipcon


I've seen in practice (and it's been reported on the list) cases where the datanode's tmp dir fills up with abandoned blocks. There's an ancient comment in the code from April 2007:
{code}
  // REMIND - mjc - eventually we should have a timeout system
  // in place to clean up block files left by abandoned clients.
  // We should have some timer in place, so that if a blockfile
  // is created but non-valid, and has been idle for >48 hours,
  // we can GC it safely.
{code}

Well, we can consider ourselves reminded, so let's do it!
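
As a rough illustration of the timer-based cleanup that comment asks for, here is a minimal sketch. It is not taken from the datanode code: the class name TmpDirGc, the collect method, and its parameters are all hypothetical. It also assumes that a file's last-modified time is an acceptable proxy for "idle", and that every regular file in the tmp dir is a non-finalized block file; a real implementation would need to check that no client is still writing the block before deleting it.

```java
import java.io.IOException;
import java.nio.file.DirectoryStream;
import java.nio.file.Files;
import java.nio.file.Path;
import java.time.Duration;
import java.time.Instant;
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, not part of HDFS: removes files from a tmp dir
// once they have gone unmodified for longer than maxIdle.
final class TmpDirGc {

  private TmpDirGc() {}

  /**
   * Deletes regular files directly under tmpDir whose last-modified time
   * is earlier than (now - maxIdle). Returns the paths that were deleted.
   * Assumption: last-modified time stands in for "idle since".
   */
  static List<Path> collect(Path tmpDir, Duration maxIdle, Instant now)
      throws IOException {
    List<Path> removed = new ArrayList<>();
    Instant cutoff = now.minus(maxIdle);
    try (DirectoryStream<Path> entries = Files.newDirectoryStream(tmpDir)) {
      for (Path entry : entries) {
        if (!Files.isRegularFile(entry)) {
          continue; // leave subdirectories and special files alone
        }
        Instant lastModified = Files.getLastModifiedTime(entry).toInstant();
        // deleteIfExists returns false if the file vanished in the meantime
        if (lastModified.isBefore(cutoff) && Files.deleteIfExists(entry)) {
          removed.add(entry);
        }
      }
    }
    return removed;
  }
}
```

Calling TmpDirGc.collect(tmpDir, Duration.ofHours(48), Instant.now()) from a periodic task would match the 48-hour threshold in the comment. Passing the current time in as a parameter is only a choice made here to keep the sketch easy to test.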
