You are viewing a plain text version of this content. The canonical link for it is here.

Posted to common-dev@hadoop.apache.org by "Tsz Wo (Nicholas), SZE (JIRA)" <ji...@apache.org> on 2008/11/04 03:25:44 UTC

[jira] Assigned: (HADOOP-2002) Eliminate redundant searches in the namespace directory tree.

     [ https://issues.apache.org/jira/browse/HADOOP-2002?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Tsz Wo (Nicholas), SZE reassigned HADOOP-2002:
----------------------------------------------

    Assignee:     (was: Tsz Wo (Nicholas), SZE)

I won't be able to work on this issue in the near future.  Setting this to unassigned.

> Eliminate redundant searches in the namespace directory tree.
> -------------------------------------------------------------
>
>                 Key: HADOOP-2002
>                 URL: https://issues.apache.org/jira/browse/HADOOP-2002
>             Project: Hadoop Core
>          Issue Type: Bug
>          Components: dfs
>    Affects Versions: 0.13.0
>            Reporter: Konstantin Shvachko
>
> There is no need to look for the same INode multiple times in the same name-node operation.
> For example in FSNamesystem.exists()
> {code}
>   public boolean exists(String src) {
>     if (dir.getFileBlocks(src) != null || dir.isDir(src)) {
>       return true;
>     } else {
>       return false;
>     }
>   }
> {code}
> both getFileBlocks() and isDir() call rootDir.getNode(src) inside, which causes two separate lookups in the directory tree while one is enough.
>  Why not check whether the inode is a directory as well as that it has blocks at the same time.
> Other methods do the same thing.
> - completeFile() calls getINode in different parts at least 3 times.
> - getAdditionalBlock() - 2 getINode calls
> - startFile() - I counted 5 calls, may be missed some.
> In order to prevent that we should define all methods beyond the top level based on INode parameters rather than path names. 
> E.g. all FSDirectory methods should take INode as a parameter, not the String.
> We should be careful though not to use INode across separate synchronized sections. 
> Once the lock is released the INode should be accessed by the path again.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.