You are viewing a plain text version of this content. The canonical link for it is here.
Posted to common-issues@hadoop.apache.org by "Hairong Kuang (JIRA)" <ji...@apache.org> on 2010/07/22 00:17:18 UTC
[jira] Commented: (HADOOP-6870) Add FileSystem#listLocatedStatus to
list a directory's content together with each file's block locations
[ https://issues.apache.org/jira/browse/HADOOP-6870?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12890928#action_12890928 ]
Hairong Kuang commented on HADOOP-6870:
---------------------------------------
Note that listLocatedStatus lists only files. So the API name might be better to be listFiles().
{code}
Iterator<LocatedFileStatus> listFiles(Path path, boolean isRecursive);
{code}
> Add FileSystem#listLocatedStatus to list a directory's content together with each file's block locations
> --------------------------------------------------------------------------------------------------------
>
> Key: HADOOP-6870
> URL: https://issues.apache.org/jira/browse/HADOOP-6870
> Project: Hadoop Common
> Issue Type: New Feature
> Components: fs
> Affects Versions: 0.22.0
> Reporter: Hairong Kuang
> Assignee: Hairong Kuang
> Fix For: 0.22.0
>
>
> This jira implements the new FileSystem API as proposed in HDFS-202. The new API aims to eliminate individual "getFileBlockLocations" calls to NN for each file in the input directory of a job. Instead, a file's block locations are returned together with FileStatus when listing a directory, thus improving getSplits performance.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.