You are viewing a plain text version of this content. The canonical link for it is here.
Posted to common-dev@hadoop.apache.org by "Venkatesh S (JIRA)" <ji...@apache.org> on 2009/06/09 14:00:07 UTC
[jira] Created: (HADOOP-5999) Support for concatenating of files
into a single file
Support for concatenating of files into a single file
-----------------------------------------------------
Key: HADOOP-5999
URL: https://issues.apache.org/jira/browse/HADOOP-5999
Project: Hadoop Core
Issue Type: New Feature
Components: dfs
Reporter: Venkatesh S
An API to concatenate files of same size and replication factor on HDFS into a single larger file.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
[jira] Commented: (HADOOP-5999) Support for concatenating of files
into a single file
Posted by "Venkatesh S (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/HADOOP-5999?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12717994#action_12717994 ]
Venkatesh S commented on HADOOP-5999:
-------------------------------------
It can always throw an exception if the replication factor or sizes do not match.
> Support for concatenating of files into a single file
> -----------------------------------------------------
>
> Key: HADOOP-5999
> URL: https://issues.apache.org/jira/browse/HADOOP-5999
> Project: Hadoop Core
> Issue Type: New Feature
> Components: dfs
> Reporter: Venkatesh S
>
> An API to concatenate files of same size and replication factor on HDFS into a single larger file.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
[jira] Commented: (HADOOP-5999) Support for concatenating of files
into a single file
Posted by "Owen O'Malley (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/HADOOP-5999?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12717845#action_12717845 ]
Owen O'Malley commented on HADOOP-5999:
---------------------------------------
I don't think this is a good feature to expose. It has the unfortunate characteristic that it will perform very badly on most file systems and even in some non-trivial cases of HDFS. Take for instance the case where the files have different replication factors or block sizes.
> Support for concatenating of files into a single file
> -----------------------------------------------------
>
> Key: HADOOP-5999
> URL: https://issues.apache.org/jira/browse/HADOOP-5999
> Project: Hadoop Core
> Issue Type: New Feature
> Components: dfs
> Reporter: Venkatesh S
>
> An API to concatenate files of same size and replication factor on HDFS into a single larger file.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.