Posted to mapreduce-dev@hadoop.apache.org by "Aaron Kimball (JIRA)" <ji...@apache.org> on 2009/09/22 02:31:16 UTC

[jira] Created: (MAPREDUCE-1017) Compression and output splitting for Sqoop

Compression and output splitting for Sqoop
------------------------------------------

                 Key: MAPREDUCE-1017
                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1017
             Project: Hadoop Map/Reduce
          Issue Type: New Feature
          Components: contrib/sqoop
            Reporter: Aaron Kimball
            Assignee: Aaron Kimball


Sqoop's "direct mode" writes a single large text file to HDFS. It is important to be able to compress this data before it reaches HDFS. Because compressed files in HDFS are difficult to split for use by MapReduce jobs, the output should also be split into multiple files at compression time.
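
As a rough illustration of splitting at compression time, the sketch below rolls over to a new gzip file whenever the current one would exceed a size threshold, always breaking on a record boundary so that each part can be decompressed and processed independently. This is not Sqoop's implementation: the class name SplittingGzipWriter, the part-file naming scheme, and the threshold measured in uncompressed bytes are all assumptions made for the example, and it writes to the local filesystem through java.nio rather than to HDFS.

```java
import java.io.BufferedWriter;
import java.io.IOException;
import java.io.OutputStreamWriter;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.ArrayList;
import java.util.List;
import java.util.zip.GZIPOutputStream;

/**
 * Hypothetical helper (not part of Sqoop): writes newline-terminated records
 * into a series of gzip files, starting a new file at a size threshold.
 */
class SplittingGzipWriter implements AutoCloseable {
    private final Path dir;
    private final String prefix;
    private final long maxUncompressedBytes; // assumed threshold, counted before compression
    private final List<Path> parts = new ArrayList<>();
    private BufferedWriter current;
    private long bytesInCurrent;

    SplittingGzipWriter(Path dir, String prefix, long maxUncompressedBytes) {
        this.dir = dir;
        this.prefix = prefix;
        this.maxUncompressedBytes = maxUncompressedBytes;
    }

    /** Writes one record plus a newline, rolling to a new part first if it would not fit. */
    void writeRecord(String record) throws IOException {
        long len = record.getBytes(StandardCharsets.UTF_8).length + 1L;
        boolean wouldOverflow = bytesInCurrent > 0 && bytesInCurrent + len > maxUncompressedBytes;
        if (current == null || wouldOverflow) {
            openNextPart();
        }
        current.write(record);
        current.write('\n');
        bytesInCurrent += len;
    }

    /** Closes the current part (if any) and opens the next gzip file in sequence. */
    private void openNextPart() throws IOException {
        closeCurrent();
        Path part = dir.resolve(String.format("%s-%05d.gz", prefix, parts.size()));
        current = new BufferedWriter(new OutputStreamWriter(
                new GZIPOutputStream(Files.newOutputStream(part)), StandardCharsets.UTF_8));
        parts.add(part);
        bytesInCurrent = 0;
    }

    private void closeCurrent() throws IOException {
        if (current != null) {
            current.close(); // also finishes the gzip stream and closes the file
            current = null;
        }
    }

    /** Paths of all part files opened so far, in order. */
    List<Path> parts() {
        return parts;
    }

    @Override
    public void close() throws IOException {
        closeCurrent();
    }
}
```

Because every part is a complete gzip file that ends on a record boundary, a MapReduce job could assign one part per map task without needing to split inside a compressed stream. A record larger than the threshold still gets a part of its own rather than being cut in half.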

