You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@sqoop.apache.org by "Rakesh Sharma (JIRA)" <ji...@apache.org> on 2015/09/22 07:51:04 UTC

[jira] [Created] (SQOOP-2585) merging hive tables using sqoop

Rakesh Sharma created SQOOP-2585:
------------------------------------

             Summary: merging hive tables using sqoop
                 Key: SQOOP-2585
                 URL: https://issues.apache.org/jira/browse/SQOOP-2585
             Project: Sqoop
          Issue Type: New Feature
            Reporter: Rakesh Sharma
            Assignee: Rakesh Sharma


Sqoop currently doesn't support merging of two hive tables. Implement a new tool for Sqoop which :

1. merges two hive tables. (This will be most commonly used to merge new incremental data into an existing hive table)
2. should support both partitioned and non partitioned tables
3. supports merge on composite keys
4. For partitioned tables, merges old partitions and also add new partitions.
5. Supports Text/RC/ORC/Seq file formats
6. Should make sure that only process is performing merge.
7. should be an atomic operation. If it fails at any point , it should revert the target hive table to its original state.






--
This message was sent by Atlassian JIRA
(v6.3.4#6332)