You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@sqoop.apache.org by "Kopal Niranjan (JIRA)" <ji...@apache.org> on 2016/04/19 11:48:25 UTC
[jira] [Updated] (SQOOP-2585) merging hive tables using sqoop
[ https://issues.apache.org/jira/browse/SQOOP-2585?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Kopal Niranjan updated SQOOP-2585:
----------------------------------
Attachment: SQOOP-2585_1.patch
> merging hive tables using sqoop
> -------------------------------
>
> Key: SQOOP-2585
> URL: https://issues.apache.org/jira/browse/SQOOP-2585
> Project: Sqoop
> Issue Type: New Feature
> Affects Versions: 1.4.6
> Reporter: Rakesh Sharma
> Assignee: Rakesh Sharma
> Fix For: 1.4.7
>
> Attachments: SQOOP-2585_0.patch, SQOOP-2585_1.patch
>
>
> Sqoop currently doesn't support merging of two hive tables. Implement a new tool for Sqoop which :
> 1. merges two hive tables. (This will be most commonly used to merge new incremental data into an existing hive table)
> 2. should support both partitioned and non partitioned tables
> 3. supports merge on composite keys
> 4. For partitioned tables, merges old partitions and also add new partitions.
> 5. Supports Text/RC/ORC/Seq file formats
> 6. Should make sure that only process is performing merge.
> 7. should be an atomic operation. If it fails at any point , it should revert the target hive table to its original state.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)