You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@sqoop.apache.org by "Kopal Niranjan (JIRA)" <ji...@apache.org> on 2016/04/19 11:48:25 UTC

[jira] [Updated] (SQOOP-2585) merging hive tables using sqoop

     [ https://issues.apache.org/jira/browse/SQOOP-2585?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Kopal Niranjan updated SQOOP-2585:
----------------------------------
    Attachment: SQOOP-2585_1.patch

> merging hive tables using sqoop
> -------------------------------
>
>                 Key: SQOOP-2585
>                 URL: https://issues.apache.org/jira/browse/SQOOP-2585
>             Project: Sqoop
>          Issue Type: New Feature
>    Affects Versions: 1.4.6
>            Reporter: Rakesh Sharma
>            Assignee: Rakesh Sharma
>             Fix For: 1.4.7
>
>         Attachments: SQOOP-2585_0.patch, SQOOP-2585_1.patch
>
>
> Sqoop currently doesn't support merging of two hive tables. Implement a new tool for Sqoop which :
> 1. merges two hive tables. (This will be most commonly used to merge new incremental data into an existing hive table)
> 2. should support both partitioned and non partitioned tables
> 3. supports merge on composite keys
> 4. For partitioned tables, merges old partitions and also add new partitions.
> 5. Supports Text/RC/ORC/Seq file formats
> 6. Should make sure that only process is performing merge.
> 7. should be an atomic operation. If it fails at any point , it should revert the target hive table to its original state.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)