You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@hive.apache.org by "Feng Peng (JIRA)" <ji...@apache.org> on 2013/03/20 23:23:15 UTC

[jira] [Commented] (HIVE-1161) Hive Replication

    [ https://issues.apache.org/jira/browse/HIVE-1161?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13608323#comment-13608323 ] 

Feng Peng commented on HIVE-1161:
---------------------------------

Is anyone planning to work on this issue? We plan to provide the replication between two clusters on the partition level, i.e., given a source cluster and a target cluster, we can specify a table and the tool would sync all the updated partitions from the source cluster to the target cluster, and create the table on the target cluster if it doesn't already exist.

I saw the comments from Namit (http://grokbase.com/t/hive/user/102br2zfae/best-way-to-move-hive-tables-data-from-one-hadoop-cluster-to-another) and am wondering if this is something already done somewhere and if it can be shared.

                
> Hive Replication
> ----------------
>
>                 Key: HIVE-1161
>                 URL: https://issues.apache.org/jira/browse/HIVE-1161
>             Project: Hive
>          Issue Type: New Feature
>          Components: Contrib
>            Reporter: Edward Capriolo
>            Assignee: Edward Capriolo
>            Priority: Minor
>
> Users may want to replicate data between two distinct hadoop clusters or two hive warehouses on the same cluster.
> Users may want to replicate entire catalogs or possibly on a table by table basis. Should this process be batch driven or a be a full time running application? What are some practical requirements, what are the limitations?
> Comments?

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira