You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@hbase.apache.org by "Karthik Ranganathan (Updated) (JIRA)" <ji...@apache.org> on 2011/10/19 17:41:10 UTC

[jira] [Updated] (HBASE-4618) HBase backups

     [ https://issues.apache.org/jira/browse/HBASE-4618?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Karthik Ranganathan updated HBASE-4618:
---------------------------------------

    Description: 
We have been working on the ability to do backups in HBase with different levels of protection. This is an umbrella task for all the backup related changes. Here are some kinds of changes - will create separate issues for them:

Roughly here are a few flavors of backups giving increasing levels of guarentees:
1. Per cf backups
2. Multi-cf backups with row atomicity preserved
3. Multi-cf backups with row atomicity and point in time recovery.

On the perf dimension, here is a list of improvements:
1. Copy the files - regular hadoop "cp"
2. Use fast copy - copy blocks and stitch them together, saves top of rack bandwidth
3. Use fast copy with hard links - no file copy, it does only ext3 level linking.

On the durability of data side:
1. Ability to backup data onto the same racks as those running HBase
2. Intra-datacenter backup
3. Inter datacenter backup

Restores:
1. Restore with a table name different from the backed up table name
2. Restore a backed up table wen HBase cluster is not running at restore time
3. Restore into a live and running cluster

Operationally:
1. How to setup backups in live cluster
2. Setting up intra-DC
3. cross-DC backups
4. Verifying a backup is good

  was:
We have been working on the ability to do backups in HBase with different levels of protection. This is an umbrella task for all the backup related changes.

Roughly here are a few flavors of backups giving increasing levels of guarentees:

1. Per cf backups
2. Multi-cf backups with row atomicity preserved
3. Multi-cf backups with row atomicity and point in time recovery.

On the perf dimension, here is a list of improvements:

1. Copy the files - regular hadoop "cp"
2. Use fast copy - copy blocks and stitch them together, saves top of rack bandwidth
3. Use fast copy with hard links - no file copy, it does only ext3 level linking.

    
> HBase backups
> -------------
>
>                 Key: HBASE-4618
>                 URL: https://issues.apache.org/jira/browse/HBASE-4618
>             Project: HBase
>          Issue Type: Umbrella
>          Components: documentation, regionserver
>            Reporter: Karthik Ranganathan
>            Assignee: Karthik Ranganathan
>
> We have been working on the ability to do backups in HBase with different levels of protection. This is an umbrella task for all the backup related changes. Here are some kinds of changes - will create separate issues for them:
> Roughly here are a few flavors of backups giving increasing levels of guarentees:
> 1. Per cf backups
> 2. Multi-cf backups with row atomicity preserved
> 3. Multi-cf backups with row atomicity and point in time recovery.
> On the perf dimension, here is a list of improvements:
> 1. Copy the files - regular hadoop "cp"
> 2. Use fast copy - copy blocks and stitch them together, saves top of rack bandwidth
> 3. Use fast copy with hard links - no file copy, it does only ext3 level linking.
> On the durability of data side:
> 1. Ability to backup data onto the same racks as those running HBase
> 2. Intra-datacenter backup
> 3. Inter datacenter backup
> Restores:
> 1. Restore with a table name different from the backed up table name
> 2. Restore a backed up table wen HBase cluster is not running at restore time
> 3. Restore into a live and running cluster
> Operationally:
> 1. How to setup backups in live cluster
> 2. Setting up intra-DC
> 3. cross-DC backups
> 4. Verifying a backup is good

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira