You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@cassandra.apache.org by cass savy <ca...@gmail.com> on 2017/05/11 19:31:01 UTC

AWS Cassandra backup/Restore tools

AWS Backup/Restore process/tools for C*/DSE C*:

Has anyone used Opscenter 6.1 backup tool to backup/restore data for larger
datasets online ?

If yes, did you run into issues using that tool to backup/restore data in
PROD that caused any performance or any other impact to the cluster?

If no, what are other tools that people have used or recommended for backup
and restore of Cassandra keyspaces?

Please advice.

RE: AWS Cassandra backup/Restore tools

Posted by "Durity, Sean R" <SE...@homedepot.com>.
Datos IO has a backup/restore product for Cassandra that another team here has used successfully. It solves many of the problems inherent with sstable captures. Without something like it, restores are a nightmare with any volume of data. The downtime required and the loss of data since the snapshot are usually not worth it.


Sean Durity

From: Alexander Dejanovski [mailto:alex@thelastpickle.com]
Sent: Friday, May 12, 2017 12:14 PM
To: Manikandan Srinivasan <ms...@datastax.com>; Nitan Kainth <ni...@bamlabs.com>
Cc: Blake Eggleston <be...@apple.com>; cass savy <ca...@gmail.com>; user@cassandra.apache.org
Subject: Re: AWS Cassandra backup/Restore tools

Hi,

here are the main techniques that I know of to perform backups for Cassandra :

  *   Tablesnap (https://github.com/JeremyGrosser/tablesnap<https://urldefense.proofpoint.com/v2/url?u=https-3A__github.com_JeremyGrosser_tablesnap&d=DwMFaQ&c=MtgQEAMQGqekjTjiAhkudQ&r=aC_gxC6z_4f9GLlbWiKzHm1vucZTtVYWDDvyLkh8IaQ&m=hrZJx9SNtdlofcJElpjcjw4rp4rlAq8nZKSSsefCvCc&s=DMw6BkRjlkS9LM5RzcvTamwv8fj6_Czd4RcKBmJnUxc&e=>) : performs continuous backups on S3. Comes with tableslurp to restore backups (one table at a time only) and tablechop to delete outdated sstables from S3.
  *   incremental backup : activate it in the cassandra.yaml file and it will create snapshots for all newly flushed SSTables. It's up to you to move the snapshots off-node and delete them. I don't really like that technique since it creates a lot of small sstables that eventually contain a lot of outdated data. Upon restore you'll have to wait until compaction catches up on compacting all the history (which could take a while and use a lot of power). Your backups could also grow indefinitely with this technique since there's no compaction, so no purge. You'll have to build the restore script/procedure.
  *   scheduled snapshots : you perform full snapshots by yourself and move them off node. You'll have to build the restore script/procedure.
  *   EBS snapshots : probably the easiest way to perform backups if you are using M4/R4 instances on AWS.

Cheers,

On Thu, May 11, 2017 at 11:01 PM Manikandan Srinivasan <ms...@datastax.com>> wrote:
Blake is correct. OpsCenter 6.0 and up doesn't work with OSS C*. @Nitan: We have made some substantial changes to the Opscenter 6.1 backup service, specifically when it comes to S3 backups. Having said this, I am not going to be sale-sy here. If folks need some help or need more clarity to know more about these improvements, please send me an email directly: msrinivasan@datastax.com<ma...@datastax.com>

Regards
Mani

On Thu, May 11, 2017 at 1:54 PM, Nitan Kainth <ni...@bamlabs.com>> wrote:
Also , Opscenter backup/restore does not work for large databases

Sent from my iPhone

On May 11, 2017, at 3:41 PM, Blake Eggleston <be...@apple.com>> wrote:
OpsCenter 6.0 and up don't work with Cassandra.


On May 11, 2017 at 12:31:08 PM, cass savy (casssavy@gmail.com<ma...@gmail.com>) wrote:
AWS Backup/Restore process/tools for C*/DSE C*:

Has anyone used Opscenter 6.1 backup tool to backup/restore data for larger datasets online ?

If yes, did you run into issues using that tool to backup/restore data in PROD that caused any performance or any other impact to the cluster?

If no, what are other tools that people have used or recommended for backup and restore of Cassandra keyspaces?

Please advice.





--
Regards,

Manikandan Srinivasan

Director, Product Management| +1.408.887.3686<tel:%2B1.408.887.3686> | manikandan.srinivasan@datastax.com<ma...@datastax.com>


[Image removed by sender. linkedin.png]<https://urldefense.proofpoint.com/v2/url?u=http-3A__www.linkedin.com_in_srinivm_&d=DwMFaQ&c=MtgQEAMQGqekjTjiAhkudQ&r=aC_gxC6z_4f9GLlbWiKzHm1vucZTtVYWDDvyLkh8IaQ&m=hrZJx9SNtdlofcJElpjcjw4rp4rlAq8nZKSSsefCvCc&s=QVzYL31K-iWGptuTJeKSX2hMW9lrGn5HP3X9p-A8wO4&e=>[Image removed by sender. facebook.png]<https://urldefense.proofpoint.com/v2/url?u=https-3A__www.facebook.com_datastax&d=DwMFaQ&c=MtgQEAMQGqekjTjiAhkudQ&r=aC_gxC6z_4f9GLlbWiKzHm1vucZTtVYWDDvyLkh8IaQ&m=hrZJx9SNtdlofcJElpjcjw4rp4rlAq8nZKSSsefCvCc&s=tCXQZRynu6vGzUuBNtKyhKS0qf1FZcZPAlwGw_5HVBM&e=>[Image removed by sender. twitter.png]<https://urldefense.proofpoint.com/v2/url?u=https-3A__twitter.com_mani-5Fsrini&d=DwMFaQ&c=MtgQEAMQGqekjTjiAhkudQ&r=aC_gxC6z_4f9GLlbWiKzHm1vucZTtVYWDDvyLkh8IaQ&m=hrZJx9SNtdlofcJElpjcjw4rp4rlAq8nZKSSsefCvCc&s=fH6hn8l2gJJVVmpCOoKdXA80OgFPqt6pt3bjR9pzjxI&e=>[Image removed by sender. g+.png]<https://urldefense.proofpoint.com/v2/url?u=https-3A__plus.google.com_-2BDatastax_about&d=DwMFaQ&c=MtgQEAMQGqekjTjiAhkudQ&r=aC_gxC6z_4f9GLlbWiKzHm1vucZTtVYWDDvyLkh8IaQ&m=hrZJx9SNtdlofcJElpjcjw4rp4rlAq8nZKSSsefCvCc&s=2ZG39fHN9Oix46hmXFKRQH5M0AVqA7h-9bqZ7VWvguE&e=>[Image removed by sender.]<https://urldefense.proofpoint.com/v2/url?u=http-3A__feeds.feedburner.com_datastax&d=DwMFaQ&c=MtgQEAMQGqekjTjiAhkudQ&r=aC_gxC6z_4f9GLlbWiKzHm1vucZTtVYWDDvyLkh8IaQ&m=hrZJx9SNtdlofcJElpjcjw4rp4rlAq8nZKSSsefCvCc&s=0Ph6BMhFj73fTtVLvGD9NdZUD0C1ns2EPRjV-xVEDak&e=>[Image removed by sender.]

--
-----------------
Alexander Dejanovski
France
@alexanderdeja

Consultant
Apache Cassandra Consulting
http://www.thelastpickle.com<https://urldefense.proofpoint.com/v2/url?u=http-3A__www.thelastpickle.com_&d=DwMFaQ&c=MtgQEAMQGqekjTjiAhkudQ&r=aC_gxC6z_4f9GLlbWiKzHm1vucZTtVYWDDvyLkh8IaQ&m=hrZJx9SNtdlofcJElpjcjw4rp4rlAq8nZKSSsefCvCc&s=cF2qBFSRkVhyN58YUr5lR9DLcBy0ql8GBMYrBsT1HiA&e=>

________________________________

The information in this Internet Email is confidential and may be legally privileged. It is intended solely for the addressee. Access to this Email by anyone else is unauthorized. If you are not the intended recipient, any disclosure, copying, distribution or any action taken or omitted to be taken in reliance on it, is prohibited and may be unlawful. When addressed to our clients any opinions or advice contained in this Email are subject to the terms and conditions expressed in any applicable governing The Home Depot terms of business or client engagement letter. The Home Depot disclaims all responsibility and liability for the accuracy and content of this attachment and for any damages or losses arising from any inaccuracies, errors, viruses, e.g., worms, trojan horses, etc., or other items of a destructive nature, which may be contained in this attachment and shall not be liable for direct, indirect, consequential or special damages in connection with this e-mail message or its attachment.

Re: AWS Cassandra backup/Restore tools

Posted by Alexander Dejanovski <al...@thelastpickle.com>.
Hi,

here are the main techniques that I know of to perform backups for
Cassandra :

   - Tablesnap (https://github.com/JeremyGrosser/tablesnap) : performs
   continuous backups on S3. Comes with tableslurp to restore backups (one
   table at a time only) and tablechop to delete outdated sstables from S3.
   - incremental backup : activate it in the cassandra.yaml file and it
   will create snapshots for all newly flushed SSTables. It's up to you to
   move the snapshots off-node and delete them. I don't really like that
   technique since it creates a lot of small sstables that eventually contain
   a lot of outdated data. Upon restore you'll have to wait until compaction
   catches up on compacting all the history (which could take a while and use
   a lot of power). Your backups could also grow indefinitely with this
   technique since there's no compaction, so no purge. You'll have to build
   the restore script/procedure.
   - scheduled snapshots : you perform full snapshots by yourself and move
   them off node. You'll have to build the restore script/procedure.
   - EBS snapshots : probably the easiest way to perform backups if you are
   using M4/R4 instances on AWS.


Cheers,

On Thu, May 11, 2017 at 11:01 PM Manikandan Srinivasan <
msrinivasan@datastax.com> wrote:

> Blake is correct. OpsCenter 6.0 and up doesn't work with OSS C*. @Nitan:
> We have made some substantial changes to the Opscenter 6.1 backup service,
> specifically when it comes to S3 backups. Having said this, I am not going
> to be sale-sy here. If folks need some help or need more clarity to know
> more about these improvements, please send me an email directly:
> msrinivasan@datastax.com
>
> Regards
> Mani
>
> On Thu, May 11, 2017 at 1:54 PM, Nitan Kainth <ni...@bamlabs.com> wrote:
>
>> Also , Opscenter backup/restore does not work for large databases
>>
>> Sent from my iPhone
>>
>> On May 11, 2017, at 3:41 PM, Blake Eggleston <be...@apple.com>
>> wrote:
>>
>> OpsCenter 6.0 and up don't work with Cassandra.
>>
>> On May 11, 2017 at 12:31:08 PM, cass savy (casssavy@gmail.com) wrote:
>>
>> AWS Backup/Restore process/tools for C*/DSE C*:
>>
>> Has anyone used Opscenter 6.1 backup tool to backup/restore data for
>> larger datasets online ?
>>
>> If yes, did you run into issues using that tool to backup/restore data in
>> PROD that caused any performance or any other impact to the cluster?
>>
>> If no, what are other tools that people have used or recommended for
>> backup and restore of Cassandra keyspaces?
>>
>> Please advice.
>>
>>
>>
>
>
> --
> Regards,
>
> Manikandan Srinivasan
>
> Director, Product Management| +1.408.887.3686 |
> manikandan.srinivasan@datastax.com
>
> [image: linkedin.png] <http://www.linkedin.com/in/srinivm/> [image:
> facebook.png] <https://www.facebook.com/datastax> [image: twitter.png]
> <https://twitter.com/mani_srini> [image: g+.png]
> <https://plus.google.com/+Datastax/about>
> <http://feeds.feedburner.com/datastax>
>
> --
-----------------
Alexander Dejanovski
France
@alexanderdeja

Consultant
Apache Cassandra Consulting
http://www.thelastpickle.com

Re: AWS Cassandra backup/Restore tools

Posted by Manikandan Srinivasan <ms...@datastax.com>.
Blake is correct. OpsCenter 6.0 and up doesn't work with OSS C*. @Nitan: We
have made some substantial changes to the Opscenter 6.1 backup service,
specifically when it comes to S3 backups. Having said this, I am not going
to be sale-sy here. If folks need some help or need more clarity to know
more about these improvements, please send me an email directly:
msrinivasan@datastax.com

Regards
Mani

On Thu, May 11, 2017 at 1:54 PM, Nitan Kainth <ni...@bamlabs.com> wrote:

> Also , Opscenter backup/restore does not work for large databases
>
> Sent from my iPhone
>
> On May 11, 2017, at 3:41 PM, Blake Eggleston <be...@apple.com> wrote:
>
> OpsCenter 6.0 and up don't work with Cassandra.
>
> On May 11, 2017 at 12:31:08 PM, cass savy (casssavy@gmail.com) wrote:
>
> AWS Backup/Restore process/tools for C*/DSE C*:
>
> Has anyone used Opscenter 6.1 backup tool to backup/restore data for
> larger datasets online ?
>
> If yes, did you run into issues using that tool to backup/restore data in
> PROD that caused any performance or any other impact to the cluster?
>
> If no, what are other tools that people have used or recommended for
> backup and restore of Cassandra keyspaces?
>
> Please advice.
>
>
>


-- 
Regards,

Manikandan Srinivasan

Director, Product Management| +1.408.887.3686 |
manikandan.srinivasan@datastax.com

[image: linkedin.png] <http://www.linkedin.com/in/srinivm/> [image:
facebook.png] <https://www.facebook.com/datastax> [image: twitter.png]
<https://twitter.com/mani_srini> [image: g+.png]
<https://plus.google.com/+Datastax/about>
<http://feeds.feedburner.com/datastax>

Re: AWS Cassandra backup/Restore tools

Posted by Nitan Kainth <ni...@bamlabs.com>.
Also , Opscenter backup/restore does not work for large databases 

Sent from my iPhone

> On May 11, 2017, at 3:41 PM, Blake Eggleston <be...@apple.com> wrote:
> 
> OpsCenter 6.0 and up don't work with Cassandra.
> 
>> On May 11, 2017 at 12:31:08 PM, cass savy (casssavy@gmail.com) wrote:
>> 
>> AWS Backup/Restore process/tools for C*/DSE C*:
>> 
>> Has anyone used Opscenter 6.1 backup tool to backup/restore data for larger datasets online ?
>> 
>> If yes, did you run into issues using that tool to backup/restore data in PROD that caused any performance or any other impact to the cluster?
>> 
>> If no, what are other tools that people have used or recommended for backup and restore of Cassandra keyspaces?
>> 
>> Please advice.
>> 
>> 

Re: AWS Cassandra backup/Restore tools

Posted by Blake Eggleston <be...@apple.com>.
OpsCenter 6.0 and up don't work with Cassandra.

On May 11, 2017 at 12:31:08 PM, cass savy (casssavy@gmail.com) wrote:

AWS Backup/Restore process/tools for C*/DSE C*:

Has anyone used Opscenter 6.1 backup tool to backup/restore data for larger datasets online ?

If yes, did you run into issues using that tool to backup/restore data in PROD that caused any performance or any other impact to the cluster?

If no, what are other tools that people have used or recommended for backup and restore of Cassandra keyspaces?

Please advice.