You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@cassandra.apache.org by "Yuping Wang (JIRA)" <ji...@apache.org> on 2019/06/30 14:42:00 UTC

[jira] [Created] (CASSANDRA-15190) Is there a way to auto-clean up data whenever an old node rejoins the cluster

Yuping Wang created CASSANDRA-15190:
---------------------------------------

             Summary: Is there a way to auto-clean up data whenever an old node rejoins the cluster 
                 Key: CASSANDRA-15190
                 URL: https://issues.apache.org/jira/browse/CASSANDRA-15190
             Project: Cassandra
          Issue Type: Bug
            Reporter: Yuping Wang


Due to specific reasons of our use case,  we have gc grace to 15 minutes instead of default 10 days. Since we have a large amount of nodes in our Cassandra fleet, not surprisingly, we encounter occasionally  node status going from up to down and up again. The problem is when the down node rejoins the cluster after 15 minutes, it automatically adds already deleted data back and causing zombie data.

our questions:
 # Is there a way to not allow a down node to rejoin the cluster?
 # or is there a way to configure rejoining node not adding stale data back regardless of how long the node is down before rejoining
 # or is there a way to auto clean up the data when rejoining ?

We know adding those data back is a conservative approach to avoid data loss but in our specific case, we are not worried about deleted data being revived.... we don’t have such use case. We really need a non-defaul option to never add back deleted data on rejoining nodes.

this functionality will ultimately be a deciding factor on whether we can continue with Cassandra.

 

Please  help!



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: commits-unsubscribe@cassandra.apache.org
For additional commands, e-mail: commits-help@cassandra.apache.org