You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@cassandra.apache.org by "Malte Pickhan (JIRA)" <ji...@apache.org> on 2016/11/23 07:04:59 UTC

[jira] [Created] (CASSANDRA-12947) Repair not replicating data

Malte Pickhan created CASSANDRA-12947:
-----------------------------------------

             Summary: Repair not replicating data
                 Key: CASSANDRA-12947
                 URL: https://issues.apache.org/jira/browse/CASSANDRA-12947
             Project: Cassandra
          Issue Type: Bug
         Environment: Cassandra 3.7
Single DC
7 Nodes
RF 3
NetworkTopologyStrategy
OS: Ubuntu
            Reporter: Malte Pickhan


We experienced strange behaviour of our C* cluster last week.

In the logs we've seen multiple requests in the logs where we the values read from Cassandra actually have been 'null'. 
When running the query on a node, we sometimes received the result and sometimes not.

When setting the CL to LOCAL_QUORUM everything was fine. So far nothing unusual, probably the dataset wasn't replicated to one of the nodes.

When turning on the tracing and running the query there was following intersting line:

{quote}
Initiating read-repair [SharedPool-Worker-2] | 2016-11-18 10:17:47.528000 |   $PUBLIC_IP |            126 | 127.0.0.1
 Digest mismatch: org.apache.cassandra.service.DigestMismatchException: Mismatch for key DecoratedKey(-5887526567589486157, 3130333031303338383436303937) (db1e86d507513ff12ba95f0eff984b60 vs d41d8cd98f00b204e9800998ecf8427e) [ReadRepairStage:1]
{quote}

This is probably related to CASSANDRA-12090?

The interesting part is, after that we've run a 'nodetool repair -pr', after that the behaviour was still the same and the data randomly not available, depending on which node was hit.

Only after running a 'nodetool repair -pr -full' the issue was gone.

Did we miss something here? The point that's bothering me is that the dataset was not replicated.

Worth to note is probably that some weeks ago we've hit the bug CASSANDRA-12694 and fixed it by scrubbing some tables.

Any hints/help are appreciated.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)