You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@cassandra.apache.org by "Avraham Kalvo (JIRA)" <ji...@apache.org> on 2019/01/09 07:04:00 UTC
[jira] [Comment Edited] (CASSANDRA-14957) Rolling Restart Of Nodes Cause Dataloss Due To Schema Collision

    [ https://issues.apache.org/jira/browse/CASSANDRA-14957?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16737908#comment-16737908 ] 

Avraham Kalvo edited comment on CASSANDRA-14957 at 1/9/19 7:03 AM:
-------------------------------------------------------------------

Thanks [~jeromatron],

Looking at the data directory for one of the keyspaces in question right now, even a couple of full primary range repairs have completed across the cluster since the outage, the following is apparent:

```
36 drwxr-xr-x. 4 root root 28672 Jan  2 12:07 tasks-bd7200a0156711e88974855d74ee356f
   8 drwxr-xr-x. 4 root root  4096 Jan  9 06:38 tasks-bd750de0156711e8bdc54f7bcdcb851f
```
and with the following sizes:
```
$ du -sh tasks*
2.7G	tasks-bd7200a0156711e88974855d74ee356f
522M	tasks-bd750de0156711e8bdc54f7bcdcb851f
```

Data was not lost from disk, but became no longer available for reads/writes via the database, I.e. - effectively lost to the application.
As far as I know, anti-entropy actions don't take care of the above situation and indeed it needed to be recovered manually as described in the original comment for this issue.
Writes only began to succeed once the schema agreement has settled across all the cluster, Until then, the application was timing out on any request to Cassandra. 

What do you think?


was (Author: via.vokal):
Thanks [~jeromatron],

Looking at the data directory for one of the keyspaces in question right now, even a couple of full primary range repairs have completed across the cluster since the outage, the following is apparent:

```
36 drwxr-xr-x. 4 root root 28672 Jan  2 12:07 tasks-bd7200a0156711e88974855d74ee356f
   8 drwxr-xr-x. 4 root root  4096 Jan  9 06:38 tasks-bd750de0156711e8bdc54f7bcdcb851f
```

Data was not lost from disk, but became no longer available for reads/writes via the database, I.e. - effectively lost to the application.
As far as I know, anti-entropy actions don't take care of the above situation and indeed it needed to be recovered manually as described in the original comment for this issue.
Writes only began to succeed once the schema agreement has settled across all the cluster, Until then, the application was timing out on any request to Cassandra. 

What do you think?

> Rolling Restart Of Nodes Cause Dataloss Due To Schema Collision
> ---------------------------------------------------------------
>
>                 Key: CASSANDRA-14957
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-14957
>             Project: Cassandra
>          Issue Type: Bug
>          Components: Cluster/Schema
>            Reporter: Avraham Kalvo
>            Priority: Major
>
> We were issuing a rolling restart on a mission-critical five node C* cluster.
> The first node which was restarted got the following messages in its system.log:
> ```
> January 2nd 2019, 12:06:37.310 - INFO 12:06:35 Initializing tasks_scheduler_external.tasks
> ```
> ```
> WARN 12:06:39 UnknownColumnFamilyException reading from socket; closing
> org.apache.cassandra.db.UnknownColumnFamilyException: Couldn't find table for cfId bd7200a0-1567-11e8-8974-855d74ee356f. If a table was just created, this is likely due to the schema not being fully propagated. Please wait for schema agreement on table creation.
> at org.apache.cassandra.config.CFMetaData$Serializer.deserialize(CFMetaData.java:1336) ~[apache-cassandra-3.0.10.jar:3.0.10]
> at org.apache.cassandra.db.partitions.PartitionUpdate$PartitionUpdateSerializer.deserialize30(PartitionUpdate.java:660) ~[apache-cassandra-3.0.10.jar:3.0.10]
> at org.apache.cassandra.db.partitions.PartitionUpdate$PartitionUpdateSerializer.deserialize(PartitionUpdate.java:635) ~[apache-cassandra-3.0.10.jar:3.0.10]
> at org.apache.cassandra.db.Mutation$MutationSerializer.deserialize(Mutation.java:330) ~[apache-cassandra-3.0.10.jar:3.0.10]
> at org.apache.cassandra.db.Mutation$MutationSerializer.deserialize(Mutation.java:349) ~[apache-cassandra-3.0.10.jar:3.0.10]
> at org.apache.cassandra.db.Mutation$MutationSerializer.deserialize(Mutation.java:286) ~[apache-cassandra-3.0.10.jar:3.0.10]
> at org.apache.cassandra.net.MessageIn.read(MessageIn.java:98) ~[apache-cassandra-3.0.10.jar:3.0.10]
> at org.apache.cassandra.net.IncomingTcpConnection.receiveMessage(IncomingTcpConnection.java:201) ~[apache-cassandra-3.0.10.jar:3.0.10]
> at org.apache.cassandra.net.IncomingTcpConnection.receiveMessages(IncomingTcpConnection.java:178) ~[apache-cassandra-3.0.10.jar:3.0.10]
> at org.apache.cassandra.net.IncomingTcpConnection.run(IncomingTcpConnection.java:92) ~[apache-cassandra-3.0.10.jar:3.0.10]
> ```
> The latter was then repeated several times across the cluster.
> It was then found out that the table in question `tasks_scheduler_external.tasks` was created with a new schema version after the entire cluster was restarted consecutively and schema agreement settled, which started taking requests leaving the previous version of the schema unavailable for any request, thus generating a data loss to our online system.
> Data loss was recovered by manually copying SSTables from the previous version directory of the schema to the new one followed by `nodetool refresh` to the relevant table.
> The above has repeated itself for several tables across various keyspaces.
> One other thing to mention is that a repair was in place for the first node to be restarted, which was obviously stopped as the daemon was shut down, but this doesn't seem to do with the above at first glance.
> Seems somewhat related to:
> https://issues.apache.org/jira/browse/CASSANDRA-13559



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: commits-unsubscribe@cassandra.apache.org
For additional commands, e-mail: commits-help@cassandra.apache.org