You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@lucene.apache.org by "Yonik Seeley (JIRA)" <ji...@apache.org> on 2015/04/03 17:36:53 UTC

[jira] [Commented] (SOLR-7347) clock skew can cause data loss

    [ https://issues.apache.org/jira/browse/SOLR-7347?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14394574#comment-14394574 ] 

Yonik Seeley commented on SOLR-7347:
------------------------------------

The work in SOLR-7332 may be useful to fix this. 

> clock skew can cause data loss
> ------------------------------
>
>                 Key: SOLR-7347
>                 URL: https://issues.apache.org/jira/browse/SOLR-7347
>             Project: Solr
>          Issue Type: Bug
>          Components: SolrCloud
>            Reporter: Yonik Seeley
>
> The high bits of versions are created using the system clock.
> System clock skew on the order of magnitude of time it takes for one leader to receive it's last update to the time it takes another replica to become a leader can cause data loss for any updates to the same document until the new leaders clock catches up with the old leaders clock.
> 1) replica1 is the leader and indexes document A, choosing version X (and forwards to replicas)
> 2) replica1 goes down
> 3) replica2 becomes the new leader
> 4) replica2 indexes an update for document A, and chooses version Y (which is less than X due to clock skew) and forwards to replica3
> 5) replica3 checks for reordered updates, finds version X and thus drops version Y
> This should be rare... you need a big enough clock skew and updates to the same document with different leaders within that time window.  We should still fix this of course.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org