You are viewing a plain text version of this content. The canonical link for it is here.

Posted to issues@kudu.apache.org by "Grant Henke (Jira)" <ji...@apache.org> on 2020/06/03 02:52:00 UTC

[jira] [Resolved] (KUDU-2435) Consider non-fatal response to "Tried to update clock beyond the max. error"

     [ https://issues.apache.org/jira/browse/KUDU-2435?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Grant Henke resolved KUDU-2435.
-------------------------------
    Fix Version/s: NA
       Resolution: Cannot Reproduce

> Consider non-fatal response to "Tried to update clock beyond the max. error"
> ----------------------------------------------------------------------------
>
>                 Key: KUDU-2435
>                 URL: https://issues.apache.org/jira/browse/KUDU-2435
>             Project: Kudu
>          Issue Type: Improvement
>          Components: server
>    Affects Versions: 1.7.0
>            Reporter: Mike Percy
>            Priority: Major
>             Fix For: NA
>
>
> Currently when one server is skewed, and it tries to replicate to other servers in a cluster, it can cause the rest of the servers in the cluster to crash with the following message:
> {code:java}
> F0428 05:27:23.480379 104613 raft_consensus.cc:1264] Check failed: _s.ok() Bad status: Invalid argument: Tried to update clock beyond the max. error.{code}
> We should consider alternative ways of handling this issue. Maybe the replicas can reject requests that would cause this condition until NTP has a chance to correct the clock of the offending server. We should also consider whether clock skew should be taken into account when doing leader elections... if a server is not within the max clock error of the voter then maybe the vote should be withheld.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)