You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@kudu.apache.org by "Mike Percy (JIRA)" <ji...@apache.org> on 2017/08/28 19:07:00 UTC

[jira] [Resolved] (KUDU-2114) Master asks tservers to re-delete tombstoned tablets when reported

     [ https://issues.apache.org/jira/browse/KUDU-2114?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Mike Percy resolved KUDU-2114.
------------------------------
       Resolution: Fixed
    Fix Version/s: 1.5.0

Pushed in 22a19d93ab9f1df339b8fe0fde094c406ce5cb2b

> Master asks tservers to re-delete tombstoned tablets when reported
> ------------------------------------------------------------------
>
>                 Key: KUDU-2114
>                 URL: https://issues.apache.org/jira/browse/KUDU-2114
>             Project: Kudu
>          Issue Type: Bug
>          Components: consensus, master, tserver
>    Affects Versions: 1.5.0
>            Reporter: Adar Dembo
>            Assignee: Mike Percy
>            Priority: Blocker
>             Fix For: 1.5.0
>
>
> Commit 5bca7d8 changed the behavior of tombstoned replicas such that they now retain RaftConsensus instances despite being in the TOMBSTONED state. This means that some additional consensus-related state is included in their tablet report entries when a full tablet report is sent to the master. The master evaluates this consensus-related state when considering whether an evicted replica should be deleted, but it does not consider the TOMBSTONED state. As a result, the master notices that these tombstones replicas have been evicted, and asks the hosting tserver to delete them again. This will happen any time there is a tablet report.
> This needs to be fixed, whether by excluding tombstone consensus state from tablet reports, or by changing the master to consider the tablet's overall state when deciding whether to delete it.
> When observed on a live cluster, it was further observed that the tablet deletion requests were rather expensive. It appears that a DeleteTablet RPC on a tombstone is not a no-op; it always flushes the superblock twice, which generates two fsyncs. This should also be addressed.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)