You are viewing a plain text version of this content. The canonical link for it is here.
Posted to jira@kafka.apache.org by "Sophie Blee-Goldman (Jira)" <ji...@apache.org> on 2020/06/11 02:22:00 UTC

[jira] [Created] (KAFKA-10144) Corrupted standby tasks are not always cleaned up

Sophie Blee-Goldman created KAFKA-10144:
-------------------------------------------

             Summary: Corrupted standby tasks are not always cleaned up
                 Key: KAFKA-10144
                 URL: https://issues.apache.org/jira/browse/KAFKA-10144
             Project: Kafka
          Issue Type: Bug
          Components: streams
            Reporter: Sophie Blee-Goldman
            Assignee: Sophie Blee-Goldman
             Fix For: 2.6.0


Thread death on the 2.6-eos-beta soak was due to re-registration of a standby task changelog that was already registered. The root cause was that the task had been marked corrupted, but `commit` threw a TaskMigratedException before we could get to calling TaskManager#handleCorruption and properly clean up the task.

For corrupted active tasks this is not a problem, since #handleLostAll will then finish the cleanup. But we intentionally don't clear standbys tasks on TaskMigratedException, leaving the task corrupted and partially registered



--
This message was sent by Atlassian Jira
(v8.3.4#803005)