You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@kafka.apache.org by "Sophie Blee-Goldman (Jira)" <ji...@apache.org> on 2019/11/13 00:09:00 UTC

[jira] [Created] (KAFKA-9177) Pause completed partitions on restore consumer

Sophie Blee-Goldman created KAFKA-9177:
------------------------------------------

             Summary: Pause completed partitions on restore consumer
                 Key: KAFKA-9177
                 URL: https://issues.apache.org/jira/browse/KAFKA-9177
             Project: Kafka
          Issue Type: Improvement
          Components: streams
            Reporter: Sophie Blee-Goldman


The StoreChangelogReader is responsible for tracking and restoring active tasks, but once a store has finished restoring it will continue polling for records on that partition.

Ordinarily this doesn't make a difference as a store is not completely restored until its entire changelog has been read, so there are no more records for poll to return anyway. But if the restoring state is actually an optimized source KTable, the changelog is just the source topic and poll will keep returning records for that partition until all stores have been restored.

Note that this isn't a correctness issue since it's just the restore consumer, but it is wasteful to be polling for records and throwing them away. We should pause completed partitions in StoreChangelogReader so we don't slow down the restore consumer in reading from the unfinished changelog topics, and avoid wasted network.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)