You are viewing a plain text version of this content. The canonical link for it is here.
Posted to jira@kafka.apache.org by "Jason Gustafson (JIRA)" <ji...@apache.org> on 2017/11/23 00:05:00 UTC

[jira] [Created] (KAFKA-6263) Expose metric for group metadata loading duration

Jason Gustafson created KAFKA-6263:
--------------------------------------

             Summary: Expose metric for group metadata loading duration
                 Key: KAFKA-6263
                 URL: https://issues.apache.org/jira/browse/KAFKA-6263
             Project: Kafka
          Issue Type: Improvement
            Reporter: Jason Gustafson


We have seen in several cases where the log cleaner either wasn't enabled or had experienced some failure that __consumer_offsets partitions can grow excessively. When one of these partitions changes leadership, the new coordinator must load the offset cache from the start of the log, which can take arbitrarily long depending on how large the partition has grown (we have seen cases where it took hours). Catching this problem is not always easy because the condition is rare and the symptom just tends to be a long period of inactivity in the consumer group which gradually gets worse over time. It may therefore be useful to have a broker metric for the load time so that it can be monitored and potentially alerted on. Same thing goes for the transaction log 



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)