Posted to jira@kafka.apache.org by "Jun Rao (Jira)" <ji...@apache.org> on 2023/06/05 18:03:00 UTC

[jira] [Created] (KAFKA-15058) Improve the accuracy of Histogram in client metric

Jun Rao created KAFKA-15058:
-------------------------------

             Summary: Improve the accuracy of Histogram in client metric
                 Key: KAFKA-15058
                 URL: https://issues.apache.org/jira/browse/KAFKA-15058
             Project: Kafka
          Issue Type: Improvement
          Components: clients
            Reporter: Jun Rao


The Histogram type (org.apache.kafka.common.metrics.stats) in KafkaMetrics in the client module statically divides the value space into a fixed number of buckets and returns only values that lie on a bucket boundary. As a result, a reported histogram value may not match any value that was actually recorded. Yammer's Histogram, on the other hand, uses reservoir sampling: the reported value is always one of the recorded values and is likely more accurate. Because of this, the Histogram type in the client metrics hasn't been widely used. It would be useful to improve Histogram in the client metrics so that it is more accurate.
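
To illustrate the difference described above, here is a minimal, self-contained sketch. It is not the Kafka or Yammer implementation: the class HistogramSketch, both method names, the equal-width bucket layout, the choice to report the upper bucket boundary, and the use of Algorithm R for the reservoir are all assumptions made for illustration only.

```java
import java.util.Arrays;
import java.util.Random;

// Hypothetical illustration only; not the Kafka or Yammer code.
final class HistogramSketch {

    // Fixed-bucket estimate: split [min, max) into equal-width buckets, count
    // the values per bucket, and report the upper boundary of the bucket in
    // which the cumulative count first reaches the requested quantile.
    // Assumes values is non-empty and buckets > 0.
    static double bucketQuantile(double[] values, double min, double max, int buckets, double q) {
        long[] counts = new long[buckets];
        double width = (max - min) / buckets;
        for (double v : values) {
            int i = (int) ((v - min) / width);
            // Clamp out-of-range values into the first or last bucket.
            counts[Math.max(0, Math.min(buckets - 1, i))]++;
        }
        long target = (long) Math.ceil(q * values.length);
        long seen = 0;
        for (int i = 0; i < buckets; i++) {
            seen += counts[i];
            if (seen >= target) {
                return min + (i + 1) * width; // a boundary, not a recorded value
            }
        }
        return max;
    }

    // Reservoir estimate (Algorithm R): keep a uniform random sample of at
    // most `size` recorded values, then report the sample element at the
    // quantile rank. Assumes values is non-empty and size > 0.
    static double reservoirQuantile(double[] values, int size, double q, Random rnd) {
        double[] sample = new double[Math.min(size, values.length)];
        for (int n = 0; n < values.length; n++) {
            if (n < sample.length) {
                sample[n] = values[n];
            } else {
                // Replace a random slot with probability sample.length / (n + 1).
                int j = rnd.nextInt(n + 1);
                if (j < sample.length) {
                    sample[j] = values[n];
                }
            }
        }
        Arrays.sort(sample);
        int rank = (int) Math.ceil(q * sample.length) - 1;
        return sample[Math.max(0, rank)]; // always one of the recorded values
    }

    public static void main(String[] args) {
        double[] latenciesMs = {3, 7, 12, 18, 21, 34, 47, 52, 68, 91};
        System.out.println(bucketQuantile(latenciesMs, 0, 100, 4, 0.5));
        System.out.println(reservoirQuantile(latenciesMs, 10, 0.5, new Random(42)));
    }
}
```

With these ten sample values and four buckets over [0, 100), the bucket-based median is 25.0, which was never recorded, while the reservoir (large enough here to hold every value) reports 21.0, which was. A smaller reservoir would still return a recorded value, but which one would depend on the random sample.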



--
This message was sent by Atlassian Jira
(v8.20.10#820010)