You are viewing a plain text version of this content. The canonical link for it is here.
Posted to jira@kafka.apache.org by "ASF GitHub Bot (JIRA)" <ji...@apache.org> on 2019/03/28 15:02:01 UTC

[jira] [Commented] (KAFKA-7981) Add Replica Fetcher and Log Cleaner Count Metrics

    [ https://issues.apache.org/jira/browse/KAFKA-7981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16804012#comment-16804012 ] 

ASF GitHub Bot commented on KAFKA-7981:
---------------------------------------

viktorsomogyi commented on pull request #6514: KAFKA-7981: Add fetcher and log cleaner thread count metrics
URL: https://github.com/apache/kafka/pull/6514
 
 
   ### Committer Checklist (excluded from commit message)
   - [ ] Verify design and implementation 
   - [ ] Verify test coverage and CI build status
   - [ ] Verify documentation (including upgrade notes)
   
 
----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
users@infra.apache.org


> Add Replica Fetcher and Log Cleaner Count Metrics
> -------------------------------------------------
>
>                 Key: KAFKA-7981
>                 URL: https://issues.apache.org/jira/browse/KAFKA-7981
>             Project: Kafka
>          Issue Type: Improvement
>          Components: metrics
>    Affects Versions: 2.3.0
>            Reporter: Viktor Somogyi-Vass
>            Assignee: Viktor Somogyi-Vass
>            Priority: Major
>              Labels: kip
>
> In some occasions we detected errors where replica fetcher threads or log cleaners died because of an unrecoverable error and caused more serious issues in the brokers (from lagging to offline replicas, filling up disks, etc.). It would often help if the monitoring systems attached to Kafka could detect these problems early on as it would allow a prompt response from the user and the greater possibility of capturing the root cause.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)