You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@kudu.apache.org by "Grant Henke (Jira)" <ji...@apache.org> on 2020/06/01 18:17:00 UTC

[jira] [Updated] (KUDU-1525) Create metrics for errors

     [ https://issues.apache.org/jira/browse/KUDU-1525?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Grant Henke updated KUDU-1525:
------------------------------
    Component/s: metrics

> Create metrics for errors
> -------------------------
>
>                 Key: KUDU-1525
>                 URL: https://issues.apache.org/jira/browse/KUDU-1525
>             Project: Kudu
>          Issue Type: Improvement
>          Components: metrics, supportability
>            Reporter: Jean-Daniel Cryans
>            Priority: Major
>
> There's a class of issue that can be hard to debug, namely when things fail semi-silently on the client-side. We currently have glog_warning_messages and glog_error_messages, but it could be good to have more granular metrics. A few I have in mind:
>  - rpc errors, basically any "recv error"
>  - server-level errors, like when it says TOO BUSY.
>  - any kind of insert rejection, right now we have row key duplicates and memory pressure, but we're missing things like txn_tracker rejections, "not a leader".
>  - raft errors like dropping a follower because we don't have the WALs around and it's lagging too much.
> There's probably more but the above would be a good start.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)