You are viewing a plain text version of this content. The canonical link for it is here.
Posted to yarn-issues@hadoop.apache.org by "Varun Vasudev (JIRA)" <ji...@apache.org> on 2015/04/02 20:04:56 UTC

[jira] [Updated] (YARN-2901) Add errors and warning stats to RM, NM web UI

     [ https://issues.apache.org/jira/browse/YARN-2901?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Varun Vasudev updated YARN-2901:
--------------------------------
    Attachment: apache-yarn-2901.5.patch

{quote}
I realized if we set clean-up-threshold > maxUniqueMessages, user can see it, how about doing clean-up in two conditions:
1) User get message, and #message > maxUniqueMessages
2) #messages > message-threshold, we can set the message-threshold to higher to avoid too frequent cleanup.
Sounds good?
{quote}

Makes sense; made the change.

bq. I just tried to move that, it seems no more issues happen, could you check that?

Moved ErrorAndWarningsBlock to hadoop-yarn-server-common. Renamed ErrorsAndWarningsPage in RM and NM to RMErrorsAndWarningsPage and NMErrorsAndWarningsPage.

> Add errors and warning stats to RM, NM web UI
> ---------------------------------------------
>
>                 Key: YARN-2901
>                 URL: https://issues.apache.org/jira/browse/YARN-2901
>             Project: Hadoop YARN
>          Issue Type: New Feature
>          Components: nodemanager, resourcemanager
>            Reporter: Varun Vasudev
>            Assignee: Varun Vasudev
>         Attachments: Exception collapsed.png, Exception expanded.jpg, Screen Shot 2015-03-19 at 7.40.02 PM.png, apache-yarn-2901.0.patch, apache-yarn-2901.1.patch, apache-yarn-2901.2.patch, apache-yarn-2901.3.patch, apache-yarn-2901.4.patch, apache-yarn-2901.5.patch
>
>
> It would be really useful to have statistics on the number of errors and warnings in the RM and NM web UI. I'm thinking about -
> 1. The number of errors and warnings in the past 5 min/1 hour/12 hours/day
> 2. The top 'n'(20?) most common exceptions in the past 5 min/1 hour/12 hours/day
> By errors and warnings I'm referring to the log level.
> I suspect we can probably achieve this by writing a custom appender?(I'm open to suggestions on alternate mechanisms for implementing this).



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)