You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@ambari.apache.org by "Nicholas Yao (JIRA)" <ji...@apache.org> on 2014/09/12 18:19:33 UTC
[jira] [Created] (AMBARI-7284) Hadoop cluster alerts have not been
updated for Hadoop 2.4 and 2.5
Nicholas Yao created AMBARI-7284:
------------------------------------
Summary: Hadoop cluster alerts have not been updated for Hadoop 2.4 and 2.5
Key: AMBARI-7284
URL: https://issues.apache.org/jira/browse/AMBARI-7284
Project: Ambari
Issue Type: Bug
Affects Versions: 1.6.0
Reporter: Nicholas Yao
many /var/log/message alerts we keyed off of previously are no longer working or valid. It appears that many hadoop 1.x terms such as jobtracker, tasktracker and templeton still exist.
I believe existing rules need to be modified for the follow service name changes:
resourcemanager_process_down
resourcemanager_process_down_ok
resourcemanager_rpc_latency
resourcemanager_rpc_latency_ok
resourcemanager_cpu_utilization
resourcemanager_cpu_utilization_ok
nodemanagers_down
nodemanagers_down_ok
nodemanager_process_down
nodemanager_process_down_ok
webhcat_down
webhcat_down_ok
It also appears that existing messages are getting improperly matched as we see the following HADOOP_UNKNOWN_MSG in /var/log/messages:
Jul 15 10:36:34 pitH1 nagios[35331]: Warning: Hadoop: HADOOP_UNKNOWN_MSG# Event Host=pitH1.td.teradata.com Service Description=HDFS::Percent DataNodes with space available(WARNING), WARNING: total:6, affected:1
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)