Posted to dev@ambari.apache.org by "Aravindan Vijayan (JIRA)" <ji...@apache.org> on 2016/01/21 20:40:39 UTC
[jira] [Commented] (AMBARI-12376) False Ambari alerts after Ambari
server reboot on secured cluster
[ https://issues.apache.org/jira/browse/AMBARI-12376?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15111167#comment-15111167 ]
Aravindan Vijayan commented on AMBARI-12376:
--------------------------------------------
[~ddtd]
This is not an ambari-metrics issue, so I removed the label.
Do you still need help getting around this issue?
> False Ambari alerts after Ambari server reboot on secured cluster
> -----------------------------------------------------------------
>
> Key: AMBARI-12376
> URL: https://issues.apache.org/jira/browse/AMBARI-12376
> Project: Ambari
> Issue Type: Bug
> Affects Versions: 2.1.0
> Reporter: Dave Disser
>
> HDP 2.3 cluster with Ambari 2.1 build #1319
> Cluster with HA Namenode, HA ResourceManager, HA Oozie, several other HA services installed via blueprint.
> After rebooting the Ambari server host (which also has NN, ZK, JN instances), several Ambari alerts persist in this form:
> Percent NodeManagers Available:
> affected: [1], total: [3]
> NodeManager Health :
> Connection failed to http://roller4:8042/ws/v1/node/info (Execution of '/usr/bin/kinit -l 5m -c /var/lib/ambari-agent/data/tmp/nm_health_alert_cc_14246ce5caacfc93af574dc4b896debd -kt /etc/security/keytabs/spnego.service.keytab HTTP/roller4@VM6C1.HADOOP.COM > /dev/null' returned 1. kinit(v5): Cannot contact any KDC for realm 'VM6C1.HADOOP.COM' while getting initial credentials)
> NodeManager Web UI:
> Connection failed to http://roller5:8042 (Execution of '/usr/bin/kinit -l 5m -c /var/lib/ambari-agent/data/tmp/web_alert_cc_866ff322618d226db66f6f893a512256 -kt /etc/security/keytabs/spnego.service.keytab HTTP/roller5@VM6C1.HADOOP.COM > /dev/null' returned 1. kinit(v5): Cannot contact any KDC for realm 'VM6C1.HADOOP.COM' while getting initial credentials)
> (some FQDNs redacted)
> Failures are not consistent from test to test, but persist until ambari-server and ambari-agent are restarted on all nodes.
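For triage, it may help to separate alerts caused by the failed kinit step from alerts where the NodeManager endpoint itself is down. The following is a minimal sketch in Python, assuming the alert text keeps the exact wording quoted in the description. The names parse_alert and is_kdc_unreachable are hypothetical helpers for illustration and are not part of Ambari.

```python
import re

# Pattern for the alert wording quoted in this report. This is an assumption:
# other Ambari versions or alert types may phrase the message differently.
ALERT_RE = re.compile(
    r"Connection failed to (?P<url>\S+) "
    r"\(Execution of '(?P<command>.*?)' returned (?P<code>\d+)\. "
    r"(?P<error>.*)\)\s*$",
    re.DOTALL,
)


def parse_alert(text):
    """Split an alert message into URL, kinit command, exit code and error.

    Returns None when the text does not match the assumed format.
    """
    match = ALERT_RE.search(text.strip())
    if match is None:
        return None
    error = match.group("error")
    realm = re.search(r"for realm '([^']+)'", error)
    return {
        "url": match.group("url"),
        "command": match.group("command"),
        "code": int(match.group("code")),
        "error": error,
        "realm": realm.group(1) if realm else None,
    }


def is_kdc_unreachable(text):
    """True when the alert was raised because kinit could not reach a KDC."""
    parsed = parse_alert(text)
    return parsed is not None and "Cannot contact any KDC" in parsed["error"]
```

An alert for which is_kdc_unreachable returns True points at the Kerberos ticket step on the agent host, not at the NodeManager web endpoint named in the alert.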
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)