Posted to dev@ambari.apache.org by "Jonathan Hurley (JIRA)" <ji...@apache.org> on 2016/03/04 20:26:40 UTC

[jira] [Updated] (AMBARI-15303) New Alerts Do Not Honor Existing Maintenance Mode Setting

     [ https://issues.apache.org/jira/browse/AMBARI-15303?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Jonathan Hurley updated AMBARI-15303:
-------------------------------------
    Attachment: AMBARI-15303.patch

> New Alerts Do Not Honor Existing Maintenance Mode Setting
> ---------------------------------------------------------
>
>                 Key: AMBARI-15303
>                 URL: https://issues.apache.org/jira/browse/AMBARI-15303
>             Project: Ambari
>          Issue Type: Bug
>          Components: ambari-server
>    Affects Versions: 2.0.0
>            Reporter: Jonathan Hurley
>            Assignee: Jonathan Hurley
>            Priority: Critical
>             Fix For: 2.2.0
>
>         Attachments: AMBARI-15303.patch
>
>
> Alerts are "suppressed" while in maintenance mode by carrying a {{maintenance_state}} attribute alongside the actual state being reported:
> {code}
>       "Alert": {
>         "cluster_name": "c1",
>         "component_name": "METRICS_COLLECTOR",
>         "definition_id": 43,
>         "definition_name": "ams_metrics_collector_process",
>         "host_name": "c6401.ambari.apache.org",
>         "id": 28,
>         "instance": null,
>         "label": "Metrics Collector Process",
>         "latest_timestamp": 1457108946118,
>         "maintenance_state": "ON",
>         "original_timestamp": 1457108646099,
>         "scope": "ANY",
>         "service_name": "AMBARI_METRICS",
>         "state": "CRITICAL",
>         "text": "Connection failed: [Errno 111] Connection refused to c6401.ambari.apache.org"
>       }
> {code}
> When a host/service/component is placed into maintenance mode (MM), the database is updated so that all affected {{alert_current}} rows have their maintenance state updated as well.
> However, this fails under two scenarios:
> - The alert hasn't been received yet in a brand new cluster
> - The alert definition was disabled, which removed all of its current alerts, and was then re-enabled.
> In both cases, when constructing a new {{AlertCurrentEntity}}, we need to calculate the correct maintenance state.
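
The calculation described above could look roughly like the following. This is only a sketch under stated assumptions, not the code in the attached patch: the class, enum, and method names are hypothetical stand-ins rather than Ambari's actual API, and the rule that an alert is in maintenance mode whenever its host, service, or component is in maintenance mode is assumed for illustration.

```java
// Hypothetical sketch; names do not correspond to real Ambari classes.
final class MaintenanceSketch {

    /** Simplified two-value maintenance state (assumption for this sketch). */
    enum MaintenanceState { OFF, ON }

    /** Minimal stand-in for a row in alert_current. */
    static final class AlertCurrent {
        final String definitionName;
        final MaintenanceState maintenanceState;

        AlertCurrent(String definitionName, MaintenanceState maintenanceState) {
            this.definitionName = definitionName;
            this.maintenanceState = maintenanceState;
        }
    }

    /**
     * Returns ON if any of the scopes the alert belongs to is in maintenance
     * mode. A null argument means the alert has no such scope (for example,
     * a service-level alert has no component).
     */
    static MaintenanceState effectiveState(MaintenanceState host,
                                           MaintenanceState service,
                                           MaintenanceState component) {
        if (host == MaintenanceState.ON
                || service == MaintenanceState.ON
                || component == MaintenanceState.ON) {
            return MaintenanceState.ON;
        }
        return MaintenanceState.OFF;
    }

    /**
     * Builds a new current alert with the maintenance state computed up front,
     * instead of defaulting to OFF and waiting for the next MM toggle.
     */
    static AlertCurrent newAlertCurrent(String definitionName,
                                        MaintenanceState host,
                                        MaintenanceState service,
                                        MaintenanceState component) {
        return new AlertCurrent(definitionName,
                effectiveState(host, service, component));
    }

    public static void main(String[] args) {
        // The service is already in MM when the first alert arrives.
        AlertCurrent alert = newAlertCurrent("ams_metrics_collector_process",
                MaintenanceState.OFF, MaintenanceState.ON, MaintenanceState.OFF);
        System.out.println(alert.definitionName + " -> " + alert.maintenanceState);
    }
}
```

The point of the sketch is that the state is derived at construction time, so both failing scenarios (a first-ever alert and a re-enabled definition) would pick up an MM setting that was applied before the row existed.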



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)