You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@ambari.apache.org by "Dmytro Sen (JIRA)" <ji...@apache.org> on 2015/11/12 17:54:11 UTC

[jira] [Updated] (AMBARI-13862) Check status of RS process for AMS in embedded mode

     [ https://issues.apache.org/jira/browse/AMBARI-13862?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Dmytro Sen updated AMBARI-13862:
--------------------------------
    Attachment: AMBARI-13862.patch

> Check status of RS process for AMS in embedded mode
> ---------------------------------------------------
>
>                 Key: AMBARI-13862
>                 URL: https://issues.apache.org/jira/browse/AMBARI-13862
>             Project: Ambari
>          Issue Type: Bug
>          Components: ambari-metrics
>    Affects Versions: 2.1.2
>            Reporter: Dmytro Sen
>            Assignee: Dmytro Sen
>            Priority: Critical
>             Fix For: 2.1.3
>
>         Attachments: AMBARI-13862.patch
>
>
> TimelineMetricStoreWatcher added, it checks TimelineMetricStore functionality by adding and getting a fake metric to/from HBase.
> if adding/getting metric fails 3 times in a row, AHS process shuts down with
> {code}
> 2015-11-12 15:22:10,142 INFO org.apache.hadoop.yarn.server.applicationhistoryservice.metrics.timeline.aggregators.v2.TimelineMetricClusterAggregator: End aggregation cycle @ Thu Nov 12 15:22:10 UTC 2015
> 2015-11-12 15:22:10,141 INFO org.apache.hadoop.yarn.server.applicationhistoryservice.metrics.timeline.aggregators.TimelineMetricClusterAggregatorSecond: Aggregator execution took too long, cancelling sleep. executionTime = 138539
> 2015-11-12 15:22:10,144 WARN org.apache.hadoop.yarn.server.applicationhistoryservice.metrics.timeline.aggregators.TimelineMetricClusterAggregatorSecond: Last Checkpoint is too old, discarding last checkpoint. lastCheckPointTime = Thu Nov 12 15:16:51 UTC 2015
> 2015-11-12 15:22:10,144 INFO org.apache.hadoop.yarn.server.applicationhistoryservice.metrics.timeline.aggregators.TimelineMetricClusterAggregatorSecond: Saving checkpoint time on first run. Thu Nov 12 15:21:10 UTC 2015
> 2015-11-12 15:22:13,700 INFO org.apache.hadoop.metrics2.impl.MetricsSystemImpl: Stopping phoenix metrics system...
> 2015-11-12 15:22:13,708 INFO org.apache.hadoop.metrics2.impl.MetricsSystemImpl: phoenix metrics system stopped.
> 2015-11-12 15:22:13,709 INFO org.apache.hadoop.metrics2.impl.MetricsSystemImpl: phoenix metrics system shutdown complete.
> 2015-11-12 15:22:13,709 INFO org.apache.hadoop.yarn.server.applicationhistoryservice.ApplicationHistoryManagerImpl: Stopping ApplicationHistory
> 2015-11-12 15:22:13,710 INFO org.apache.hadoop.ipc.Server: Stopping server on 60200
> 2015-11-12 15:22:13,716 INFO org.apache.hadoop.ipc.Server: Stopping IPC Server listener on 60200
> 2015-11-12 15:22:13,718 INFO org.apache.hadoop.ipc.Server: Stopping IPC Server Responder
> 2015-11-12 15:22:13,720 INFO org.apache.hadoop.yarn.server.applicationhistoryservice.ApplicationHistoryServer: SHUTDOWN_MSG: 
> /************************************************************
> SHUTDOWN_MSG: Shutting down ApplicationHistoryServer at c6404.ambari.apache.org/192.168.64.104
> ************************************************************/
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)