You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@edgent.apache.org by "Dale LaBossiere (JIRA)" <ji...@apache.org> on 2017/03/15 13:25:41 UTC

[jira] [Resolved] (EDGENT-396) JobMonitorApp restarts job 3 times more than it should

     [ https://issues.apache.org/jira/browse/EDGENT-396?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Dale LaBossiere resolved EDGENT-396.
------------------------------------
    Resolution: Fixed

resolved via https://github.com/apache/incubator-edgent/pull/298

> JobMonitorApp restarts job 3 times more than it should
> ------------------------------------------------------
>
>                 Key: EDGENT-396
>                 URL: https://issues.apache.org/jira/browse/EDGENT-396
>             Project: Edgent
>          Issue Type: Bug
>          Components: Runtime
>            Reporter: Dale LaBossiere
>            Assignee: Dale LaBossiere
>
> N.B. I don't think this bug affects IotProvider since I'm pretty sure that while IotProvider includes JobMonitorApp, it doesn't register a JobRegistryService so the monitor does nothing.  JIRA forthcoming for that.
> JobMonitorAppTest exercises the app but it doesn't perform any validation that restarts were actually happening.  Adding instrumentation / validation highlights that 3x the number of rebuilds/restarts are happening.
> {code}
> appOne: buildCnt: 7 injectedFailureCnt: 2
> appTwo: buildCnt: 10 injectedFailureCnt: 3
> {code}
> Further investigation identifies the JobMonitorApp's job event filtering as the problem.  Each "failed" job ends up with 3 events that pass through the filter
> {code}
> RUNNING, RUNNING, UNHEALTHY
> RUNNING, CLOSED, UNHEALTHY
> CLOSED, CLOSED, UNHEALTHY
> {code}
> ... or something like that



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)