You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@edgent.apache.org by "Dale LaBossiere (JIRA)" <ji...@apache.org> on 2017/03/14 19:02:41 UTC
[jira] [Created] (EDGENT-396) JobMonitorApp restarts job 3 times
more than it should
Dale LaBossiere created EDGENT-396:
--------------------------------------
Summary: JobMonitorApp restarts job 3 times more than it should
Key: EDGENT-396
URL: https://issues.apache.org/jira/browse/EDGENT-396
Project: Edgent
Issue Type: Bug
Components: Runtime
Reporter: Dale LaBossiere
Assignee: Dale LaBossiere
N.B. I don't think this bug affects IotProvider since I'm pretty sure that while IotProvider includes JobMonitorApp, it doesn't register a JobRegistryService so the monitor does nothing. JIRA forthcoming for that.
JobMonitorAppTest exercises the app but it doesn't perform any validation that restarts were actually happening. Adding instrumentation / validation highlights that 3x the number of rebuilds/restarts are happening.
{code}
appOne: buildCnt: 7 injectedFailureCnt: 2
appTwo: buildCnt: 10 injectedFailureCnt: 3
{code}
Further investigation identifies the JobMonitorApp's job event filtering as the problem. Each "failed" job ends up with 3 events that pass through the filter
{code}
RUNNING, RUNNING, UNHEALTHY
RUNNING, CLOSED, UNHEALTHY
CLOSED, CLOSED, UNHEALTHY
{code}
... or something like that
--
This message was sent by Atlassian JIRA
(v6.3.15#6346)