You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@helix.apache.org by "Hunter L (JIRA)" <ji...@apache.org> on 2018/11/01 23:56:00 UTC

[jira] [Created] (HELIX-783) TASK: Fix JobQueue's job state-related bug

Hunter L created HELIX-783:
------------------------------

             Summary: TASK: Fix JobQueue's job state-related bug
                 Key: HELIX-783
                 URL: https://issues.apache.org/jira/browse/HELIX-783
             Project: Apache Helix
          Issue Type: Improvement
            Reporter: Hunter L
            Assignee: Hunter L


The bug was observed in TestTaskRebalancerStopResume:stopAndResumeNamedQueue(), which was being unstable. It was observed that for JobQueues with multiple jobs, the second job would get marked as IN_PROGRESS even though the first job hadn't completed/failed, especially when the queue was being stopped and resumed. This was due to a bug in getIncompleteJobCount() because it was not counting jobs in STOPPING state. This was fixed and another check was added right before JobDispatcher marks a job as STOPPED so that it would not mark it STOPPED if the job state is NOT_STARTED. Changelist: 1. Fix getIncompleteJobCount() 2. Add a check so that we don't mark NOT_STARTED jobs as STOPPED



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)