You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@flink.apache.org by "Till Rohrmann (JIRA)" <ji...@apache.org> on 2018/01/30 09:51:00 UTC

[jira] [Closed] (FLINK-8488) Dispatcher does not recover Jobs

     [ https://issues.apache.org/jira/browse/FLINK-8488?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Till Rohrmann closed FLINK-8488.
--------------------------------
    Resolution: Fixed

Fixed via e2c2cf414ca1224fbbcf9115352a9972eaa047b8

> Dispatcher does not recover Jobs
> --------------------------------
>
>                 Key: FLINK-8488
>                 URL: https://issues.apache.org/jira/browse/FLINK-8488
>             Project: Flink
>          Issue Type: Bug
>          Components: Distributed Coordination
>    Affects Versions: 1.5.0
>         Environment: 776af4a882c85926fc0764b702fec717c675e34c
>            Reporter: Gary Yao
>            Assignee: Till Rohrmann
>            Priority: Blocker
>              Labels: flip-6
>             Fix For: 1.5.0
>
>
> Dispatcher does not recover jobs on failover (FLIP-6 mode).
> *Steps to reproduce*:
>  # {{bin/start-cluster.sh flip6}}
>  # {{bin/flink run -p1 -flip6 examples/batch/WordCount.jar --input /path/to/largefile.txt}}
>  # Wait until job is running, then run {{bin/jobmanager.sh stop flip6 && bin/jobmanager.sh start flip6}} to restart the master.
>  # Wait until leader is elected and verify that no jobs are running.
> *Analysis*
>  * Dispatcher checks on {{submitJob}} whether the job scheduling status is {{PENDING}} and only then allows resubmission of the job. However, the job is marked as {{RUNNING}} in ZooKeeper.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)