You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@flink.apache.org by "ASF GitHub Bot (JIRA)" <ji...@apache.org> on 2017/01/13 08:49:26 UTC

[jira] [Commented] (FLINK-4912) Introduce RECONCILING state in ExecutionGraph

    [ https://issues.apache.org/jira/browse/FLINK-4912?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15821421#comment-15821421 ] 

ASF GitHub Bot commented on FLINK-4912:
---------------------------------------

GitHub user wangzhijiang999 opened a pull request:

    https://github.com/apache/flink/pull/3113

    [FLINK-4912] Introduce RECONCILIATING state in ExecutionGraph and Exe…

    This is part of the non-disruptive JobManager failure recovery.
    
    Add a JobStatus and ExecutionState {{RECONCILING}}.
    If a job is started on a JobManager for master recovery, the job status with all the executions transition to {{RECONCILING}} state.
    
    From {{RECONCILING}}, execution can go to any existing task states (execution reconciled with TaskManager).

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/wangzhijiang999/flink FLINK-4912

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/flink/pull/3113.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #3113
    
----
commit 0fbd628b9b8817fd1b71faca92d87c56213d79f6
Author: 淘江 <ta...@alibaba-inc.com>
Date:   2017-01-13T08:41:37Z

    [FLINK-4912] Introduce RECONCILIATING state in ExecutionGraph and Execution for JobManager failure recovery

----


> Introduce RECONCILING state in ExecutionGraph
> ---------------------------------------------
>
>                 Key: FLINK-4912
>                 URL: https://issues.apache.org/jira/browse/FLINK-4912
>             Project: Flink
>          Issue Type: Sub-task
>          Components: Distributed Coordination
>            Reporter: Stephan Ewen
>            Assignee: Zhijiang Wang
>
> This is part of the non-disruptive JobManager failure recovery.
> I suggest to add a JobStatus and ExecutionState {{RECONCILING}}.
> If a job is started on a that JobManager for master recovery (tbd how to determine that) the {{ExecutionGraph}} and the {{Execution}}s start in the reconciling state.
> From {{RECONCILING}}, tasks can go to {{RUNNING}} (execution reconciled with TaskManager) or to {{FAILED}}.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)