You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@flink.apache.org by "ASF GitHub Bot (JIRA)" <ji...@apache.org> on 2018/09/18 11:32:00 UTC

[jira] [Commented] (FLINK-10311) HA end-to-end/Jepsen tests for standby Dispatchers

    [ https://issues.apache.org/jira/browse/FLINK-10311?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16618969#comment-16618969 ] 

ASF GitHub Bot commented on FLINK-10311:
----------------------------------------

GJL opened a new pull request #6712: [FLINK-10311][tests] HA end-to-end/Jepsen tests for standby Dispatchers
URL: https://github.com/apache/flink/pull/6712
 
 
   ## What is the purpose of the change
   
   *This adds tests to verify that jobs can be cancelled properly when there are standby masters.*
   
   cc: @tillrohrmann 
   
   ## Brief change log
   
     - *See commits.*
   
   ## Verifying this change
   
   This change added tests and can be verified as follows:
   
     - *Tested manually.*
     - *The changes themselves are tests.*
     - *Added Clojure unit tests..*
   
   ## Does this pull request potentially affect one of the following parts:
   
     - Dependencies (does it add or upgrade a dependency): (yes / **no**)
     - The public API, i.e., is any changed class annotated with `@Public(Evolving)`: (yes / **no**)
     - The serializers: (yes / **no** / don't know)
     - The runtime per-record code paths (performance sensitive): (yes / **no** / don't know)
     - Anything that affects deployment or recovery: JobManager (and its components), Checkpointing, Yarn/Mesos, ZooKeeper: (yes / **no** / don't know)
     - The S3 file system connector: (yes / **no** / don't know)
   
   ## Documentation
   
     - Does this pull request introduce a new feature? (**yes** / no)
     - If yes, how is the feature documented? (not applicable / **docs** / JavaDocs / not documented)
   

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
users@infra.apache.org


> HA end-to-end/Jepsen tests for standby Dispatchers
> --------------------------------------------------
>
>                 Key: FLINK-10311
>                 URL: https://issues.apache.org/jira/browse/FLINK-10311
>             Project: Flink
>          Issue Type: Improvement
>          Components: Tests
>    Affects Versions: 1.7.0
>            Reporter: Till Rohrmann
>            Assignee: Gary Yao
>            Priority: Critical
>              Labels: pull-request-available
>             Fix For: 1.7.0, 1.6.2, 1.5.5
>
>
> We should add end-to-end or Jepsen tests to verify the HA behaviour when using multiple standby Dispatchers. In particular, we should verify that jobs get properly cleaned up after they finished successfully (see FLINK-10255 and FLINK-10011):
> 1. Test that a standby Dispatcher does not affect job execution and resource cleanup
> 2. Test that a standby Dispatcher recovers all submitted jobs after the leader loses leadership



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)