Posted to issues@spark.apache.org by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:00:08 UTC

[jira] [Updated] (SPARK-21419) Support Mesos failover_timeout in driver (Mesos cluster mode)

     [ https://issues.apache.org/jira/browse/SPARK-21419?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Hyukjin Kwon updated SPARK-21419:
---------------------------------
    Labels: bulk-closed  (was: )

> Support Mesos failover_timeout in driver (Mesos cluster mode)
> -------------------------------------------------------------
>
>                 Key: SPARK-21419
>                 URL: https://issues.apache.org/jira/browse/SPARK-21419
>             Project: Spark
>          Issue Type: Improvement
>          Components: Mesos
>    Affects Versions: 2.2.0
>            Reporter: Susan X. Huynh
>            Priority: Major
>              Labels: bulk-closed
>
> In Mesos cluster mode, the driver framework's failover_timeout is currently set to zero. This means that if the driver temporarily loses connectivity with the master, the driver is considered disconnected, and the master will immediately kill all tasks and executors associated with the framework.
> To avoid this behavior, I would like to make this failover_timeout configurable. A user could then set it to a non-zero value, so that during a temporary disconnection the master would wait before tearing down the framework.
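
As a rough illustration of the behavior described above, here is a minimal sketch of how a configurable timeout could change the teardown decision. It is a toy model, not Spark or Mesos code. The property name `spark.mesos.driver.failoverTimeout` and both helper functions are assumptions made for this example; the issue itself does not name a configuration key.

```python
# Toy model of the failover_timeout decision; not actual Spark or Mesos code.
# ASSUMPTION: the property name below is illustrative, not taken from the issue.
FAILOVER_TIMEOUT_KEY = "spark.mesos.driver.failoverTimeout"


def failover_timeout_from_conf(conf):
    """Read the timeout (in seconds) from a dict-like config.

    Defaults to 0.0, which matches the current behavior described in the issue.
    """
    value = float(conf.get(FAILOVER_TIMEOUT_KEY, "0.0"))
    if value < 0:
        raise ValueError("failover timeout must be non-negative")
    return value


def should_tear_down(failover_timeout_s, disconnected_for_s):
    """Return True once the disconnection has lasted at least the timeout.

    With a timeout of zero this is True immediately, so any disconnection
    leads to the framework's tasks and executors being killed.
    """
    return disconnected_for_s >= failover_timeout_s


# Default configuration: teardown happens as soon as the driver disconnects.
default_timeout = failover_timeout_from_conf({})
immediate = should_tear_down(default_timeout, disconnected_for_s=0.0)

# Non-zero timeout: a 5 second blip is tolerated, a 60 second outage is not.
configured_timeout = failover_timeout_from_conf({FAILOVER_TIMEOUT_KEY: "30"})
tolerated = not should_tear_down(configured_timeout, disconnected_for_s=5.0)
expired = should_tear_down(configured_timeout, disconnected_for_s=60.0)
```

In this sketch the default of zero reproduces the immediate teardown the issue complains about, while any positive value gives the driver a window in which to reconnect.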



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
