You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "Susan X. Huynh (JIRA)" <ji...@apache.org> on 2017/07/14 15:16:00 UTC
[jira] [Created] (SPARK-21419) Support Mesos failover_timeout in
driver (Mesos cluster mode)
Susan X. Huynh created SPARK-21419:
--------------------------------------
Summary: Support Mesos failover_timeout in driver (Mesos cluster mode)
Key: SPARK-21419
URL: https://issues.apache.org/jira/browse/SPARK-21419
Project: Spark
Issue Type: Improvement
Components: Mesos
Affects Versions: 2.2.0
Reporter: Susan X. Huynh
In Mesos cluster mode, the driver framework's failover_timeout is currently set to zero. This means that if the driver temporarily loses connectivity with the master, the driver is considered disconnected, and the master will immediately kill all tasks and executors associated with the framework.
To avoid this behavior, I would like to make this failover_timeout configurable. A user could then set it to a non-zero value, so that during a temporary disconnection the master would wait before tearing down the framework.
--
This message was sent by Atlassian JIRA
(v6.4.14#64029)
---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org