You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "Eric Vandenberg (JIRA)" <ji...@apache.org> on 2017/06/26 21:42:00 UTC
[jira] [Created] (SPARK-21219) Task retry occurs on same executor
due to race condition with blacklisting
Eric Vandenberg created SPARK-21219:
---------------------------------------
Summary: Task retry occurs on same executor due to race condition with blacklisting
Key: SPARK-21219
URL: https://issues.apache.org/jira/browse/SPARK-21219
Project: Spark
Issue Type: Bug
Components: Scheduler
Affects Versions: 2.1.1
Reporter: Eric Vandenberg
Priority: Minor
When a task fails it is added into the pending task list and corresponding black list policy is enforced (ie, specifying if it can/can't run on a particular node/executor/etc.) Unfortunately the ordering is such that retrying the task could assign the task to the same executor, which, incidentally could be shutting down and immediately fail the retry. Instead the black list state should be updated and then the task assigned, ensuring that the black list policy is properly enforced.
--
This message was sent by Atlassian JIRA
(v6.4.14#64029)
---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org