You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "Davies Liu (JIRA)" <ji...@apache.org> on 2014/08/14 08:45:11 UTC
[jira] [Created] (SPARK-3030) reuse python worker
Davies Liu created SPARK-3030:
---------------------------------
Summary: reuse python worker
Key: SPARK-3030
URL: https://issues.apache.org/jira/browse/SPARK-3030
Project: Spark
Issue Type: Improvement
Components: PySpark
Reporter: Davies Liu
Currently, it will fork an Python worker for each task, it will better if we can reuse the worker for later tasks.
This will be very useful for large dataset with big broadcast, so it does not need to sending broadcast to worker again and again. Also it can reduce the overhead of launch a task.
--
This message was sent by Atlassian JIRA
(v6.2#6252)
---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org