Posted to issues@spark.apache.org by "Wenchen Fan (JIRA)" <ji...@apache.org> on 2014/05/20 04:59:37 UTC
[jira] [Created] (SPARK-1888) enhance MEMORY_AND_DISK mode by dropping blocks in parallel
Wenchen Fan created SPARK-1888:
----------------------------------
Summary: enhance MEMORY_AND_DISK mode by dropping blocks in parallel
Key: SPARK-1888
URL: https://issues.apache.org/jira/browse/SPARK-1888
Project: Spark
Issue Type: Improvement
Components: Spark Core
Reporter: Wenchen Fan
Sometimes MEMORY_AND_DISK mode is slower than DISK_ONLY mode because the IO done when dropping blocks from the memory store happens while a lock is held. As the TODO says, the solution is to synchronize only the selection of the blocks to be dropped and to do the dropping itself in parallel. I have a quick fix in my PR: https://github.com/apache/spark/pull/791#issuecomment-43567924
The fix is currently fragile, but I'm working on making it more robust.
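A minimal sketch of the idea described above, written in Python for brevity rather than in Spark's Scala. It is not the code from the PR: the class `MemoryStoreSketch`, its methods, and the `drop_to_disk` callback are hypothetical names chosen for illustration. The lock covers only the bookkeeping that selects which blocks to evict; the slow drop runs after the lock is released, so several threads can drop blocks at the same time.

```python
import threading
from collections import OrderedDict
from concurrent.futures import ThreadPoolExecutor


class MemoryStoreSketch:
    """Toy memory store with oldest-first eviction (hypothetical, not Spark's)."""

    def __init__(self, max_bytes, drop_fn):
        self._lock = threading.Lock()
        self._entries = OrderedDict()  # block_id -> bytes, oldest first
        self._used = 0
        self._max_bytes = max_bytes
        self._drop_fn = drop_fn  # slow IO, e.g. writing a block to disk

    def put(self, block_id, data):
        size = len(data)
        if size > self._max_bytes:
            return False  # can never fit in memory; the caller must handle it
        victims = []
        # Step 1 (synchronized): select victims and update bookkeeping only.
        with self._lock:
            old = self._entries.pop(block_id, None)
            if old is not None:
                self._used -= len(old)
            while self._used + size > self._max_bytes:
                victim_id, victim_data = self._entries.popitem(last=False)
                self._used -= len(victim_data)
                victims.append((victim_id, victim_data))
            self._entries[block_id] = data
            self._used += size
        # Step 2 (not synchronized): do the slow drops outside the lock,
        # so other threads can select and drop their own victims meanwhile.
        for victim_id, victim_data in victims:
            self._drop_fn(victim_id, victim_data)
        return True

    def used_bytes(self):
        with self._lock:
            return self._used

    def block_ids(self):
        with self._lock:
            return list(self._entries)


# Usage: four threads insert eight 4-byte blocks into a 10-byte store.
disk = {}
disk_lock = threading.Lock()


def drop_to_disk(block_id, data):
    with disk_lock:  # stands in for a real disk write
        disk[block_id] = data


store = MemoryStoreSketch(max_bytes=10, drop_fn=drop_to_disk)
with ThreadPoolExecutor(max_workers=4) as pool:
    list(pool.map(lambda i: store.put(f"block-{i}", b"abcd"), range(8)))
```

One caveat of this sketch: between step 1 and the end of step 2, an evicted block is no longer in memory and not yet on disk, so a concurrent reader could miss it. A more robust version would need to keep such blocks reachable until the drop completes.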
--
This message was sent by Atlassian JIRA
(v6.2#6252)