You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "Wenchen Fan (JIRA)" <ji...@apache.org> on 2014/05/20 04:59:37 UTC

[jira] [Created] (SPARK-1888) enhance MEMORY_AND_DISK mode by dropping blocks in parallel

Wenchen Fan created SPARK-1888:
----------------------------------

             Summary: enhance MEMORY_AND_DISK mode by dropping blocks in parallel
                 Key: SPARK-1888
                 URL: https://issues.apache.org/jira/browse/SPARK-1888
             Project: Spark
          Issue Type: Improvement
          Components: Spark Core
            Reporter: Wenchen Fan


Sometimes MEMORY_AND_DISK mode is slower than DISK_ONLY mode because of the lock on IO operations(dropping blocks in memory store). As the TODO says, the solution is: only synchronize the selecting of to-be-dropped blocks and do the dropping in parallel. I have a quick fix in my PR: https://github.com/apache/spark/pull/791#issuecomment-43567924
It's fragile currently  but I'm working on it to make it more robust.



--
This message was sent by Atlassian JIRA
(v6.2#6252)