You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "Davies Liu (JIRA)" <ji...@apache.org> on 2015/06/09 08:31:00 UTC

[jira] [Created] (SPARK-8202) PySpark: infinite loop during external sort

Davies Liu created SPARK-8202:
---------------------------------

             Summary: PySpark: infinite loop during external sort 
                 Key: SPARK-8202
                 URL: https://issues.apache.org/jira/browse/SPARK-8202
             Project: Spark
          Issue Type: Bug
          Components: PySpark
    Affects Versions: 1.4.0
            Reporter: Davies Liu
            Assignee: Davies Liu
            Priority: Critical


The batch size during external sort will grow up to max 10000, then shrink down to zero, causing infinite loop.

Given the assumption that the items usually have similar size, so we don't need to adjust the batch size after first spill.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org