You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "Matei Zaharia (JIRA)" <ji...@apache.org> on 2014/06/06 00:02:03 UTC

[jira] [Created] (SPARK-2043) ExternalAppendOnlyMap doesn't always find matching keys

Matei Zaharia created SPARK-2043:
------------------------------------

             Summary: ExternalAppendOnlyMap doesn't always find matching keys
                 Key: SPARK-2043
                 URL: https://issues.apache.org/jira/browse/SPARK-2043
             Project: Spark
          Issue Type: Bug
          Components: Spark Core
    Affects Versions: 1.0.0, 0.9.1, 0.9.0
            Reporter: Matei Zaharia
            Assignee: Matei Zaharia
            Priority: Blocker


The current implementation reads one key with the next hash code as it finishes reading the keys with the current hash code, which may cause it to miss some matches of the next key. This can cause operations like join to give the wrong result when reduce tasks spill to disk and there are hash collisions, as values won't be matched together.



--
This message was sent by Atlassian JIRA
(v6.2#6252)