You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "Matei Zaharia (JIRA)" <ji...@apache.org> on 2014/06/06 00:02:03 UTC
[jira] [Created] (SPARK-2043) ExternalAppendOnlyMap doesn't always
find matching keys
Matei Zaharia created SPARK-2043:
------------------------------------
Summary: ExternalAppendOnlyMap doesn't always find matching keys
Key: SPARK-2043
URL: https://issues.apache.org/jira/browse/SPARK-2043
Project: Spark
Issue Type: Bug
Components: Spark Core
Affects Versions: 1.0.0, 0.9.1, 0.9.0
Reporter: Matei Zaharia
Assignee: Matei Zaharia
Priority: Blocker
The current implementation reads one key with the next hash code as it finishes reading the keys with the current hash code, which may cause it to miss some matches of the next key. This can cause operations like join to give the wrong result when reduce tasks spill to disk and there are hash collisions, as values won't be matched together.
--
This message was sent by Atlassian JIRA
(v6.2#6252)