You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@hive.apache.org by "Vikram Dixit K (JIRA)" <ji...@apache.org> on 2013/12/06 04:43:36 UTC

[jira] [Created] (HIVE-5973) SMB joins produce incorrect results with multiple partitions and buckets

Vikram Dixit K created HIVE-5973:
------------------------------------

             Summary: SMB joins produce incorrect results with multiple partitions and buckets
                 Key: HIVE-5973
                 URL: https://issues.apache.org/jira/browse/HIVE-5973
             Project: Hive
          Issue Type: Bug
          Components: Query Processor
    Affects Versions: 0.13.0
            Reporter: Vikram Dixit K
            Assignee: Vikram Dixit K
             Fix For: 0.13.0


It looks like there is an issue with re-using the output object array in the select operator. When we read rows of the non-big tables, we hold on to the output object in the priority queue. This causes hive to produce incorrect results because all the elements in the priority queue refer to the same object and the join happens on only one of the buckets.

{noformat}
output[i] = eval[i].evaluate(row);
{noformat}



--
This message was sent by Atlassian JIRA
(v6.1#6144)