You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@hive.apache.org by "Namit Jain (JIRA)" <ji...@apache.org> on 2013/01/31 13:11:13 UTC
[jira] [Comment Edited] (HIVE-3403) user should not specify mapjoin
to perform sort-merge bucketed join
[ https://issues.apache.org/jira/browse/HIVE-3403?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13567412#comment-13567412 ]
Namit Jain edited comment on HIVE-3403 at 1/31/13 12:10 PM:
------------------------------------------------------------
To help in review, the class hierarchy is:
AbstractBucketJoinProc
AbstractSMBJoinProc
SortedMergeBucketMapjoinProc
SortedMergeJoinProc
BucketMapjoinOptProc
The context needed is:
BucketJoinOptProcCtx
SortBucketJoinOptProcCtx
Most of the code in AbstractBucketJoinProc and AbstractSMBJoinProc is old code moved.
BucketMapjoinOptProc is also old code – but there has been little refactoring to break it up into context.
As such, the only new code is SortedMergeJoinProc. Due to the refactoring, I am able to re-use a lot of code
between map-join and join processing.
was (Author: namit):
To help in review, the class hierarchy is:
AbstractBucketJoinProc
AbstractSMBJoinProc
SortedMergeBucketMapjoinProc
SortedMergeJoinProc
BucketMapjoinOptProc
The context needed is:
BucketJoinOptProcCtx
SortBucketJoinOptProcCtx
Most of the code in AbstractBucketJoinProc and AbstractSMBJoinProc is old code moved.
BucketMapjoinOptProc is also old code – but there has been little refactoring to break it up into context.
As such, the only new code is SortedMergeJoinProc. Due to the refactoring, I am able to re-use a lot of code
between map-join and join processing.
> user should not specify mapjoin to perform sort-merge bucketed join
> -------------------------------------------------------------------
>
> Key: HIVE-3403
> URL: https://issues.apache.org/jira/browse/HIVE-3403
> Project: Hive
> Issue Type: Bug
> Reporter: Namit Jain
> Assignee: Namit Jain
> Attachments: hive.3403.10.patch, hive.3403.11.patch, hive.3403.12.patch, hive.3403.13.patch, hive.3403.14.patch, hive.3403.15.patch, hive.3403.16.patch, hive.3403.17.patch, hive.3403.18.patch, hive.3403.19.patch, hive.3403.1.patch, hive.3403.21.patch, hive.3403.22.patch, hive.3403.23.patch, hive.3403.24.patch, hive.3403.2.patch, hive.3403.3.patch, hive.3403.4.patch, hive.3403.5.patch, hive.3403.6.patch, hive.3403.7.patch, hive.3403.8.patch, hive.3403.9.patch
>
>
> Currently, in order to perform a sort merge bucketed join, the user needs
> to set hive.optimize.bucketmapjoin.sortedmerge to true, and also specify the
> mapjoin hint.
> The user should not specify any hints.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira