You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@hive.apache.org by "Namit Jain (JIRA)" <ji...@apache.org> on 2013/01/31 13:11:13 UTC

[jira] [Comment Edited] (HIVE-3403) user should not specify mapjoin to perform sort-merge bucketed join

    [ https://issues.apache.org/jira/browse/HIVE-3403?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13567412#comment-13567412 ] 

Namit Jain edited comment on HIVE-3403 at 1/31/13 12:10 PM:
------------------------------------------------------------

To help	in review, the class hierarchy is:

AbstractBucketJoinProc
       AbstractSMBJoinProc
             SortedMergeBucketMapjoinProc
             SortedMergeJoinProc
       BucketMapjoinOptProc


The context needed is:

BucketJoinOptProcCtx
       SortBucketJoinOptProcCtx

Most of the code in AbstractBucketJoinProc and AbstractSMBJoinProc is old code moved.
BucketMapjoinOptProc is also old code – but there has been little refactoring to break it up into context.

As such, the only new code is SortedMergeJoinProc. Due to the refactoring, I am	able to	re-use a lot of	code
between	map-join and join processing.

                
      was (Author: namit):
    To help	in review, the class hierarchy is:

AbstractBucketJoinProc
     AbstractSMBJoinProc
           SortedMergeBucketMapjoinProc
           SortedMergeJoinProc
     BucketMapjoinOptProc


The context needed is:

BucketJoinOptProcCtx
     SortBucketJoinOptProcCtx

Most of the code in AbstractBucketJoinProc and AbstractSMBJoinProc is old code moved.
BucketMapjoinOptProc is also old code – but there has been little refactoring to break it up into context.

As such, the only new code is SortedMergeJoinProc. Due to the refactoring, I am	able to	re-use a lot of	code
between	map-join and join processing.

                  
> user should not specify mapjoin to perform sort-merge bucketed join
> -------------------------------------------------------------------
>
>                 Key: HIVE-3403
>                 URL: https://issues.apache.org/jira/browse/HIVE-3403
>             Project: Hive
>          Issue Type: Bug
>            Reporter: Namit Jain
>            Assignee: Namit Jain
>         Attachments: hive.3403.10.patch, hive.3403.11.patch, hive.3403.12.patch, hive.3403.13.patch, hive.3403.14.patch, hive.3403.15.patch, hive.3403.16.patch, hive.3403.17.patch, hive.3403.18.patch, hive.3403.19.patch, hive.3403.1.patch, hive.3403.21.patch, hive.3403.22.patch, hive.3403.23.patch, hive.3403.24.patch, hive.3403.2.patch, hive.3403.3.patch, hive.3403.4.patch, hive.3403.5.patch, hive.3403.6.patch, hive.3403.7.patch, hive.3403.8.patch, hive.3403.9.patch
>
>
> Currently, in order to perform a sort merge bucketed join, the user needs
> to set hive.optimize.bucketmapjoin.sortedmerge to true, and also specify the 
> mapjoin hint.
> The user should not specify any hints.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira