You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@hive.apache.org by "Xuefu Zhang (JIRA)" <ji...@apache.org> on 2014/10/27 23:58:34 UTC

[jira] [Commented] (HIVE-8621) Aggregate all small table join data into broadcast variables

    [ https://issues.apache.org/jira/browse/HIVE-8621?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14185971#comment-14185971 ] 

Xuefu Zhang commented on HIVE-8621:
-----------------------------------

Hi [~ssatish], to make explicit, if the table is not bucket, which means n=1, we will broadcast m x 1 variables. Szehon and I was discussing about this, and found that broardcasting m x n variable is a general case that covers map join for both unbucketed tables and bucketed tables. This shouldn't impact your overall design and implementation. We are not clear about how to generate n variables for bucketed table yet, but it seems feasible. Please update the title of ticket if you agree to this minor change. Sorry for the confusion.

> Aggregate all small table join data into broadcast variables
> ------------------------------------------------------------
>
>                 Key: HIVE-8621
>                 URL: https://issues.apache.org/jira/browse/HIVE-8621
>             Project: Hive
>          Issue Type: Sub-task
>            Reporter: Suhas Satish
>            Assignee: Suhas Satish
>
> This is a sub-task of map-join for spark 
> https://issues.apache.org/jira/browse/HIVE-7613
> This can use the baseline patch for map-join
> https://issues.apache.org/jira/browse/HIVE-8616



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)