You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@hive.apache.org by "Xuefu Zhang (JIRA)" <ji...@apache.org> on 2014/10/27 23:58:34 UTC
[jira] [Commented] (HIVE-8621) Aggregate all small table join data
into broadcast variables
[ https://issues.apache.org/jira/browse/HIVE-8621?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14185971#comment-14185971 ]
Xuefu Zhang commented on HIVE-8621:
-----------------------------------
Hi [~ssatish], to make explicit, if the table is not bucket, which means n=1, we will broadcast m x 1 variables. Szehon and I was discussing about this, and found that broardcasting m x n variable is a general case that covers map join for both unbucketed tables and bucketed tables. This shouldn't impact your overall design and implementation. We are not clear about how to generate n variables for bucketed table yet, but it seems feasible. Please update the title of ticket if you agree to this minor change. Sorry for the confusion.
> Aggregate all small table join data into broadcast variables
> ------------------------------------------------------------
>
> Key: HIVE-8621
> URL: https://issues.apache.org/jira/browse/HIVE-8621
> Project: Hive
> Issue Type: Sub-task
> Reporter: Suhas Satish
> Assignee: Suhas Satish
>
> This is a sub-task of map-join for spark
> https://issues.apache.org/jira/browse/HIVE-7613
> This can use the baseline patch for map-join
> https://issues.apache.org/jira/browse/HIVE-8616
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)