You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@hive.apache.org by "Xuefu Zhang (JIRA)" <ji...@apache.org> on 2014/10/31 16:37:35 UTC

[jira] [Commented] (HIVE-8623) Implement SparkHashTableLoader for map-join [Spark Branch]

    [ https://issues.apache.org/jira/browse/HIVE-8623?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14191952#comment-14191952 ] 

Xuefu Zhang commented on HIVE-8623:
-----------------------------------

Based on the latest discussion, we are going to take distributed cache approach to broadcast small tables. The goal of this task is to check if current MR's HashTableLoader can be reused for Spark. Specifically, if it can read for a table or a bucket multiple files. Enhancing HashTableLoader is preferred.

> Implement SparkHashTableLoader for map-join [Spark Branch]
> ----------------------------------------------------------
>
>                 Key: HIVE-8623
>                 URL: https://issues.apache.org/jira/browse/HIVE-8623
>             Project: Hive
>          Issue Type: Sub-task
>            Reporter: Suhas Satish
>            Assignee: Jimmy Xiang
>
> This is a sub-task of map-join for spark 
> https://issues.apache.org/jira/browse/HIVE-7613
> This can use the baseline patch for map-join
> https://issues.apache.org/jira/browse/HIVE-8616



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)