You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@hive.apache.org by "Vikram Dixit K (JIRA)" <ji...@apache.org> on 2014/10/25 01:59:33 UTC

[jira] [Commented] (HIVE-8597) SMB join small table side should use the same set of serialized payloads across tasks

    [ https://issues.apache.org/jira/browse/HIVE-8597?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14183755#comment-14183755 ] 

Vikram Dixit K commented on HIVE-8597:
--------------------------------------

LGTM +1. +1 for 0.14 as well.

> SMB join small table side should use the same set of serialized payloads across tasks
> -------------------------------------------------------------------------------------
>
>                 Key: HIVE-8597
>                 URL: https://issues.apache.org/jira/browse/HIVE-8597
>             Project: Hive
>          Issue Type: Improvement
>          Components: Tez
>    Affects Versions: 0.14.0
>            Reporter: Siddharth Seth
>            Assignee: Siddharth Seth
>             Fix For: 0.14.0
>
>         Attachments: HIVE-8597.1.patch
>
>
> Each task sees all splits belonging to the bucket being processed by the task. At the moment, we end up using different instances of the same serialized split which adds unnecessary memory pressure.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)