Posted to issues@hive.apache.org by "Jianguo Tian (JIRA)" <ji...@apache.org> on 2017/06/05 05:36:04 UTC

[jira] [Issue Comment Deleted] (HIVE-11297) Combine op trees for partition info generating tasks [Spark branch]

     [ https://issues.apache.org/jira/browse/HIVE-11297?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Jianguo Tian updated HIVE-11297:
--------------------------------
    Comment: was deleted

(was: [~csun]: thanks for review, reply you on review board.)

> Combine op trees for partition info generating tasks [Spark branch]
> -------------------------------------------------------------------
>
>                 Key: HIVE-11297
>                 URL: https://issues.apache.org/jira/browse/HIVE-11297
>             Project: Hive
>          Issue Type: Bug
>    Affects Versions: spark-branch
>            Reporter: Chao Sun
>            Assignee: liyunzhang_intel
>         Attachments: HIVE-11297.1.patch
>
>
> Currently, for dynamic partition pruning in Spark, if a small table generates partition info for more than one partition column, multiple operator trees are created. They all start from the same table scan op but have different Spark partition pruning sinks.
> As an optimization, we can combine these op trees so that the table scan does not have to run multiple times.
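
The combining step described above can be pictured with a minimal sketch. This is not Hive's actual implementation and is not taken from the attached patch: the Op class, the combine method, and the operator names are hypothetical stand-ins, and the sketch assumes that trees rooted at the same table scan can be identified by the scan's name alone.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

public class CombineOpTrees {
    /** Hypothetical stand-in for an operator node; this is not Hive's Operator class. */
    static final class Op {
        final String name;
        final List<Op> children = new ArrayList<>();

        Op(String name) { this.name = name; }

        Op child(Op c) { children.add(c); return this; }
    }

    /**
     * Merges trees whose roots scan the same table: one root is kept per
     * scan name, and the branches of every duplicate root are attached to it.
     */
    static List<Op> combine(List<Op> roots) {
        Map<String, Op> byScan = new LinkedHashMap<>();
        for (Op root : roots) {
            Op merged = byScan.computeIfAbsent(root.name, Op::new);
            merged.children.addAll(root.children);
        }
        return new ArrayList<>(byScan.values());
    }

    public static void main(String[] args) {
        // Two op trees over the same small table, one pruning sink per partition column.
        Op t1 = new Op("TS[small_table]").child(new Op("PruningSink[col_a]"));
        Op t2 = new Op("TS[small_table]").child(new Op("PruningSink[col_b]"));

        List<Op> combined = combine(Arrays.asList(t1, t2));
        // prints "1 scan, 2 sinks"
        System.out.println(combined.size() + " scan, "
                + combined.get(0).children.size() + " sinks");
    }
}
```

In this sketch the table is scanned once and both pruning sinks hang off the single remaining scan operator, which is the effect the description asks for.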



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)