You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@hive.apache.org by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2023/04/05 07:14:00 UTC

[jira] [Work logged] (HIVE-26968) SharedWorkOptimizer merges TableScan operators that have different DPP parents

     [ https://issues.apache.org/jira/browse/HIVE-26968?focusedWorklogId=854950&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-854950 ]

ASF GitHub Bot logged work on HIVE-26968:
-----------------------------------------

                Author: ASF GitHub Bot
            Created on: 05/Apr/23 07:13
            Start Date: 05/Apr/23 07:13
    Worklog Time Spent: 10m 
      Work Description: zabetak commented on PR #3981:
URL: https://github.com/apache/hive/pull/3981#issuecomment-1497026153

   Thanks for the update @ngsg ; I am bit underwater these days but this is on my TODO list! 




Issue Time Tracking
-------------------

    Worklog Id:     (was: 854950)
    Time Spent: 50m  (was: 40m)

> SharedWorkOptimizer merges TableScan operators that have different DPP parents
> ------------------------------------------------------------------------------
>
>                 Key: HIVE-26968
>                 URL: https://issues.apache.org/jira/browse/HIVE-26968
>             Project: Hive
>          Issue Type: Sub-task
>    Affects Versions: 4.0.0-alpha-2
>            Reporter: Seonggon Namgung
>            Assignee: Seonggon Namgung
>            Priority: Critical
>              Labels: hive-4.0.0-must, pull-request-available
>         Attachments: TPC-DS Query64 OperatorGraph.pdf
>
>          Time Spent: 50m
>  Remaining Estimate: 0h
>
> SharedWorkOptimizer merges TableScan operators that have different DPP parents, which leads to the creation of semantically wrong query plan.
> In our environment, running TPC-DS query64 on 1TB Iceberg format table returns no rows  because of this problem. (The correct result has 7094 rows.)
> We use hive.optimize.shared.work=true, hive.optimize.shared.work.extended=true, and hive.optimize.shared.work.dppunion=false to reproduce the bug.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)