You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues-all@impala.apache.org by "Aman Sinha (Jira)" <ji...@apache.org> on 2020/04/10 22:43:00 UTC

[jira] [Resolved] (IMPALA-4680) Join disjuncts are not simplified and pushed to the scan nodes when possible

     [ https://issues.apache.org/jira/browse/IMPALA-4680?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Aman Sinha resolved IMPALA-4680.
--------------------------------
    Resolution: Not A Problem

> Join disjuncts are not simplified and pushed to the scan nodes when possible
> ----------------------------------------------------------------------------
>
>                 Key: IMPALA-4680
>                 URL: https://issues.apache.org/jira/browse/IMPALA-4680
>             Project: IMPALA
>          Issue Type: Improvement
>          Components: Frontend
>    Affects Versions: Impala 2.8.0
>            Reporter: Mostafa Mokhtar
>            Assignee: Alexander Behm
>            Priority: Major
>              Labels: planner
>
> While looking at the plan for primitive_conjunct_ordering_1 noticed that the join condition can be simplified and pushed to the scan but it is not. 
> https://github.com/apache/incubator-impala/blob/master/testdata/workloads/targeted-perf/queries/primitive_conjunct_ordering.test#L31
> Query
> {code}
> SELECT sum(l_extendedprice * (1 - l_discount)) AS revenue
> FROM lineitem, part
> WHERE p_partkey = l_partkey
>   AND (p_partkey = 0 OR l_partkey = 0)
> {code}
> {code}
> +-----------------------------------------------------------+
> | Explain String                                            |
> +-----------------------------------------------------------+
> | Estimated Per-Host Requirements: Memory=777.54MB VCores=2 |
> |                                                           |
> | PLAN-ROOT SINK                                            |
> | |                                                         |
> | 06:AGGREGATE [FINALIZE]                                   |
> | |  output: sum:merge(l_extendedprice * (1 - l_discount))  |
> | |                                                         |
> | 05:EXCHANGE [UNPARTITIONED]                               |
> | |                                                         |
> | 03:AGGREGATE                                              |
> | |  output: sum(l_extendedprice * (1 - l_discount))        |
> | |                                                         |
> | 02:HASH JOIN [INNER JOIN, BROADCAST]                      |
> | |  hash predicates: l_partkey = p_partkey                 |
> | |  other predicates: (p_partkey = 0 OR l_partkey = 0)     |
> | |  runtime filters: RF000 <- p_partkey                    |
> | |                                                         |
> | |--04:EXCHANGE [BROADCAST]                                |
> | |  |                                                      |
> | |  01:SCAN HDFS [tpch_300_parquet.part]                   |
> | |     partitions=1/1 files=14 size=1.88GB                 |
> | |                                                         |
> | 00:SCAN HDFS [tpch_300_parquet.lineitem]                  |
> |    partitions=1/1 files=259 size=63.71GB                  |
> |    runtime filters: RF000 -> l_partkey                    |
> +-----------------------------------------------------------+
> {code}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-all-unsubscribe@impala.apache.org
For additional commands, e-mail: issues-all-help@impala.apache.org