Posted to issues-all@impala.apache.org by "Tim Armstrong (Jira)" <ji...@apache.org> on 2020/06/13 00:12:00 UTC

[jira] [Assigned] (IMPALA-6006) Incorrect cardinality estimation when dimension table has inequality predicate

     [ https://issues.apache.org/jira/browse/IMPALA-6006?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Tim Armstrong reassigned IMPALA-6006:
-------------------------------------

    Assignee:     (was: Philip Martin)

> Incorrect cardinality estimation when dimension table has inequality predicate
> ------------------------------------------------------------------------------
>
>                 Key: IMPALA-6006
>                 URL: https://issues.apache.org/jira/browse/IMPALA-6006
>             Project: IMPALA
>          Issue Type: Bug
>    Affects Versions: Impala 2.11.0
>            Reporter: Mostafa Mokhtar
>            Priority: Major
>
> Query 
> {code}
> select count(*)
> from catalog_sales
>   join date_dim on catalog_sales.cs_sold_date_sk = date_dim.d_date_sk
> where d_month_seq between 1193 and 1193 + 11;
> {code}
> Plan
> {code}
> +-------------------------------------------------------------------------------+
> | Explain String                                                                |
> +-------------------------------------------------------------------------------+
> | Max Per-Host Resource Reservation: Memory=1.94MB                              |
> | Per-Host Resource Estimates: Memory=54.94MB                                   |
> |                                                                               |
> | F02:PLAN FRAGMENT [UNPARTITIONED] hosts=1 instances=1                         |
> | |  Per-Host Resources: mem-estimate=10.00MB mem-reservation=0B                |
> | PLAN-ROOT SINK                                                                |
> | |  mem-estimate=0B mem-reservation=0B                                         |
> | |                                                                             |
> | 06:AGGREGATE [FINALIZE]                                                       |
> | |  output: count:merge(*)                                                     |
> | |  mem-estimate=10.00MB mem-reservation=0B spill-buffer=2.00MB                |
> | |  tuple-ids=2 row-size=8B cardinality=1                                      |
> | |                                                                             |
> | 05:EXCHANGE [UNPARTITIONED]                                                   |
> | |  mem-estimate=0B mem-reservation=0B                                         |
> | |  tuple-ids=2 row-size=8B cardinality=1                                      |
> | |                                                                             |
> | F00:PLAN FRAGMENT [RANDOM] hosts=7 instances=7                                |
> | Per-Host Resources: mem-estimate=12.94MB mem-reservation=1.94MB               |
> | 03:AGGREGATE                                                                  |
> | |  output: count(*)                                                           |
> | |  mem-estimate=10.00MB mem-reservation=0B spill-buffer=2.00MB                |
> | |  tuple-ids=2 row-size=8B cardinality=1                                      |
> | |                                                                             |
> | 02:HASH JOIN [INNER JOIN, BROADCAST]                                          |
> | |  hash predicates: catalog_sales.cs_sold_date_sk = date_dim.d_date_sk        |
> | |  fk/pk conjuncts: catalog_sales.cs_sold_date_sk = date_dim.d_date_sk        |
> | |  runtime filters: RF000 <- date_dim.d_date_sk                               |
> | |  mem-estimate=1.94MB mem-reservation=1.94MB spill-buffer=64.00KB            |
> | |  tuple-ids=0,1 row-size=16B cardinality=14399964710                         |
> | |                                                                             |
> | |--04:EXCHANGE [BROADCAST]                                                    |
> | |  |  mem-estimate=0B mem-reservation=0B                                      |
> | |  |  tuple-ids=1 row-size=8B cardinality=7305                                |
> | |  |                                                                          |
> | |  F01:PLAN FRAGMENT [RANDOM] hosts=1 instances=1                             |
> | |  Per-Host Resources: mem-estimate=32.00MB mem-reservation=0B                |
> | |  01:SCAN HDFS [tpcds_10000_parquet.date_dim, RANDOM]                        |
> | |     partitions=1/1 files=1 size=2.15MB                                      |
> | |     predicates: d_month_seq <= 1204, d_month_seq >= 1193                    |
> | |     stats-rows=73049 extrapolated-rows=disabled                             |
> | |     table stats: rows=73049 size=unavailable                                |
> | |     column stats: all                                                       |
> | |     parquet statistics predicates: d_month_seq <= 1204, d_month_seq >= 1193 |
> | |     parquet dictionary predicates: d_month_seq <= 1204, d_month_seq >= 1193 |
> | |     mem-estimate=32.00MB mem-reservation=0B                                 |
> | |     tuple-ids=1 row-size=8B cardinality=7305                                |
> | |                                                                             |
> | 00:SCAN HDFS [tpcds_10000_parquet.catalog_sales, RANDOM]                      |
> |    partitions=1837/1837 files=5055 size=971.94GB                              |
> |    runtime filters: RF000 -> catalog_sales.cs_sold_date_sk                    |
> |    stats-rows=14399964710 extrapolated-rows=disabled                          |
> |    table stats: rows=14399964710 size=unavailable                             |
> |    column stats: all                                                          |
> |    mem-estimate=1.00MB mem-reservation=0B                                     |
> |    tuple-ids=0 row-size=8B cardinality=14399964710                            |
> +-------------------------------------------------------------------------------+
> {code}
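
For reference, the size of the misestimate can be sketched with simple arithmetic on the numbers in the plan. The function name and the scaling rule below are illustrative assumptions, not Impala's planner code: they assume every catalog_sales row matches exactly one date_dim row (an FK/PK join) and that the foreign keys are spread uniformly over the dimension keys. Under those assumptions the join output would be expected to shrink by the same fraction as the filtered dimension scan, whereas node 02 reports the full catalog_sales cardinality.

```python
# Numbers copied from the explain plan in the issue description.
FACT_ROWS = 14_399_964_710           # 00:SCAN catalog_sales cardinality
DIM_ROWS_TOTAL = 73_049              # date_dim table stats: rows
DIM_ROWS_FILTERED = 7_305            # 01:SCAN date_dim cardinality after predicates
REPORTED_JOIN_ROWS = 14_399_964_710  # 02:HASH JOIN cardinality in the plan


def fk_pk_join_estimate(fact_rows, dim_rows_filtered, dim_rows_total):
    """Hypothetical helper: scale the fact side by the fraction of
    dimension rows that survive the dimension-side predicates.

    Assumes an FK/PK join with uniformly distributed foreign keys.
    This is a back-of-the-envelope rule, not Impala's implementation.
    """
    return fact_rows * dim_rows_filtered // dim_rows_total


expected_join_rows = fk_pk_join_estimate(
    FACT_ROWS, DIM_ROWS_FILTERED, DIM_ROWS_TOTAL
)
overestimate_factor = REPORTED_JOIN_ROWS / expected_join_rows

print("expected join rows (scaled):", expected_join_rows)
print("reported join rows         :", REPORTED_JOIN_ROWS)
print("overestimate factor        : %.1fx" % overestimate_factor)
```

With these inputs the scaled estimate comes out at roughly a tenth of the reported join cardinality, which is the gap this issue describes.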
