You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues-all@impala.apache.org by "Qifan Chen (Jira)" <ji...@apache.org> on 2020/11/13 14:44:00 UTC

[jira] [Created] (IMPALA-10325) Parquet scan should use min/max statistics to skip pages based on equi-join predicate

Qifan Chen created IMPALA-10325:
-----------------------------------

             Summary: Parquet scan should use min/max statistics to skip pages based on equi-join predicate
                 Key: IMPALA-10325
                 URL: https://issues.apache.org/jira/browse/IMPALA-10325
             Project: IMPALA
          Issue Type: Bug
            Reporter: Qifan Chen


Parquet stores min/max stats for pages which can be used to skip certain pages if they don't qualify an equi-join predicate. 

The query below ends up scanning all rows for table a, which may not be needed if the min/max of b.ss_addr_sk can be detected and used during the scan of a. 

{code:java}
select a.ss_sold_time_sk from
store_sales a join [SHUFFLE] store_sales b
where a.ss_addr_sk = b.ss_addr_sk and
b.ss_customer_sk < 10
;
{code}




--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-all-unsubscribe@impala.apache.org
For additional commands, e-mail: issues-all-help@impala.apache.org