You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues-all@impala.apache.org by "Csaba Ringhofer (Jira)" <ji...@apache.org> on 2023/05/23 05:33:00 UTC

[jira] [Created] (IMPALA-12157) Use predicates with OR in Parquet filtering

Csaba Ringhofer created IMPALA-12157:
----------------------------------------

             Summary: Use predicates with OR in Parquet filtering
                 Key: IMPALA-12157
                 URL: https://issues.apache.org/jira/browse/IMPALA-12157
             Project: IMPALA
          Issue Type: Improvement
          Components: Backend, Frontend
            Reporter: Csaba Ringhofer


Currently the Parquet scanner doesn't use predicates like "WHERE a=1 OR b=2" for dictionary and stat filtering.
The handling could look like this:
1. evaluate each conjunct at file level (file stats, dict filtering, bloom filters)
   - if all evaluate to false, skip the file
2. check the remaining conjunct's column indexes and create the union of the valid ranges 



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-all-unsubscribe@impala.apache.org
For additional commands, e-mail: issues-all-help@impala.apache.org