Posted to issues-all@impala.apache.org by "Quanlong Huang (Jira)" <ji...@apache.org> on 2021/08/30 11:22:00 UTC

[jira] [Created] (IMPALA-10898) Runtime IN-list filters for ORC tables

Quanlong Huang created IMPALA-10898:
---------------------------------------

             Summary: Runtime IN-list filters for ORC tables
                 Key: IMPALA-10898
                 URL: https://issues.apache.org/jira/browse/IMPALA-10898
             Project: IMPALA
          Issue Type: Improvement
          Components: Backend
            Reporter: Quanlong Huang


Impala currently has two kinds of runtime filters: bloom filters and min-max filters. Neither can take advantage of the bloom filters stored in ORC files, because only EQUALS and IN-list
predicates can be evaluated against those bloom filters to skip unrelated ORC RowGroups.

This JIRA aims to add runtime IN-list filters for hash joins whose build side is small (e.g. #rows <= 1024).
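
To illustrate the idea, here is a minimal Python sketch, not Impala's actual implementation. It collects the distinct build-side join keys into an IN-list, gives up once the list grows past a cap, and otherwise probes a per-RowGroup membership test with each value to decide whether the RowGroup can be skipped. The names InListFilter, MAX_IN_LIST_ENTRIES and might_match_row_group are hypothetical, the cap of 1024 is only the example figure from the description above, and a plain Python set stands in for an ORC bloom filter.

```python
from typing import Callable, Hashable, Set

# Example cap taken from the issue text; the constant name is hypothetical.
MAX_IN_LIST_ENTRIES = 1024


class InListFilter:
    """Hypothetical IN-list runtime filter built from hash-join build-side keys."""

    def __init__(self, max_entries: int = MAX_IN_LIST_ENTRIES) -> None:
        self.max_entries = max_entries
        self.values: Set[Hashable] = set()
        # Once the build side turns out to be too large, the filter is
        # disabled and accepts everything.
        self.always_true = False

    def insert(self, value: Hashable) -> None:
        """Add one build-side join key, disabling the filter past the cap."""
        if self.always_true:
            return
        self.values.add(value)
        if len(self.values) > self.max_entries:
            self.values.clear()
            self.always_true = True

    def might_match_row_group(self, might_contain: Callable[[Hashable], bool]) -> bool:
        """Return False only if no IN-list value can be present in the RowGroup.

        `might_contain` stands in for probing the RowGroup's bloom filter:
        it may report false positives but never false negatives.
        """
        if self.always_true:
            return True
        return any(might_contain(v) for v in self.values)


if __name__ == "__main__":
    in_list = InListFilter()
    for key in (7, 42, 42, 99):      # build-side join keys, with a duplicate
        in_list.insert(key)

    # Python sets used as stand-ins for per-RowGroup bloom filters.
    row_groups = {"rg0": {1, 2, 3}, "rg1": {42, 50}}
    for name, members in row_groups.items():
        keep = in_list.might_match_row_group(members.__contains__)
        print(name, "read" if keep else "skip")
```

A bloom or min-max runtime filter cannot be pushed down this way, since probing an ORC bloom filter needs concrete values; keeping the exact value list is what makes the pushdown possible, and the cap keeps that list cheap to build and ship.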



