You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@impala.apache.org by "Tamas Mate (Jira)" <ji...@apache.org> on 2022/08/11 11:41:00 UTC

[jira] [Resolved] (IMPALA-10453) Support file/partition pruning via runtime filters on Iceberg

     [ https://issues.apache.org/jira/browse/IMPALA-10453?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Tamas Mate resolved IMPALA-10453.
---------------------------------
    Resolution: Resolved

> Support file/partition pruning via runtime filters on Iceberg
> -------------------------------------------------------------
>
>                 Key: IMPALA-10453
>                 URL: https://issues.apache.org/jira/browse/IMPALA-10453
>             Project: IMPALA
>          Issue Type: Improvement
>          Components: Backend
>            Reporter: Tim Armstrong
>            Assignee: Tamas Mate
>            Priority: Major
>              Labels: iceberg, impala-iceberg, performance
>
> This is a placeholder to figure out what we'd need to do to support dynamic file-level pruning in Iceberg using runtime filters, i.e. have parity for partition pruning.
> * If there is a single partition value per file, then applying bloom filters to the row group stats would be effective at pruning files.
> * If there are partition transforms, e.g. hash-based, then I think we probably need to track the partition that the file is associated with and then have some custom logic in the parquet scanner to do partition pruning.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)