You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues-all@impala.apache.org by "Zoltán Borók-Nagy (Jira)" <ji...@apache.org> on 2023/01/10 17:18:00 UTC
[jira] [Assigned] (IMPALA-11826) Avoid calling planFiles() on Iceberg V2 tables when there are no predicates
[ https://issues.apache.org/jira/browse/IMPALA-11826?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Zoltán Borók-Nagy reassigned IMPALA-11826:
------------------------------------------
Assignee: Zoltán Borók-Nagy
> Avoid calling planFiles() on Iceberg V2 tables when there are no predicates
> ---------------------------------------------------------------------------
>
> Key: IMPALA-11826
> URL: https://issues.apache.org/jira/browse/IMPALA-11826
> Project: IMPALA
> Issue Type: Bug
> Components: Catalog, Frontend
> Reporter: Zoltán Borók-Nagy
> Assignee: Zoltán Borók-Nagy
> Priority: Major
> Labels: impala-iceberg
>
> Similar to IMPALA-11591 but this Jira extends it to V2 tables.
> We can extend IcebergContentFileStore to group data files:
> * data files without corresponding delete files
> * data files with delete files
> Once we have this we don't have to invoke planFiles() on V2 table SCANs without predicates. We can just simply create the usual UNION - ANTI JOIN plan based on IcebergContentFileStore:
> {noformat}
> UNION ALL
> / \
> / \
> SCAN ANTI JOIN
> data files / \
> without / \
> deletes SCAN SCAN
> data files delete files
> with deletes{noformat}
--
This message was sent by Atlassian Jira
(v8.20.10#820010)
---------------------------------------------------------------------
To unsubscribe, e-mail: issues-all-unsubscribe@impala.apache.org
For additional commands, e-mail: issues-all-help@impala.apache.org