Posted to issues-all@impala.apache.org by "Zoltán Borók-Nagy (Jira)" <ji...@apache.org> on 2023/01/05 15:04:00 UTC

[jira] [Created] (IMPALA-11826) Avoid calling planFiles() on Iceberg V2 tables when there are no predicates

Zoltán Borók-Nagy created IMPALA-11826:
------------------------------------------

             Summary: Avoid calling planFiles() on Iceberg V2 tables when there are no predicates
                 Key: IMPALA-11826
                 URL: https://issues.apache.org/jira/browse/IMPALA-11826
             Project: IMPALA
          Issue Type: Bug
          Components: Catalog, Frontend
            Reporter: Zoltán Borók-Nagy


This is similar to IMPALA-11591, but this Jira extends that change to V2 tables.

We can extend IcebergContentFileStore to group data files into two sets:
 * data files without corresponding delete files
 * data files with delete files

Once we have this grouping, we don't have to invoke planFiles() for V2 table SCANs without predicates. We can simply create the usual UNION ALL - ANTI JOIN plan directly from IcebergContentFileStore:
{noformat}
    UNION ALL
    /        \
   /          \
SCAN           ANTI JOIN
data files       /      \
without         /        \
deletes     SCAN         SCAN
            data files   delete files
            with deletes
{noformat}
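
The idea above can be sketched roughly as follows. This is only an illustration in Python, not Impala's frontend code: ContentFileStore, group_data_files and plan_scan_without_predicates are hypothetical names, and the sketch assumes the mapping from each data file to its delete files is already known when the table is loaded.

```python
from dataclasses import dataclass, field


@dataclass
class ContentFileStore:
    """Hypothetical stand-in for IcebergContentFileStore (not the real class)."""
    data_files_without_deletes: list = field(default_factory=list)
    data_files_with_deletes: list = field(default_factory=list)
    delete_files: list = field(default_factory=list)


def group_data_files(data_to_deletes):
    """Split data files by whether any delete file applies to them.

    data_to_deletes maps a data file path to the list of delete file paths
    that apply to it (assumed to be available at table load time).
    """
    store = ContentFileStore()
    seen_deletes = set()
    for data_file, deletes in data_to_deletes.items():
        if not deletes:
            store.data_files_without_deletes.append(data_file)
            continue
        store.data_files_with_deletes.append(data_file)
        # A delete file may apply to several data files; record it once.
        for delete_file in deletes:
            if delete_file not in seen_deletes:
                seen_deletes.add(delete_file)
                store.delete_files.append(delete_file)
    return store


def plan_scan_without_predicates(store):
    """Build the plan shape from the cached groups alone, as nested tuples."""
    plain_scan = ("SCAN", store.data_files_without_deletes)
    if not store.delete_files:
        # Nothing to subtract, so a plain scan is enough.
        return plain_scan
    anti_join = (
        "ANTI JOIN",
        ("SCAN", store.data_files_with_deletes),
        ("SCAN", store.delete_files),
    )
    return ("UNION ALL", plain_scan, anti_join)
```

In this sketch the planner only reads the three pre-grouped lists, so no per-query file planning step is needed when the scan has no predicates.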



--
This message was sent by Atlassian Jira
(v8.20.10#820010)
