You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues-all@impala.apache.org by "Zoltán Borók-Nagy (Jira)" <ji...@apache.org> on 2021/11/22 10:51:00 UTC

[jira] [Created] (IMPALA-11034) Resolve schema of old data files in migrated Iceberg tables

Zoltán Borók-Nagy created IMPALA-11034:
------------------------------------------

             Summary: Resolve schema of old data files in migrated Iceberg tables
                 Key: IMPALA-11034
                 URL: https://issues.apache.org/jira/browse/IMPALA-11034
             Project: IMPALA
          Issue Type: Bug
            Reporter: Zoltán Borók-Nagy


When external tables are converted to Iceberg, the data files remain intact.
This means that the old data files don't have field id information which is essential for schema evolution.

However, there is a workaround for this, see: https://github.com/trinodb/trino/issues/9843

Basically we need to translate the current schema to the first schema of the table using the field ids, then we can use name-based on position-based schema resolution in the data files.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-all-unsubscribe@impala.apache.org
For additional commands, e-mail: issues-all-help@impala.apache.org