Posted to issues-all@impala.apache.org by "Zoltán Borók-Nagy (Jira)" <ji...@apache.org> on 2022/06/07 08:59:00 UTC

[jira] [Created] (IMPALA-11339) Implement LOAD DATA INPATH for Iceberg tables

Zoltán Borók-Nagy created IMPALA-11339:
------------------------------------------

             Summary: Implement LOAD DATA INPATH for Iceberg tables
                 Key: IMPALA-11339
                 URL: https://issues.apache.org/jira/browse/IMPALA-11339
             Project: IMPALA
          Issue Type: Bug
          Components: Frontend
            Reporter: Zoltán Borók-Nagy


Currently Impala doesn't support LOAD DATA statements for Iceberg tables.

Some user workflows still rely on this statement, so it would be useful to support it in some form.

A possible solution would be to:
 # create a temporary table over the set of files to be loaded, with the right schema
 # run an {{INSERT INTO iceberg_table SELECT * FROM tmp_table}}
 # drop the temporary table and delete the files in the staging directory

This approach copies the data instead of just moving the files, but it would probably be the safest solution.
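
As a rough illustration of the steps above, the following Python sketch builds the sequence of SQL statements that such an implementation might issue. It is only a sketch under assumptions, not the proposed Impala code: the function name {{plan_load_data}}, the {{_load_tmp}} suffix, and the example table, column, and path names are all hypothetical, and the staged files are assumed to be Parquet files whose columns match the target table.

```python
def plan_load_data(target, staging_dir, columns, file_format="PARQUET"):
    """Return, in order, the statements for the temp-table approach.

    target      -- name of the Iceberg table (hypothetical example input)
    staging_dir -- directory holding the files named in LOAD DATA INPATH
    columns     -- list of (name, type) pairs describing the staged files
    """
    tmp = f"{target}_load_tmp"  # hypothetical temporary table name
    col_defs = ", ".join(f"{name} {typ}" for name, typ in columns)
    return [
        # Step 1: temporary external table over the staged files.
        f"CREATE EXTERNAL TABLE {tmp} ({col_defs}) "
        f"STORED AS {file_format} LOCATION '{staging_dir}'",
        # Step 2: copy the rows into the Iceberg table.
        f"INSERT INTO {target} SELECT * FROM {tmp}",
        # Step 3: drop the temporary table. Dropping an external table
        # leaves the files in place, so the staging files still have to
        # be deleted separately afterwards.
        f"DROP TABLE {tmp}",
    ]


statements = plan_load_data(
    "db.ice_tbl", "/staging/dir", [("id", "INT"), ("name", "STRING")]
)
for stmt in statements:
    print(stmt + ";")
```

The sketch spells out the column list explicitly. How the real implementation would derive the schema of the temporary table is left open here.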

Users might specify the partition columns in the [PARTITION (partcol1=val1, partcol2=val2 ...)] clause. In this case the data files don't necessarily contain the partition values, so we need to create the tmp table with the proper partitioning.
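
For the PARTITION clause case, one way to picture this is a partitioned temporary table whose single partition points at the staging directory, so that the partition values come from the table metadata rather than from the files. The Python sketch below builds such a statement sequence. Again, this is an assumption-laden illustration: {{plan_partitioned_load}}, the {{_load_tmp}} suffix, and the example names are hypothetical, partition values are passed as ready-made SQL literals, and the target table is assumed to list its columns in the order "data columns, then partition columns" so that {{SELECT *}} lines up.

```python
def plan_partitioned_load(target, staging_dir, columns, partition,
                          file_format="PARQUET"):
    """Return the statements for a load with a PARTITION clause.

    columns   -- list of (name, type) pairs present in the staged files
    partition -- list of (name, type, sql_literal) triples taken from
                 PARTITION (partcol1=val1, ...); the literal must already
                 be valid SQL, e.g. "2022" or "'hu'"
    """
    tmp = f"{target}_load_tmp"  # hypothetical temporary table name
    col_defs = ", ".join(f"{name} {typ}" for name, typ in columns)
    part_defs = ", ".join(f"{name} {typ}" for name, typ, _ in partition)
    part_spec = ", ".join(f"{name}={value}" for name, _, value in partition)
    return [
        # The partition columns are not stored in the data files, so they
        # are declared in PARTITIONED BY rather than in the column list.
        f"CREATE EXTERNAL TABLE {tmp} ({col_defs}) "
        f"PARTITIONED BY ({part_defs}) STORED AS {file_format}",
        # Attach the staging directory as the one partition of tmp.
        f"ALTER TABLE {tmp} ADD PARTITION ({part_spec}) "
        f"LOCATION '{staging_dir}'",
        # SELECT * returns the partition columns after the data columns.
        f"INSERT INTO {target} SELECT * FROM {tmp}",
        f"DROP TABLE {tmp}",
    ]


statements = plan_partitioned_load(
    "db.ice_tbl",
    "/staging/dir",
    [("id", "INT"), ("name", "STRING")],
    [("year", "INT", "2022"), ("country", "STRING", "'hu'")],
)
for stmt in statements:
    print(stmt + ";")
```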



--
This message was sent by Atlassian Jira
(v8.20.7#820007)
