You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@hive.apache.org by "Rajesh Balamohan (Jira)" <ji...@apache.org> on 2022/04/04 09:24:00 UTC

[jira] [Created] (HIVE-26110) bulk insert into partitioned table creates lots of files in iceberg

Rajesh Balamohan created HIVE-26110:
---------------------------------------

             Summary: bulk insert into partitioned table creates lots of files in iceberg
                 Key: HIVE-26110
                 URL: https://issues.apache.org/jira/browse/HIVE-26110
             Project: Hive
          Issue Type: Bug
            Reporter: Rajesh Balamohan


For e.g, create web_returns table in tpcds in iceberg format and try to copy over data from regular table. More like "insert into web_returns_iceberg as select * from web_returns".

This inserts the data correctly, however there are lot of files present in each partition. IMO, dynamic sort optimisation isn't working fine and this causes records not to be grouped in the final phase.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)