Posted to dev@hive.apache.org by "Rajesh Balamohan (Jira)" <ji...@apache.org> on 2022/04/04 09:24:00 UTC
[jira] [Created] (HIVE-26110) bulk insert into partitioned table creates lots of files in iceberg
Rajesh Balamohan created HIVE-26110:
---------------------------------------
Summary: bulk insert into partitioned table creates lots of files in iceberg
Key: HIVE-26110
URL: https://issues.apache.org/jira/browse/HIVE-26110
Project: Hive
Issue Type: Bug
Reporter: Rajesh Balamohan
For example, create the TPC-DS web_returns table in Iceberg format and copy the data over from the regular table, with something like "insert into web_returns_iceberg select * from web_returns".
This inserts the data correctly; however, there are a lot of files present in each partition. IMO, the dynamic sort optimisation isn't working properly here, so records are not grouped by partition in the final phase.
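To illustrate why ungrouped records would lead to many files per partition, here is a toy model in Python. It is only a sketch, not Hive's or Iceberg's actual writer logic: it assumes each writer task emits one file per distinct partition it sees, and the function name count_files, the task count and the partition keys are all made up for the example.

```python
from collections import defaultdict


def count_files(records, num_tasks, group_by_partition):
    """Return {partition_key: number_of_files} for a toy partitioned writer.

    Assumption (hypothetical model): every writer task writes exactly one
    file for each distinct partition key among the rows it receives.
    """
    tasks = defaultdict(list)
    for i, (partition_key, _row) in enumerate(records):
        if group_by_partition:
            # Grouped: all rows of a partition are routed to the same task.
            task_id = hash(partition_key) % num_tasks
        else:
            # Ungrouped: rows are spread round-robin, ignoring the partition.
            task_id = i % num_tasks
        tasks[task_id].append(partition_key)

    files_per_partition = defaultdict(int)
    for keys in tasks.values():
        for key in set(keys):
            files_per_partition[key] += 1
    return dict(files_per_partition)


# 400 rows spread over 5 partitions, written by 8 tasks.
records = [(i % 5, i) for i in range(400)]

ungrouped = count_files(records, num_tasks=8, group_by_partition=False)
grouped = count_files(records, num_tasks=8, group_by_partition=True)

print(sorted(ungrouped.items()))  # [(0, 8), (1, 8), (2, 8), (3, 8), (4, 8)]
print(sorted(grouped.items()))    # [(0, 1), (1, 1), (2, 1), (3, 1), (4, 1)]
```

In this model every task touches every partition when rows are not grouped, so the file count per partition grows with the number of tasks; grouping rows by partition before the final write brings it back to one file per partition.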
--
This message was sent by Atlassian Jira
(v8.20.1#820001)