You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@iceberg.apache.org by GitBox <gi...@apache.org> on 2020/09/11 07:47:14 UTC

[GitHub] [iceberg] pvary edited a comment on pull request #1407: Hive: HiveIcebergOutputFormat first implementation for handling Hive inserts into unpartitioned Iceberg tables - WIP

pvary edited a comment on pull request #1407:
URL: https://github.com/apache/iceberg/pull/1407#issuecomment-690933303


   The current status of the PR:
   
   **Open questions:**
   - Waiting for #1430 _API: Fix Metrics serialization_ to be approved, so I can get rid of the extra _SerializableMetrics_ class
   - Should every task write a _.committed_ file, or just the task in the last phase of the job - Decision needed what is the main goal, performance or generality. Current solution is more performance friendly
   - Do we need to handle other Iceberg constructs than _newAppend()_ for adding data to the table? If I understand everything correctly this should be enough to handle inserts started from Hive
   
   **Postponed:**
   - Finding a better place for the configuration
   
   **Non-goals:**
   - Updates/Deletes
   - Insert overwrites
   - Partitioned Iceberg table handling
   - Complex structures (Map / List / Struct)


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org
For additional commands, e-mail: issues-help@iceberg.apache.org