You are viewing a plain text version of this content. The canonical link for it is here.

Posted to issues@iceberg.apache.org by "szehon-ho (via GitHub)" <gi...@apache.org> on 2023/04/29 01:19:35 UTC

[GitHub] [iceberg] szehon-ho commented on pull request #7430: Allow sparksql to override target split size with session property

szehon-ho commented on PR #7430:
URL: https://github.com/apache/iceberg/pull/7430#issuecomment-1528384337

   I think one use case for different split size, is the more advanced GDPR use cases (row level delete).  
   
   If you have completely optimized the job to do a broadcast join (no shuffle), you can control the number of written files  by split size.  This may be different than doing a regular read where performance is the main concern.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org
For additional commands, e-mail: issues-help@iceberg.apache.org