You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@iceberg.apache.org by GitBox <gi...@apache.org> on 2022/10/12 06:51:48 UTC

[GitHub] [iceberg] shay1bz commented on issue #4889: Support ORDERED BY in CTAS statement

shay1bz commented on issue #4889:
URL: https://github.com/apache/iceberg/issues/4889#issuecomment-1275677230

   It seems that Spark ( > 3) does not support creating sorted tables, unless the table is bucketed. Does this mean that the only way to create a sorted Iceberg table (without buckets) is to create it non-sorted, and then apply "WRITE ... ORDERED BY" ?
   
   Currently, we are creating the table with the Java API - not through Spark - with the desired sorting spec, and then writing the DataFrame to the existing, empty table. I'd like to avoid interacting with Iceberg API directly, but the other option (CTAS and then WRITE ORDERED BY) results in 2 Spark jobs. @kbendick I'd really appreciate you opinion on that :D Thanks.
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org
For additional commands, e-mail: issues-help@iceberg.apache.org