
Posted to issues@iceberg.apache.org by GitBox <gi...@apache.org> on 2022/08/21 06:53:47 UTC

[GitHub] [iceberg] a-agmon commented on issue #5593: Reading records inserted using Athena throws UOE exception when read using Spark (AWS)

a-agmon commented on issue #5593:
URL: https://github.com/apache/iceberg/issues/5593#issuecomment-1221481441

   Confirming that the solution proposed by @singhpk234 works.
   Creating the table from Spark as follows avoids the issue: records can now be inserted from Athena and read by Spark.
   
   ```scala
       val icebergDDL =
         """
           |CREATE TABLE IF NOT EXISTS my_catalog.default.iceberg_demo_1 (
           |    rec_id string,
           |    data string,
           |    category string)
           |USING iceberg
           |PARTITIONED BY (category)
           |TBLPROPERTIES ('read.parquet.vectorization.enabled'='false')
           |
           |""".stripMargin
   
       spark.sql(icebergDDL)
   ```
   Thank you very much! 
   
   Is there a way to create a table with this property set using Athena? (More precisely, using a Python client: on AWS this is usually done via AWS wrangler, which uses the Athena client SDK and cannot set this property.)
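   
   One possible workaround, not verified against this setup: since the property is stored in the Iceberg table metadata, a table created from Athena could presumably have it set afterwards from Spark with `ALTER TABLE ... SET TBLPROPERTIES`. Below is a minimal sketch under that assumption. The helper `alterTableSql` is a hypothetical name introduced here for illustration, and `spark` is assumed to be a session configured for `my_catalog` as in the snippet above.
   
   ```scala
   object SetTableProperty {
     // Hypothetical helper: renders an ALTER TABLE ... SET TBLPROPERTIES statement.
     // It does not escape single quotes, so keys and values must not contain them.
     def alterTableSql(table: String, props: Map[String, String]): String = {
       val rendered = props.toSeq
         .sortBy(_._1) // stable ordering so the generated SQL is deterministic
         .map { case (key, value) => s"'$key'='$value'" }
         .mkString(", ")
       s"ALTER TABLE $table SET TBLPROPERTIES ($rendered)"
     }
   
     def main(args: Array[String]): Unit = {
       val sql = alterTableSql(
         "my_catalog.default.iceberg_demo_1",
         Map("read.parquet.vectorization.enabled" -> "false"))
       println(sql)
       // With a SparkSession named `spark` (assumed, as in the snippet above):
       // spark.sql(sql)
     }
   }
   ```
   
   This still needs a Spark session to run once per table, so it does not answer whether the property can be set from Athena itself.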


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org

