You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@orc.apache.org by Alexandre Gonçalves <aa...@gmail.com> on 2019/07/05 15:14:27 UTC

Java ORC tool, buckets and Amazon Athena

Dear friends,

I am using Java ORC tool (https://orc.apache.org/docs/java-tools.html)
to convert JSON records into ORC files. I am querying the ORC files on
Amazon Athena. It is working like a charm.

My question: can I use the same Java ORC tool to convert JSON records
into a bucketed ORC table?

If not, should I use Apache Hive to perform the conversion from JSON
records into a bucketed ORC table?

I want to see if bucketed ORC files increase the performance of my
queries on Amazon Athena.

I look forward to hearing from you.

All the best,
Alex.