You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@orc.apache.org by Thomas Abeler <th...@sensenetworks.com> on 2015/08/12 00:38:14 UTC

Reading ORC Files from S3n

Hello,

 

I am running in some huge performance issues when I want to query my orc
files stored on S3 with Hive 0.12.

My orc files have a stripe size of 16 MB - this might be a problem when it
comes to read performance, but it can't be the only reason. I was running a
simple query on ~600GB ORC Data and it took a 16 node cluster about 2hrs to
perform ~5% of the Job.

 

Are there any known issues for the hive orcreader?

 

My create statement looks like this

 

CREATE external TABLE externalORC (..)

STORED AS ORC 

location 's3n://./';

 

Kind regards,

Thomas