Posted to user@orc.apache.org by Thomas Abeler <th...@sensenetworks.com> on 2015/08/12 00:38:14 UTC
Reading ORC Files from S3n
Hello,
I am running into severe performance issues when querying my ORC files
stored on S3 with Hive 0.12.
My ORC files have a stripe size of 16 MB. This might hurt read performance,
but it can't be the only reason. I ran a simple query on ~600 GB of ORC data,
and it took a 16-node cluster about 2 hours to complete ~5% of the job.
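To put these numbers in perspective, here is a rough back-of-the-envelope sketch. It assumes binary units (1 GB = 1024 MB), treats the approximate figures above as exact, and assumes the work is spread evenly across the 16 nodes. The function name and parameters are only for illustration.

```python
def estimate_read_stats(data_gb=600, stripe_mb=16, nodes=16,
                        hours=2.0, fraction_done=0.05):
    """Rough estimate based on the approximate figures in this message."""
    data_mb = data_gb * 1024                 # assumption: 1 GB = 1024 MB
    stripe_count = data_mb / stripe_mb       # number of 16 MB stripes in the data
    mb_read = data_mb * fraction_done        # data covered by ~5% of the job
    cluster_mb_per_s = mb_read / (hours * 3600)
    node_mb_per_s = cluster_mb_per_s / nodes  # assumption: even spread over nodes
    return stripe_count, cluster_mb_per_s, node_mb_per_s


stripes, cluster_rate, node_rate = estimate_read_stats()
print(f"stripes: {stripes:.0f}")
print(f"cluster throughput: {cluster_rate:.2f} MB/s")
print(f"per-node throughput: {node_rate:.2f} MB/s")
```

Under these assumptions that works out to roughly 38,400 stripes and about 4.3 MB/s for the whole cluster (around 0.27 MB/s per node).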
Are there any known issues with the Hive ORC reader?
My CREATE statement looks like this:
CREATE EXTERNAL TABLE externalORC (..)
STORED AS ORC
LOCATION 's3n://./';
Kind regards,
Thomas