You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@hive.apache.org by "George Song (Jira)" <ji...@apache.org> on 2021/05/18 05:27:00 UTC
[jira] [Updated] (HIVE-25120) VectorizedParquetRecordReader can't
handle encrypted parquet files
[ https://issues.apache.org/jira/browse/HIVE-25120?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
George Song updated HIVE-25120:
-------------------------------
Summary: VectorizedParquetRecordReader can't handle encrypted parquet files (was: VectorizedParquetRecordReader can't to read parquet file with encrypted footer)
> VectorizedParquetRecordReader can't handle encrypted parquet files
> ------------------------------------------------------------------
>
> Key: HIVE-25120
> URL: https://issues.apache.org/jira/browse/HIVE-25120
> Project: Hive
> Issue Type: Bug
> Components: Parquet
> Affects Versions: 3.1.2
> Reporter: George Song
> Priority: Major
>
> In parquet 1.12.0 the modular encryption feature is introduced. https://issues.apache.org/jira/browse/PARQUET-1178
> VectorizedParquetRecordReader can't read parquet files with encrypted footer. It throws the following exceptions.
> {code:java}
> Error: java.io.IOException: java.lang.reflect.InvocationTargetException
> at org.apache.hadoop.hive.io.HiveIOExceptionHandlerChain.handleRecordReaderCreationException(HiveIOExceptionHandlerChain.java:97)
> at org.apache.hadoop.hive.io.HiveIOExceptionHandlerUtil.handleRecordReaderCreationException(HiveIOExceptionHandlerUtil.java:57)
> at org.apache.hadoop.hive.shims.HadoopShimsSecure$CombineFileRecordReader.initNextRecordReader(HadoopShimsSecure.java:271)
> at org.apache.hadoop.hive.shims.HadoopShimsSecure$CombineFileRecordReader.<init>(HadoopShimsSecure.java:217)
> at org.apache.hadoop.hive.shims.HadoopShimsSecure$CombineFileInputFormatShim.getRecordReader(HadoopShimsSecure.java:345)
> at org.apache.hadoop.hive.ql.io.CombineHiveInputFormat.getRecordReader(CombineHiveInputFormat.java:719)
> at org.apache.hadoop.mapred.MapTask$TrackedRecordReader.<init>(MapTask.java:175)
> at org.apache.hadoop.mapred.MapTask.runOldMapper(MapTask.java:444)
> at org.apache.hadoop.mapred.MapTask.run(MapTask.java:349)
> at org.apache.hadoop.mapred.YarnChild$2.run(YarnChild.java:178)
> at java.security.AccessController.doPrivileged(Native Method)
> at javax.security.auth.Subject.doAs(Subject.java:422)
> at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1845)
> at org.apache.hadoop.mapred.YarnChild.main(YarnChild.java:172)
> Caused by: java.lang.reflect.InvocationTargetException
> at sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native Method)
> at sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:62)
> at sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:45)
> at java.lang.reflect.Constructor.newInstance(Constructor.java:423)
> at org.apache.hadoop.hive.shims.HadoopShimsSecure$CombineFileRecordReader.initNextRecordReader(HadoopShimsSecure.java:257)
> ... 11 more
> Caused by: java.lang.RuntimeException: org.apache.parquet.crypto.ParquetCryptoRuntimeException: Trying to read file with encrypted footer. No keys available
> at org.apache.hadoop.hive.ql.io.parquet.vector.VectorizedParquetRecordReader.<init>(VectorizedParquetRecordReader.java:156)
> at org.apache.hadoop.hive.ql.io.parquet.VectorizedParquetInputFormat.getRecordReader(VectorizedParquetInputFormat.java:50)
> at org.apache.hadoop.hive.ql.io.parquet.MapredParquetInputFormat.getRecordReader(MapredParquetInputFormat.java:87)
> at org.apache.hadoop.hive.ql.io.CombineHiveRecordReader.<init>(CombineHiveRecordReader.java:99)
> ... 16 more
> Caused by: org.apache.parquet.crypto.ParquetCryptoRuntimeException: Trying to read file with encrypted footer. No keys available
> at org.apache.parquet.hadoop.ParquetFileReader.readFooter(ParquetFileReader.java:588)
> at org.apache.parquet.hadoop.ParquetFileReader.readFooter(ParquetFileReader.java:527)
> at org.apache.parquet.hadoop.ParquetFileReader.readFooter(ParquetFileReader.java:521)
> at org.apache.hadoop.hive.ql.io.parquet.vector.VectorizedParquetRecordReader.readFooterFromFile(VectorizedParquetRecordReader.java:345)
> at org.apache.hadoop.hive.ql.io.parquet.vector.VectorizedParquetRecordReader.readSplitFooter(VectorizedParquetRecordReader.java:310)
> at org.apache.hadoop.hive.ql.io.parquet.vector.VectorizedParquetRecordReader.initialize(VectorizedParquetRecordReader.java:222)
> at org.apache.hadoop.hive.ql.io.parquet.vector.VectorizedParquetRecordReader.<init>(VectorizedParquetRecordReader.java:151)
> ... 19 more
> {code}
>
>
--
This message was sent by Atlassian Jira
(v8.3.4#803005)