Posted to commits@hudi.apache.org by "JinxinTang (Jira)" <ji...@apache.org> on 2022/07/19 09:34:00 UTC
[jira] [Commented] (HUDI-4422) read parquet failed due to length is 0 or corrupt parquet file
[ https://issues.apache.org/jira/browse/HUDI-4422?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17568442#comment-17568442 ]
JinxinTang commented on HUDI-4422:
----------------------------------
Please assign this to me; I can fix it.
> read parquet failed due to length is 0 or corrupt parquet file
> --------------------------------------------------------------
>
> Key: HUDI-4422
> URL: https://issues.apache.org/jira/browse/HUDI-4422
> Project: Apache Hudi
> Issue Type: Bug
> Reporter: JinxinTang
> Priority: Major
>
> Caused by: java.lang.RuntimeException: hdfs://xxx/user/xxx/dvc_dw/xxx/2022-07-02/19/18406cad-4f0d-45a7-b0ae-e9e97cda9315_21-24-22_20220704190443531.parquet is not a Parquet file. Expected magic number at tail, but found [-118, -54, 104, 4]
> at org.apache.hudi.org.apache.parquet.hadoop.ParquetFileReader.readFooter(ParquetFileReader.java:556)
> at org.apache.hudi.org.apache.parquet.hadoop.ParquetFileReader.<init>(ParquetFileReader.java:776)
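For context on the error above: a Parquet file ends with the 4-byte ASCII magic "PAR1" (bytes [80, 65, 82, 49]), preceded by a 4-byte footer length, so a file shorter than 12 bytes, or one whose tail is something else such as [-118, -54, 104, 4], cannot be read. The following is only a minimal sketch of such a pre-check, not the fix adopted for this issue. It assumes a local file accessed through java.nio rather than HDFS, and the names ParquetTailCheck and looksLikeParquet are hypothetical. It does not cover files with an encrypted footer, which end with a different magic.

```java
import java.io.IOException;
import java.io.RandomAccessFile;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.Arrays;

// Hypothetical helper, not part of Hudi or Parquet.
final class ParquetTailCheck {
    // Magic bytes that open and close a plain (unencrypted-footer) Parquet file.
    private static final byte[] MAGIC = "PAR1".getBytes(StandardCharsets.US_ASCII);
    // Smallest possible file: leading magic + 4-byte footer length + trailing magic.
    private static final int MIN_LENGTH = MAGIC.length + 4 + MAGIC.length;

    // Returns true only if the file is long enough and ends with the magic bytes.
    // Assumption: the file is on the local filesystem.
    static boolean looksLikeParquet(Path file) throws IOException {
        long length = Files.size(file);
        if (length < MIN_LENGTH) {
            return false; // covers the zero-length case from the issue title
        }
        byte[] tail = new byte[MAGIC.length];
        try (RandomAccessFile raf = new RandomAccessFile(file.toFile(), "r")) {
            raf.seek(length - MAGIC.length);
            raf.readFully(tail);
        }
        return Arrays.equals(MAGIC, tail);
    }

    public static void main(String[] args) throws IOException {
        for (String arg : args) {
            Path p = Paths.get(arg);
            System.out.println(p + ": " + (looksLikeParquet(p) ? "ok" : "empty or corrupt"));
        }
    }
}
```

A reader could run a check like this before opening a base file and skip or report the files that fail it, instead of letting the whole read abort. Whether skipping is acceptable depends on the table's consistency requirements.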
--
This message was sent by Atlassian Jira
(v8.20.10#820010)