You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@hudi.apache.org by "liyuanzhao435 (Jira)" <ji...@apache.org> on 2021/10/19 05:34:00 UTC

[jira] [Comment Edited] (HUDI-2576) flink do checkpoint error because parquet file is missing

    [ https://issues.apache.org/jira/browse/HUDI-2576?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17430316#comment-17430316 ] 

liyuanzhao435 edited comment on HUDI-2576 at 10/19/21, 5:33 AM:
----------------------------------------------------------------

flink jobmanager deleted the file :

 

*2021-10-19 12:47:34,606 INFO org.apache.hudi.common.util.CommitUtils [] - Creating metadata for null numWriteStats:1numReplaceFileIds:0*
 *2021-10-19 12:47:34,607 INFO org.apache.hudi.client.AbstractHoodieWriteClient [] - Committing 20211019124727 action deltacommit*
 *2021-10-19 12:47:34,615 INFO org.apache.hudi.table.HoodieTable [] - Removing duplicate data files created due to spark retries before committing. Paths=[aa/c6eff439-d4e0-4deb-af43-f6906ab71d2b-0_0-1-0_20211019124727.parquet]*
 *2021-10-19 12:47:34,617 INFO org.apache.hudi.table.HoodieTable [] - {color:#de350b}Deleting invalid data files{color}=[(hdfs://XXXX:9999/tmp/test_liyz2/aa,hdfs://XXXX:9999/tmp/test_liyz2/aa/c6eff439-d4e0-4deb-af43-f6906ab71d2b-0_0-1-0_20211019124727.parquet)]*
 *2021-10-19 12:47:34,676 INFO org.apache.hudi.common.table.timeline.HoodieActiveTimeline [] - Marking instant complete [==>20211019124727__deltacommit__INFLIGHT]*


was (Author: liyuanzhao435):
flink jobmanager deleted the file :

 

*2021-10-19 12:47:34,606 INFO org.apache.hudi.common.util.CommitUtils [] - Creating metadata for null numWriteStats:1numReplaceFileIds:0*
 *2021-10-19 12:47:34,607 INFO org.apache.hudi.client.AbstractHoodieWriteClient [] - Committing 20211019124727 action deltacommit*
 *2021-10-19 12:47:34,615 INFO org.apache.hudi.table.HoodieTable [] - Removing duplicate data files created due to spark retries before committing. Paths=[aa/c6eff439-d4e0-4deb-af43-f6906ab71d2b-0_0-1-0_20211019124727.parquet]*
 *2021-10-19 12:47:34,617 INFO org.apache.hudi.table.HoodieTable [] - {color:#de350b}Deleting invalid data files{color}=[(hdfs://XXXX:9999/tmp/test_liyz2/aa,hdfs://XXXX:9999/tmp/test_liyz2/aa/c6eff439-d4e0-4deb-af43-f6906ab71d2b-0_0-1-0_20211019124727.parquet)]*
 *2021-10-19 12:47:34,676 INFO org.apache.hudi.common.table.timeline.HoodieActiveTimeline [] - Marking instant complete [==>20211019124727__deltacommit__INFLIGHT]*
 *2021-10-19 12:47:34,677 INFO org.apache.hudi.common.table.timeline.HoodieActiveTimeline [] - Checking for file exists ?hdfs://26.6.4.165:8020/tmp/test_liyz2/.hoodie/20211019124727.deltacommit.inflight*
 *2021-10-19 12:47:34,691 INFO org.apache.hudi.common.table.timeline.HoodieActiveTimeline [] - Create new file for toInstant ?hdfs://26.6.4.165:8020/tmp/test_liyz2/.hoodie/20211019124727.deltacommit*
 *2021-10-19 12:47:34,691 INFO org.apache.hudi.common.table.timeline.HoodieActiveTimeline [] - Completed [==>20211019124727__deltacommit__INFLIGHT]*
 *20*

> flink do  checkpoint error because parquet file is missing
> ----------------------------------------------------------
>
>                 Key: HUDI-2576
>                 URL: https://issues.apache.org/jira/browse/HUDI-2576
>             Project: Apache Hudi
>          Issue Type: Bug
>          Components: Flink Integration
>    Affects Versions: 0.10.0
>            Reporter: liyuanzhao435
>            Priority: Major
>              Labels: flink, hudi
>             Fix For: 0.10.0
>
>         Attachments: error.txt
>
>   Original Estimate: 96h
>  Remaining Estimate: 96h
>
> hudi:0.10.0, flink 1.13.1
> some times when flink do checkpoint , error occurs,  the error shows a hudi parquet file is missing (says file not exists) : 
> *2021-10-19 09:20:03,796 INFO org.apache.hudi.io.storage.row.HoodieRowDataCreateHandle [] - start close hoodie row data*
> *2021-10-19 09:20:03,800 WARN org.apache.hadoop.hdfs.DataStreamer [] - DataStreamer Exception*
> *java.io.FileNotFoundException: File does not exist: /tmp/test_liyz2/aa/2ff301cc-8db2-478e-b707-e8f2327ba38f-0_0-1-4_20211019091917.parquet (inode 32234795) Holder DFSClient_NONMAPREDUCE_633610786_99 does not have any open files.*
>  *at org.apache.hadoop.hdfs.server.namenode.FSNamesystem.checkLease(FSNamesystem.java:2815)*
>  
> detail see  appendix



--
This message was sent by Atlassian Jira
(v8.3.4#803005)