You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@hudi.apache.org by GitBox <gi...@apache.org> on 2021/06/01 20:26:25 UTC

[GitHub] [hudi] prashantwason commented on issue #2995: [SUPPORT] Upserts creating duplicates after enabling metadata table in Hudi 0.7 indexing pipeline

prashantwason commented on issue #2995:
URL: https://github.com/apache/hudi/issues/2995#issuecomment-852423034


   @jtmzheng if you have logs from the spark applications, please grep to check if any HoodieMetadataException was ever raised. There may have been an exception which got swallowed so the pipeline did not fail but there was an error anyways.
   
   Also, can you check if you have any non-completed instants on your hoodie timeline (i.e. instants which have .requested and .inflight but not the third file). Alternatively, you can provide the listing of the following two dirs and I can check:
    s3://plaid-test-emr/hudi/production/transactions/.hoodie
    s3://plaid-test-emr/hudi/production/transactions/.hoodie/metadata/.hoodie
   
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org