You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@hudi.apache.org by GitBox <gi...@apache.org> on 2020/07/15 07:07:02 UTC

[GitHub] [hudi] kirkuz commented on issue #1828: [SUPPORT] Cannot force hudi to retain only last commit

kirkuz commented on issue #1828:
URL: https://github.com/apache/hudi/issues/1828#issuecomment-658587504


   Hi @bhasudha, thanks for this information. Now, it's clear how it's working. My use case is as following: I want to have only last change in parquet files, because when I try to read it with AWS Athena it sees doubled record with different _hoodie_commit. In ideal world I want to have one S3 bucket with only last change (to not force users to deduplicate it in AWS Athena query) and the second bucket with all commits to have whole history. 
   
   Can you recommend me sth? 


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org