You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@hudi.apache.org by GitBox <gi...@apache.org> on 2022/02/23 07:39:31 UTC

[GitHub] [hudi] yanenze edited a comment on issue #4855: [SUPPORT] mor table parquet file max file size does not go into effect

yanenze edited a comment on issue #4855:
URL: https://github.com/apache/hudi/issues/4855#issuecomment-1048506073


   > Yes, the file size control is not accurate, how much size does the actual file size exceed your desired threshold ?
   
   hello, i configured the max parquet file size 128M but finally it grow to 8G,
   so i try to find the reason.
   i have create a pull request  in #4879  
   i think the big file generate reason is when bucketAssigner find small file list , is lost the file which is  in pendingCompaction, so the total size only caculate the  (log file size * compressratio (0.35)) 


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@hudi.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org