You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@hudi.apache.org by GitBox <gi...@apache.org> on 2020/06/19 15:42:27 UTC

[GitHub] [hudi] vinothchandar commented on issue #1737: [SUPPORT]spark streaming create small parquet files

vinothchandar commented on issue #1737:
URL: https://github.com/apache/hudi/issues/1737#issuecomment-646704256


   @cocopc MOR writes out parquet files for inserts only.. and we do have small file handling for those parquet files as well.. 
   https://github.com/apache/hudi/blob/release-0.5.2/hudi-client/src/main/java/org/apache/hudi/table/HoodieMergeOnReadTable.java#L349
   
   its controlled by the same `hoodie.parquet.max.file.size=120M` config you have.. can you provide us with hoodie cli output or a listing of the file system for us to look at?


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org