You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@hudi.apache.org by GitBox <gi...@apache.org> on 2022/06/10 19:41:34 UTC

[GitHub] [hudi] ganczarek commented on issue #4656: [SUPPORT] Slow file listing after update to Hudi 0.10.0

ganczarek commented on issue #4656:
URL: https://github.com/apache/hudi/issues/4656#issuecomment-1152676222

   @parisni No, I wasn't able to mitigate this. We worked around it by doing the following:
   1) Disabling table metadata for all Hudi tables (`hoodie.metadata.enable=false`) noticeably speeds up read and write times. Also, performance doesn't seem to degrade over time as new partitions are added. 
   2) I redesigned our ETL so that we don't need a Hudi table with 300k partitions. Resulting table is smaller, differently partitioned, has reasonable amount of partitions and uses pure Parquet files (no Hudi)
   
   I keep tracking new releases of Hudi. Eventually, I plan to enable Hoodie metadata back.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@hudi.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org