You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@hudi.apache.org by GitBox <gi...@apache.org> on 2022/09/28 17:40:25 UTC

[GitHub] [hudi] guanziyue commented on pull request #6384: [HUDI-4613] Avoid the use of regex expressions when call hoodieFileGroup#addLogFile function

guanziyue commented on PR #6384:
URL: https://github.com/apache/hudi/pull/6384#issuecomment-1261243810

   Not sure if author uses spark. I do understand this save a lot of time on huge table especially in spark streaming mode. In spark, all writing task cannot start until FileSystemView finish loading because Hudi on spark need FileSystemView info to determine small files before generating writing task. 
   In my opinion, memory problem can be solved by other config. For example, using RocksDB Based FileSystemView which is nearly compulsory for large hudi table. But we have few to do for time consuming in this PR.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@hudi.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org