Posted to commits@hudi.apache.org by "Bo Cui (Jira)" <ji...@apache.org> on 2022/02/17 10:29:00 UTC

[jira] [Created] (HUDI-3446) support batch Reader in BootstrapOperator#loadRecords

Bo Cui created HUDI-3446:
----------------------------

             Summary: support batch Reader in BootstrapOperator#loadRecords
                 Key: HUDI-3446
                 URL: https://issues.apache.org/jira/browse/HUDI-3446
             Project: Apache Hudi
          Issue Type: Improvement
          Components: flink
            Reporter: Bo Cui


[https://github.com/apache/hudi/blob/433c2573ef5d1f25cba42f169679a4afe5ff2e24/hudi-flink/src/main/java/org/apache/hudi/sink/bootstrap/BootstrapOperator.java#L216]

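In our production environment, restarting the Flink job requires a lot of memory, because BootstrapOperator#loadRecords loads the records and sends them to BucketAssignFunction, and all records need to be cached in the TaskManager (TM) heap before any of them are sent. Reading and emitting the records in batches would avoid holding all of them in memory at once.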
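
A minimal sketch of the batching idea, not the actual Hudi code: the class name BatchedLoadSketch, the method emitInBatches, and the batchSize parameter are all hypothetical, and a plain Iterator and Consumer stand in for the record reader and the downstream collector. It only illustrates emitting each batch before reading further, so that at most one batch is buffered at a time.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Iterator;
import java.util.List;
import java.util.function.Consumer;

// Hypothetical illustration only; none of these names exist in Hudi.
final class BatchedLoadSketch {

    /**
     * Drains the iterator in batches of at most batchSize records and hands
     * each batch to the sink before reading more, so the buffer never holds
     * more than batchSize records. Returns the number of records emitted.
     */
    public static <T> long emitInBatches(Iterator<T> records, int batchSize,
                                         Consumer<List<T>> sink) {
        if (batchSize <= 0) {
            throw new IllegalArgumentException("batchSize must be positive");
        }
        List<T> buffer = new ArrayList<>(batchSize);
        long total = 0;
        while (records.hasNext()) {
            buffer.add(records.next());
            if (buffer.size() == batchSize) {
                // Pass a copy so the sink may keep the batch after we clear the buffer.
                sink.accept(new ArrayList<>(buffer));
                total += buffer.size();
                buffer.clear();
            }
        }
        if (!buffer.isEmpty()) {
            // Flush the final, possibly smaller, batch.
            sink.accept(new ArrayList<>(buffer));
            total += buffer.size();
        }
        return total;
    }

    public static void main(String[] args) {
        List<Integer> batchSizes = new ArrayList<>();
        Iterator<String> keys = Arrays.asList("k1", "k2", "k3", "k4", "k5").iterator();
        long emitted = emitInBatches(keys, 2, batch -> batchSizes.add(batch.size()));
        System.out.println(emitted + " records emitted, batch sizes " + batchSizes);
    }
}
```

In a real operator the iterator would be backed by the file reader and the sink would forward each record downstream; how that maps onto BootstrapOperator is left open here.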



--
This message was sent by Atlassian Jira
(v8.20.1#820001)