You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@hudi.apache.org by GitBox <gi...@apache.org> on 2022/04/25 10:40:12 UTC

[GitHub] [hudi] JerryYue-M commented on pull request #5410: [HUDI-3953]Flink Hudi module should support low-level read and write…

JerryYue-M commented on PR #5410:
URL: https://github.com/apache/hudi/pull/5410#issuecomment-1108399547

   
   
   
   
   > Can we give some explanation about why we need this change ? Looks like it is a code refactor but i see no gains.
   > 
   > One rule is that we should not copy new clazz with similar API with existing public APIs.
   
   Indeed, it had some code refactor. the gain is to use the existing SQL API code to implement a low-level API for users
   eg:
   Write:
    ```
   DataStream<Row> input = dataStreamGen();
           Map<String, String> confMap = new HashMap<>();
           confMap.put("connector" , "hudi");
           confMap.put("table.type", "MERGE_ON_READ");
           confMap.put("path" , "hdfs://127.0.0.1:9000/hudi/hudi_db/mor_test9");
   
           SinkBuilder.builder()
                            .input(input)
                            .options(confMap)
                            .partitions(Arrays.asList("dt", "hr"))
                            .schema(CREATE_TABLE_SCHEMA)
                            .sink();
   ```
   Read
   ```
   Map<String, String> confMap = new HashMap<>();
           confMap.put("connector" , "hudi");
           confMap.put("table.type", "MERGE_ON_READ");
           confMap.put("path" , "hdfs://127.0.0.1:9000/hudi/hudi_db/mor_test9");
           confMap.put("read.streaming.enabled" , "true");
           confMap.put("read.streaming.check-interval" , "4");
   
           DataStream<Row> rowDataStream = SourceBuilder
                   .builder()
                   .env(getEnv())
                   .schema(CREATE_TABLE_SCHEMA)
                   .options(confMap)
                   .partitions(Arrays.asList("dt", "hr"))
                   .source();
           rowDataStream.print();
   ```
   
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@hudi.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org