Posted to commits@hudi.apache.org by "liujinhui (Jira)" <ji...@apache.org> on 2020/01/08 12:53:00 UTC

[jira] [Commented] (HUDI-76) CSV Source support for Hudi Delta Streamer

    [ https://issues.apache.org/jira/browse/HUDI-76?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17010658#comment-17010658 ] 

liujinhui commented on HUDI-76:
-------------------------------

[~guoyihua] Hello, I have looked at your CSV-to-Row implementation. I think the key point is how the data is formatted: with the current implementation, the resulting data is not in a good format. Could it be parsed according to the Hudi schema instead? Are you already working on this? I ran into the same scenario recently and am ready to work on this problem.

> CSV Source support for Hudi Delta Streamer
> ------------------------------------------
>
>                 Key: HUDI-76
>                 URL: https://issues.apache.org/jira/browse/HUDI-76
>             Project: Apache Hudi (incubating)
>          Issue Type: Improvement
>          Components: DeltaStreamer, Incremental Pull
>            Reporter: Balaji Varadarajan
>            Assignee: Ethan Guo
>            Priority: Minor
>              Labels: pull-request-available
>             Fix For: 0.5.1
>
>          Time Spent: 10m
>  Remaining Estimate: 0h
>
> DeltaStreamer does not support pulling CSV data from sources (HDFS log files/Kafka). This ticket is to provide support for CSV sources.


