You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@hudi.apache.org by "Taher Koitawala (Jira)" <ji...@apache.org> on 2019/09/19 12:27:00 UTC

[jira] [Commented] (HUDI-246) Apache Pulsar data source for Hudi

    [ https://issues.apache.org/jira/browse/HUDI-246?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16933328#comment-16933328 ] 

Taher Koitawala commented on HUDI-246:
--------------------------------------

Hi [~vinoth] explored a little on this. AFAIK Pulsar only has a SparkStreaming connector. However, there is a custom one written out there **[https://github.com/streamnative/pulsar-spark] which will work as per our needs. I'm not really sure what we should do on this for now.

> Apache Pulsar data source for Hudi
> ----------------------------------
>
>                 Key: HUDI-246
>                 URL: https://issues.apache.org/jira/browse/HUDI-246
>             Project: Apache Hudi (incubating)
>          Issue Type: New Feature
>          Components: deltastreamer
>            Reporter: Taher Koitawala
>            Priority: Major
>
> [Apache Pulsar|https://pulsar.apache.org/en/] is a pub/sub messaging system like Kafka, with a lot of new features like multiple subscription modes, out of the box service discovery etc. The goal here is to add Pulsar as a data source to DeltaStreamer. To get started please follow [Pulsar adaptor for Apache Spark|https://pulsar.apache.org/docs/en/adaptors-spark/]



--
This message was sent by Atlassian Jira
(v8.3.4#803005)