Posted to dev@pulsar.apache.org by Maneesh Bhunwal <ma...@gmail.com> on 2022/11/11 11:11:43 UTC

Direct approach for Pulsar integration with Spark

Hi Team,

There are two approaches to integrating Apache Kafka with Apache Spark
Streaming, receiver-based and direct, as described here
<https://spark.apache.org/docs/2.4.5/streaming-kafka-0-8-integration.html>
along with their pros and cons.

However, the current Pulsar Spark adapter
<https://github.com/apache/pulsar-adapters/blob/master/pulsar-spark/src/main/java/org/apache/pulsar/spark/SparkStreamingPulsarReceiver.java#L75>
only supports the receiver-based approach. This makes migration from Kafka
to Pulsar difficult, because we have to maintain an additional intermediate
store to make sure we don't lose any updates.
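
To make clear what I mean by the direct approach, as I understand it from
the Spark documentation linked above: instead of a long-running receiver
buffering messages, the driver decides for each batch which range of
positions per partition belongs to it, and the tasks then read exactly that
range, so a failed batch can be re-read from the source. Below is a minimal
sketch in plain Python with no Spark or Pulsar calls. `OffsetRange`,
`plan_batch` and `read_range` are hypothetical names used only for
illustration, and integer offsets stand in for whatever position type a
Pulsar implementation would use.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class OffsetRange:
    """Half-open range [start, end) of positions in one partition (hypothetical type)."""
    partition: int
    start: int
    end: int


def plan_batch(committed, latest, max_per_partition=None):
    """Driver side: decide which offsets each partition contributes to the next batch.

    committed: partition -> next offset not yet processed
    latest:    partition -> end offset currently available in the source
    """
    ranges = []
    for partition in sorted(latest):
        start = committed.get(partition, 0)
        end = latest[partition]
        if max_per_partition is not None:
            # Optional cap so one batch cannot grow without bound.
            end = min(end, start + max_per_partition)
        if end > start:
            ranges.append(OffsetRange(partition, start, end))
    return ranges


def read_range(log, r):
    """Task side: read exactly the planned range, so a retry sees the same data."""
    return log[r.partition][r.start:r.end]


# Simulated source: two partitions held as in-memory lists (an assumption,
# not how Pulsar topics are actually accessed).
log = {0: ["a", "b", "c"], 1: ["x", "y"]}
committed = {0: 1, 1: 0}
latest = {p: len(msgs) for p, msgs in log.items()}

batch = plan_batch(committed, latest)
records = [msg for r in batch for msg in read_range(log, r)]
```

Since the ranges are fixed before any data is read, a retried task reads the
same messages again, which is the property we currently rely on the
intermediate store for.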


Are there any plans to work on the direct approach? Was it left out
intentionally? If there are plans, I would like to contribute.


Regards,
Maneesh Bhunwal