You are viewing a plain text version of this content. The canonical link for it is here.

Posted to issues@storm.apache.org by "Sachin Goyal (JIRA)" <ji...@apache.org> on 2018/12/06 03:58:00 UTC

[jira] [Commented] (STORM-3298) Add Cassandra-Spout for bulk and streaming reads from Cassandra

    [ https://issues.apache.org/jira/browse/STORM-3298?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16710964#comment-16710964 ] 

Sachin Goyal commented on STORM-3298:
-------------------------------------

We have developed a Cassandra-Spout for batch and stream processing that we will be contributing shortly.

This issue is meant to help with that contribution

> Add Cassandra-Spout for bulk and streaming reads from Cassandra
> ---------------------------------------------------------------
>
>                 Key: STORM-3298
>                 URL: https://issues.apache.org/jira/browse/STORM-3298
>             Project: Apache Storm
>          Issue Type: New Feature
>            Reporter: Sachin Goyal
>            Priority: Major
>
> Cassandra is a very important data source and a frequent need is to update a secondary data-sink (like Kafka, Solr, Elastic-Search or an RDBMS) by pulling data from Cassandra.
> Both kind of reads in [lambda-architecture|https://en.wikipedia.org/wiki/Lambda_architecture] are required to support Cassandra as a data-source in Storm.
>  - Batch-processing
>  - Stream-processing
> For comparison with other stream-processing engines:
> - Spark has [spark-cassandra-connector|https://github.com/datastax/spark-cassandra-connector]
> - Nifi has [QueryCassandra|https://www.nifi.rocks/apache-nifi-processors/#QueryCassandra]
> - Beam has [CassandraIO|https://beam.apache.org/releases/javadoc/2.2.0/org/apache/beam/sdk/io/cassandra/CassandraIO.html]
> In the similar spirit, Storm should have a Cassandra Spout too which can do both batch and stream processing



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)