You are viewing a plain text version of this content. The canonical link for it is here.

Posted to issues@storm.apache.org by "Sachin Goyal (JIRA)" <ji...@apache.org> on 2018/12/06 03:56:00 UTC

[jira] [Created] (STORM-3298) Add Cassandra-Spout for bulk and streaming reads from Cassandra

Sachin Goyal created STORM-3298:
-----------------------------------

             Summary: Add Cassandra-Spout for bulk and streaming reads from Cassandra
                 Key: STORM-3298
                 URL: https://issues.apache.org/jira/browse/STORM-3298
             Project: Apache Storm
          Issue Type: New Feature
            Reporter: Sachin Goyal


Cassandra is a very important data source and a frequent need is to update a secondary data-sink (like Kafka, Solr, Elastic-Search or an RDBMS) by pulling data from Cassandra.

Both kind of reads in [lambda-architecture|https://en.wikipedia.org/wiki/Lambda_architecture] are required to support Cassandra as a data-source in Storm.
 - Batch-processing
 - Stream-processing

For comparison with other stream-processing engines:
- Spark has [spark-cassandra-connector|https://github.com/datastax/spark-cassandra-connector]
- Nifi has [QueryCassandra|https://www.nifi.rocks/apache-nifi-processors/#QueryCassandra]
- Beam has [CassandraIO|https://beam.apache.org/releases/javadoc/2.2.0/org/apache/beam/sdk/io/cassandra/CassandraIO.html]

In the similar spirit, Storm should have a Cassandra Spout too which can do both batch and stream processing



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)