Posted to issues@storm.apache.org by "Sachin Goyal (JIRA)" <ji...@apache.org> on 2018/12/06 03:56:00 UTC
[jira] [Created] (STORM-3298) Add Cassandra-Spout for bulk and
streaming reads from Cassandra
Sachin Goyal created STORM-3298:
-----------------------------------
Summary: Add Cassandra-Spout for bulk and streaming reads from Cassandra
Key: STORM-3298
URL: https://issues.apache.org/jira/browse/STORM-3298
Project: Apache Storm
Issue Type: New Feature
Reporter: Sachin Goyal
Cassandra is a very important data source, and a frequent need is to update a secondary data sink (such as Kafka, Solr, Elasticsearch, or an RDBMS) by pulling data from Cassandra.
Both kinds of reads in the [lambda-architecture|https://en.wikipedia.org/wiki/Lambda_architecture] are required to support Cassandra as a data source in Storm:
- Batch-processing
- Stream-processing
For comparison with other stream-processing engines:
- Spark has [spark-cassandra-connector|https://github.com/datastax/spark-cassandra-connector]
- NiFi has [QueryCassandra|https://www.nifi.rocks/apache-nifi-processors/#QueryCassandra]
- Beam has [CassandraIO|https://beam.apache.org/releases/javadoc/2.2.0/org/apache/beam/sdk/io/cassandra/CassandraIO.html]
In a similar spirit, Storm should also have a Cassandra spout that can do both batch and stream processing.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)