Posted to dev@kafka.apache.org by "Niels Basjes (JIRA)" <ji...@apache.org> on 2015/12/01 10:25:10 UTC

[jira] [Created] (KAFKA-2914) Kafka Connect Source connector for HBase

Niels Basjes created KAFKA-2914:
-----------------------------------

             Summary: Kafka Connect Source connector for HBase 
                 Key: KAFKA-2914
                 URL: https://issues.apache.org/jira/browse/KAFKA-2914
             Project: Kafka
          Issue Type: New Feature
          Components: copycat
            Reporter: Niels Basjes
            Assignee: Ewen Cheslack-Postava


In many cases I see HBase being used to persist data.
I would like to listen to the changes and process them in a streaming system (like Apache Flink).

Feature request: A Kafka Connect "Source" that listens to changes in a specified HBase table. These changes are then stored in Kafka in a "standardized" form, so that the observed changes can be processed in near real time. I expect this standard form to be very HBase-specific.
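
To make the idea of a "standardized" form more concrete, here is a minimal sketch of what a single change event could look like. Nothing below is defined by this issue or by Kafka Connect: the class name, the field names (table, row, family, qualifier, timestamp, type, value) and the choice of Base64 for HBase byte arrays are all assumptions made for illustration only.

```java
import java.nio.charset.StandardCharsets;
import java.util.Base64;
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical sketch: one possible HBase-specific change event layout.
// All names here are illustrative assumptions, not an agreed-upon format.
final class HBaseChangeEventSketch {

    // The kinds of cell-level changes this sketch distinguishes.
    enum ChangeType { PUT, DELETE }

    // HBase row keys, qualifiers and values are raw bytes, so this sketch
    // Base64-encodes them to keep the event printable and lossless.
    static String b64(byte[] bytes) {
        return bytes == null ? null : Base64.getEncoder().encodeToString(bytes);
    }

    // Builds one change event as an ordered map of field name to value.
    // For a DELETE, value may be null.
    static Map<String, Object> toEvent(String table, byte[] rowKey, String family,
                                       byte[] qualifier, long timestamp,
                                       ChangeType type, byte[] value) {
        Map<String, Object> event = new LinkedHashMap<>();
        event.put("table", table);
        event.put("row", b64(rowKey));
        event.put("family", family);
        event.put("qualifier", b64(qualifier));
        event.put("timestamp", timestamp);
        event.put("type", type.name());
        event.put("value", b64(value));
        return event;
    }

    public static void main(String[] args) {
        // Sample data, invented for this example.
        Map<String, Object> event = toEvent(
                "users",
                "row-1".getBytes(StandardCharsets.UTF_8),
                "cf",
                "name".getBytes(StandardCharsets.UTF_8),
                1448965510000L,
                ChangeType.PUT,
                "Niels".getBytes(StandardCharsets.UTF_8));
        System.out.println(event);
    }
}
```

A real connector would presumably map such an event onto a Kafka Connect schema and record, and use the table plus row key as the Kafka message key so that changes to one row stay ordered within a partition. That mapping is left open here.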

Implementation suggestion: Perhaps listening to the HBase WAL like the "HBase Side Effects Processor" does?
https://github.com/NGDATA/hbase-indexer/tree/master/hbase-sep



