Posted to issues@beam.apache.org by "Alexey Romanenko (Jira)" <ji...@apache.org> on 2021/03/16 18:44:00 UTC

[jira] [Updated] (BEAM-8934) Store&Read offset with KafkaIO

     [ https://issues.apache.org/jira/browse/BEAM-8934?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Alexey Romanenko updated BEAM-8934:
-----------------------------------
    Status: Open  (was: Triage Needed)

> Store&Read offset with KafkaIO
> ------------------------------
>
>                 Key: BEAM-8934
>                 URL: https://issues.apache.org/jira/browse/BEAM-8934
>             Project: Beam
>          Issue Type: New Feature
>          Components: io-java-kafka
>            Reporter: jiefeng zheng
>            Priority: P3
>              Labels: KafkaIO, apache, beam, offset
>
> When creating a pipeline with KafkaIO, I want to be able to specify the starting offset for consumption and, when processing each message later, to get that message's offset so that it can be stored in a relational database or NoSQL store.
>  
> This feature is needed to implement exactly-once semantics for Spark Streaming consumption.
>  
> The "Your own data store" section of the following page describes how to achieve exactly-once semantics with Spark Streaming:
> http://spark.apache.org/docs/latest/streaming-kafka-0-10-integration.html
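The pattern requested above (start reading at a stored offset, then save each result together with its offset) can be illustrated with a minimal sketch. It does not use KafkaIO or the Kafka client: the `Store` class is a hypothetical in-memory stand-in for a transactional database, a `List` stands in for one topic partition, and the list index plays the role of the offset. All names here are assumptions made for illustration only.

```java
import java.util.ArrayList;
import java.util.List;

/** Sketch of the "store results and offsets together" pattern; not KafkaIO code. */
public class OffsetStoreSketch {

    /** Hypothetical stand-in for a transactional store (relational DB / NoSQL). */
    public static final class Store {
        private final List<String> results = new ArrayList<>();
        private long nextOffset = 0L; // offset of the first record not yet processed

        /** Saves one result and advances the stored offset in a single atomic step. */
        public synchronized void commit(long offset, String result) {
            if (offset < nextOffset) {
                return; // replayed record that was already processed: skip it
            }
            if (offset > nextOffset) {
                throw new IllegalStateException("gap before offset " + offset);
            }
            results.add(result);
            nextOffset = offset + 1;
        }

        public synchronized long nextOffset() {
            return nextOffset;
        }

        public synchronized List<String> results() {
            return new ArrayList<>(results);
        }
    }

    /**
     * Processes records starting at the offset read back from the store.
     * Stops after maxRecords records to simulate a crash or restart.
     */
    public static void consume(List<String> partition, Store store, int maxRecords) {
        long start = store.nextOffset();
        long end = Math.min(partition.size(), start + maxRecords);
        for (long offset = start; offset < end; offset++) {
            String value = partition.get((int) offset);
            store.commit(offset, value.toUpperCase());
        }
    }

    public static void main(String[] args) {
        List<String> partition = List.of("a", "b", "c", "d");
        Store store = new Store();
        consume(partition, store, 2);  // first run stops after two records
        consume(partition, store, 10); // restart resumes at stored offset 2
        System.out.println(store.results() + " next=" + store.nextOffset());
    }
}
```

Because the result and the offset change in the same step, a restart neither skips nor reprocesses a record. In a real pipeline the equivalent would be a single database transaction that writes both, which is the approach the linked Spark page describes for stores that support transactions.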


