You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@storm.apache.org by "ASF GitHub Bot (JIRA)" <ji...@apache.org> on 2015/05/12 19:36:00 UTC

[jira] [Commented] (STORM-697) Support for Emitting Kafka Message Offset and Partition

    [ https://issues.apache.org/jira/browse/STORM-697?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14540305#comment-14540305 ] 

ASF GitHub Bot commented on STORM-697:
--------------------------------------

GitHub user itachi90 opened a pull request:

    https://github.com/apache/storm/pull/549

    STORM-697: allow kafkaSpout to emit message offset and partition

    

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/Symantec/storm master

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/storm/pull/549.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #549
    
----
commit 23d6b779c21ce89c3440cf86b123c5bf9a9a124c
Author: Yichi Zhang <yi...@symantec.com>
Date:   2015-05-12T17:30:57Z

    STORM-697: allow kafkaSpout to emit message offset and partition for tracking

----


> Support for Emitting Kafka Message Offset and Partition
> -------------------------------------------------------
>
>                 Key: STORM-697
>                 URL: https://issues.apache.org/jira/browse/STORM-697
>             Project: Apache Storm
>          Issue Type: Improvement
>          Components: storm-kafka
>            Reporter: Matthew Tieman
>            Assignee: Matthew Tieman
>   Original Estimate: 4h
>  Remaining Estimate: 4h
>
> It would be nice expose the offset and partition of messages consumed from Kafka to the Scheme generating the Tuples. This is useful for auditing/replaying data from arbitrary points on a Kafka topic. There are a few ways to accomplish this. 
> One is to define a new scheme, say MessageAndMetadataScheme, and add another case in KafkaUtils.generateTuples to determine if the config's schema is an instance of MessageAndMetadataScheme.
> Another is to overload KafkaUtils.generateTuples to accept and instance of MessageAndRealOffset. Next, add a flag to SpoutConfig indicating that the scheme should accept a byte array of MessageAndRealOffset, this flag will be used to determine which generateTuples method to call.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)