Posted to dev@apex.apache.org by "Chaitanya (JIRA)" <ji...@apache.org> on 2016/07/18 11:08:20 UTC

[jira] [Created] (APEXMALHAR-2158) Duplication of data emitted when the Kafka Input Operator(0.8 version) redeploys

Chaitanya created APEXMALHAR-2158:
-------------------------------------

             Summary: Duplication of data emitted when the Kafka Input Operator(0.8 version) redeploys
                 Key: APEXMALHAR-2158
                 URL: https://issues.apache.org/jira/browse/APEXMALHAR-2158
             Project: Apache Apex Malhar
          Issue Type: Bug
            Reporter: Chaitanya
            Assignee: Chaitanya


Failure recovery in the Kafka Input Operator depends on the "offsetStats" map.
This map is updated with the message's offset whenever the operator emits a message.

When the operator goes down and comes back up, it sets this map on the consumer, which starts reading from these offsets. Because each stored offset is that of the last emitted message, the consumer reads that message again and the operator emits it a second time.

Solution: Increment the offsets before setting them on the consumer.
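
The proposed fix can be illustrated with a minimal, standalone sketch. It is not the actual Malhar patch: the class name OffsetRecoverySketch, the method nextOffsetsToRead, and the use of plain integer partition ids as map keys are assumptions made only for illustration. The idea is that each recovered offset is advanced by one, so the consumer resumes at the first message that has not yet been emitted.

```java
import java.util.HashMap;
import java.util.Map;

public class OffsetRecoverySketch {

    // Hypothetical helper: given the offset of the last emitted message per
    // partition (keyed here by a plain integer partition id, an assumption),
    // return the offsets at which the consumer should resume reading.
    public static Map<Integer, Long> nextOffsetsToRead(Map<Integer, Long> lastEmittedOffsets) {
        Map<Integer, Long> resumeOffsets = new HashMap<>();
        for (Map.Entry<Integer, Long> entry : lastEmittedOffsets.entrySet()) {
            // Skip the message that was already emitted before the failure.
            resumeOffsets.put(entry.getKey(), entry.getValue() + 1);
        }
        return resumeOffsets;
    }

    public static void main(String[] args) {
        // Stand-in for the recovered offsetStats map.
        Map<Integer, Long> offsetStats = new HashMap<>();
        offsetStats.put(0, 41L);
        offsetStats.put(1, 7L);

        // These values, not the raw offsetStats, would be set on the consumer.
        System.out.println(nextOffsetsToRead(offsetStats));
    }
}
```

The sketch returns a new map rather than modifying the recovered one, so the checkpointed state still records the offsets of the messages that were actually emitted.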



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)