You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@apex.apache.org by "Chaitanya (JIRA)" <ji...@apache.org> on 2016/07/18 11:08:20 UTC
[jira] [Created] (APEXMALHAR-2158) Duplication of data emitted when
the Kafka Input Operator(0.8 version) redeploys
Chaitanya created APEXMALHAR-2158:
-------------------------------------
Summary: Duplication of data emitted when the Kafka Input Operator(0.8 version) redeploys
Key: APEXMALHAR-2158
URL: https://issues.apache.org/jira/browse/APEXMALHAR-2158
Project: Apache Apex Malhar
Issue Type: Bug
Reporter: Chaitanya
Assignee: Chaitanya
Kafka Input Operator failure recovery depends on offsetStats map.
"offsetStats" map is updated whenever the message is emitted from operator.
When the operator goes down and comes back, operator sets this map to the consumer and start reading from these offsets. Consumer reads the last emitted message again and operator emits it.
Solution: Increment the offsets before setting to the consumer.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)