Posted to dev@kafka.apache.org by "Matthias J. Sax (JIRA)" <ji...@apache.org> on 2016/10/20 18:34:58 UTC
[jira] [Created] (KAFKA-4325) Improve processing of late records for window operations
Matthias J. Sax created KAFKA-4325:
--------------------------------------
Summary: Improve processing of late records for window operations
Key: KAFKA-4325
URL: https://issues.apache.org/jira/browse/KAFKA-4325
Project: Kafka
Issue Type: Improvement
Components: streams
Reporter: Matthias J. Sax
Assignee: Guozhang Wang
Priority: Minor
Windows are kept until their retention time has passed. If a late-arriving record is processed that is older than any retained window, a new window is created containing only this record, the aggregate is computed, and the window is immediately discarded afterward (as it is older than the retention time).
This behavior might cause problems for downstream applications, as the original window aggregate might be overwritten with the single-record aggregate of the late record. Thus, we should rather not process the late-arriving record in this case.
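To make the problem concrete, here is a minimal, self-contained sketch of the behavior described above. It is a toy model in plain Java, not the Kafka Streams implementation: the class name, the count aggregation, the window size, and the retention value are all assumptions chosen for illustration.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.TreeMap;

// Hypothetical toy model of a windowed count; not Kafka Streams code.
class LateRecordOverwriteDemo {
    static final long WINDOW_SIZE_MS = 10; // assumed window size
    static final long RETENTION_MS = 30;   // assumed retention time

    // windowStart -> count, for windows still within retention
    final TreeMap<Long, Long> windows = new TreeMap<>();
    // every update sent downstream, formatted as "windowStart=count"
    final List<String> emitted = new ArrayList<>();
    long streamTime = 0; // highest record timestamp seen so far

    void process(long timestamp) {
        streamTime = Math.max(streamTime, timestamp);
        long windowStart = timestamp - (timestamp % WINDOW_SIZE_MS);
        // A missing window is (re)created here, even if it already expired.
        long count = windows.merge(windowStart, 1L, Long::sum);
        emitted.add(windowStart + "=" + count);
        // Discard all windows that started before the retention horizon.
        windows.headMap(streamTime - RETENTION_MS, false).clear();
    }

    public static void main(String[] args) {
        LateRecordOverwriteDemo demo = new LateRecordOverwriteDemo();
        for (long ts : new long[] {1, 2, 3, 100, 5}) {
            demo.process(ts);
        }
        // prints [0=1, 0=2, 0=3, 100=1, 0=1]
        System.out.println(demo.emitted);
    }
}
```

In this model, the record with timestamp 5 arrives after window 0 has been discarded, so downstream first sees the count 3 for window 0 and later sees it replaced by 1.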
However, data loss might not be acceptable for all use cases. To enable users to avoid losing any data, window operators should allow registering a handler function that is called instead of just dropping the late-arriving record.
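One possible shape for such a handler is sketched below in plain Java. This is only an illustration of the idea, not a proposed Kafka Streams API: the class name, the constructor parameters, and the use of `BiConsumer<String, Long>` as the handler type are all hypothetical.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.TreeMap;
import java.util.function.BiConsumer;

// Hypothetical sketch of a windowed count with a late-record handler; not Kafka Streams code.
class LateRecordHandlerSketch {
    final long windowSizeMs;
    final long retentionMs;
    // Hypothetical handler signature: receives (value, timestamp) of each late record.
    final BiConsumer<String, Long> lateRecordHandler;
    // windowStart -> count, for windows still within retention
    final TreeMap<Long, Long> windows = new TreeMap<>();
    long streamTime = 0; // highest record timestamp seen so far

    LateRecordHandlerSketch(long windowSizeMs, long retentionMs,
                            BiConsumer<String, Long> lateRecordHandler) {
        this.windowSizeMs = windowSizeMs;
        this.retentionMs = retentionMs;
        this.lateRecordHandler = lateRecordHandler;
    }

    void process(String value, long timestamp) {
        streamTime = Math.max(streamTime, timestamp);
        long windowStart = timestamp - (timestamp % windowSizeMs);
        if (windowStart < streamTime - retentionMs) {
            // The target window is past retention: do not recreate it,
            // hand the record to the user-supplied handler instead.
            lateRecordHandler.accept(value, timestamp);
            return;
        }
        windows.merge(windowStart, 1L, Long::sum);
        // Discard all windows that started before the retention horizon.
        windows.headMap(streamTime - retentionMs, false).clear();
    }

    public static void main(String[] args) {
        List<String> sideOutput = new ArrayList<>();
        LateRecordHandlerSketch op = new LateRecordHandlerSketch(
                10, 30, (value, ts) -> sideOutput.add(value + "@" + ts));
        op.process("a", 1);
        op.process("b", 2);
        op.process("c", 100);
        op.process("late", 5); // older than any retained window
        System.out.println(op.windows);  // prints {100=1}
        System.out.println(sideOutput);  // prints [late@5]
    }
}
```

A handler of this kind could, for example, write late records to a separate topic or log them, so that the application decides what to do with them rather than losing them silently.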