You are viewing a plain text version of this content. The canonical link for it is here.
Posted to github@beam.apache.org by GitBox <gi...@apache.org> on 2022/06/03 16:20:43 UTC

[GitHub] [beam] kennknowles opened a new issue, #18102: When and how can merging windows "shrink" or "grow"?

kennknowles opened a new issue, #18102:
URL: https://github.com/apache/beam/issues/18102

   The primary example of a merging window today is `Sessions` by gap duration, in which the merged window is the interval enclosure / span of the windows being merged.
   
   However, another reasonable abstract use case is a session identified by id with an explicit end event. We might consider that the session ends with no gap duration after the end event. In this case, the merged window may be smaller than the enclosure of the sub-windows. Sometimes this has been referred to as "merging shrinks the window".
   
   Perhaps the only requirement is that the merged window contains the timestamps of the data therein, but we should document this clearly. The current spec is ["Does whatever merging is necessary"](https://github.com/apache/incubator-beam/blob/master/sdks/java/core/src/main/java/org/apache/beam/sdk/transforms/windowing/WindowFn.java#L106)
   
   There are repercussions for triggers, some documented in the [trigger design doc](https://s.apache.org/beam-triggers): With nonzero allowed lateness, `Sessions` by gap duration can switch a trigger from ON_TIME or LATE behavior back to speculative behavior, or cause another ON_TIME firing. Conversely, sessions with abrupt termination/shrinking may have that behavior _as well as_ an ON_TIME and subsequent LATE firings due only to the merging (this already works properly).
   
   Imported from Jira [BEAM-654](https://issues.apache.org/jira/browse/BEAM-654). Original Jira may contain additional context.
   Reported by: kenn.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscribe@beam.apache.org.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org