You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@beam.apache.org by "Siyuan Chen (Jira)" <ji...@apache.org> on 2021/04/19 16:41:00 UTC

[jira] [Updated] (BEAM-12040) WriteFiles withRunnerDeterminedSharding for unbounded data doesn't work with session windows

     [ https://issues.apache.org/jira/browse/BEAM-12040?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Siyuan Chen updated BEAM-12040:
-------------------------------
    Summary: WriteFiles withRunnerDeterminedSharding for unbounded data doesn't work with session windows  (was: WriteFiles withRunnerDeterminedShardingUnbounded doesn't work with session windows)

> WriteFiles withRunnerDeterminedSharding for unbounded data doesn't work with session windows
> --------------------------------------------------------------------------------------------
>
>                 Key: BEAM-12040
>                 URL: https://issues.apache.org/jira/browse/BEAM-12040
>             Project: Beam
>          Issue Type: Improvement
>          Components: io-java-files
>            Reporter: Siyuan Chen
>            Priority: P2
>          Time Spent: 4.5h
>  Remaining Estimate: 0h
>
> Currently the implementation of `withRunnerDeterminedShardingUnbounded` uses a stateful DoFn to achieve the grouping and batching of the input elements, which doesn't support session windows. One possible way is to add another GBK prior to the stateful DoFn to first get session windows merged and reify the window before invoking the sateful DoFn. 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)