Posted to issues@beam.apache.org by "Kevin (Jira)" <ji...@apache.org> on 2021/09/09 22:02:00 UTC

[jira] [Commented] (BEAM-1438) The default behavior for the Write transform doesn't work well with the Dataflow streaming runner

    [ https://issues.apache.org/jira/browse/BEAM-1438?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17412862#comment-17412862 ] 

Kevin commented on BEAM-1438:
-----------------------------

Hello, are there any updates on this bug? We have a project that really needs to set the number of shards to 0 on the Dataflow streaming runner. It would be much appreciated if this issue could be resolved. Thanks!

> The default behavior for the Write transform doesn't work well with the Dataflow streaming runner
> -------------------------------------------------------------------------------------------------
>
>                 Key: BEAM-1438
>                 URL: https://issues.apache.org/jira/browse/BEAM-1438
>             Project: Beam
>          Issue Type: Bug
>          Components: runner-dataflow
>            Reporter: Reuven Lax
>            Priority: P3
>             Fix For: 2.5.0
>
>          Time Spent: 4h 20m
>  Remaining Estimate: 0h
>
> If a Write specifies 0 output shards, that implies the runner should pick an appropriate sharding. The default behavior is to write one shard per input bundle. This works well with the Dataflow batch runner, but not with the streaming runner, which produces large numbers of small bundles.
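
To illustrate the description above, here is a minimal sketch of why one shard per input bundle can be a problem in streaming. It is a toy model, not Beam or Dataflow code: the function name `shards_written`, its parameters, and the bundle sizes are all hypothetical and chosen only for illustration.

```python
from typing import List


def shards_written(bundle_sizes: List[int], num_shards: int = 0) -> int:
    """Return how many output shards a write would produce in this toy model.

    Hypothetical model, not Beam's implementation:
    - num_shards == 0: one shard per non-empty input bundle (the default
      behavior described in the issue).
    - num_shards > 0: elements are regrouped into a fixed number of shards.
    """
    if num_shards < 0:
        raise ValueError("num_shards must be >= 0")
    non_empty = sum(1 for size in bundle_sizes if size > 0)
    if num_shards == 0:
        return non_empty
    # With explicit sharding, the shard count no longer depends on bundling.
    return num_shards if non_empty > 0 else 0


# Same total of 1,000,000 elements, bundled differently (made-up sizes).
batch_like = [250_000] * 4        # a few large bundles
streaming_like = [100] * 10_000   # many small bundles

print(shards_written(batch_like))                    # a handful of shards
print(shards_written(streaming_like))                # one shard per tiny bundle
print(shards_written(streaming_like, num_shards=10)) # fixed shard count
```

In this model the same amount of data yields far more output shards when it arrives in many small bundles, which is the behavior the issue attributes to the streaming runner. Setting an explicit shard count avoids that, at the cost of the runner no longer choosing the sharding itself.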



--
This message was sent by Atlassian Jira
(v8.3.4#803005)