You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@beam.apache.org by "Thomas Groh (JIRA)" <ji...@apache.org> on 2017/04/19 17:54:41 UTC

[jira] [Assigned] (BEAM-2007) DataflowRunner drops Reads with no consumers

     [ https://issues.apache.org/jira/browse/BEAM-2007?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Thomas Groh reassigned BEAM-2007:
---------------------------------

    Assignee:     (was: Thomas Groh)

This is fixed in the Java Dataflow runner. The Python Dataflow runner will have to perform a similar change.

> DataflowRunner drops Reads with no consumers
> --------------------------------------------
>
>                 Key: BEAM-2007
>                 URL: https://issues.apache.org/jira/browse/BEAM-2007
>             Project: Beam
>          Issue Type: Bug
>          Components: runner-dataflow
>            Reporter: Daniel Halperin
>             Fix For: First stable release
>
>
> Basically, if a pipeline has "just" a Read with no consumers, the optimizer in Dataflow will drop it. To preserve Beam semantics, we do want to run the Read and drop its output, e.g., because the Read may have side effects that we're testing for.
> Is it possible with pipeline surgery to find such Reads and add an Identity ParDo to them?



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)