Posted to commits@beam.apache.org by "Thomas Groh (JIRA)" <ji...@apache.org> on 2017/03/15 00:53:41 UTC
[jira] [Created] (BEAM-1725) SparkRunner should deduplicate when an
UnboundedSource requires Deduping
Thomas Groh created BEAM-1725:
---------------------------------
Summary: SparkRunner should deduplicate when an UnboundedSource requires Deduping
Key: BEAM-1725
URL: https://issues.apache.org/jira/browse/BEAM-1725
Project: Beam
Issue Type: Bug
Components: runner-spark
Reporter: Thomas Groh
The SparkRunner implementation of an unbounded Read does not check the source's requiresDeduping() method, and as a result it does not deduplicate records from sources that require it.
https://github.com/apache/beam/blob/master/runners/spark/src/main/java/org/apache/beam/runners/spark/io/SparkUnboundedSource.java
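For illustration, the expected behavior can be sketched roughly as below. This is a minimal, standalone sketch and not the SparkRunner code: IdentifiedRecord, DedupSketch and readValues are hypothetical names that do not exist in Beam, and the requiresDeduping flag is passed in as a plain boolean rather than read from a real UnboundedSource. It also assumes all records fit in one in-memory list, whereas a real runner would need to keep the set of seen record ids as state across micro-batches and expire it eventually.

```java
import java.nio.ByteBuffer;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

// Hypothetical stand-in for a record emitted by an unbounded reader:
// a payload plus the opaque record id the source assigns to it.
final class IdentifiedRecord {
    final byte[] recordId;
    final String value;

    IdentifiedRecord(String recordId, String value) {
        this.recordId = recordId.getBytes(StandardCharsets.UTF_8);
        this.value = value;
    }
}

// Hypothetical helper showing the check this issue asks for.
final class DedupSketch {
    // Returns the record values in order. When requiresDeduping is true,
    // only the first record seen for each record id is kept.
    static List<String> readValues(List<IdentifiedRecord> records, boolean requiresDeduping) {
        List<String> out = new ArrayList<>();
        // ByteBuffer compares by content, so it is usable as a set key for byte[] ids.
        Set<ByteBuffer> seenIds = new HashSet<>();
        for (IdentifiedRecord record : records) {
            if (!requiresDeduping || seenIds.add(ByteBuffer.wrap(record.recordId))) {
                out.add(record.value);
            }
        }
        return out;
    }
}
```

With the flag set to false every record passes through unchanged, which matches the behavior described in this issue. With it set to true, a redelivered record with an already-seen id is dropped.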
--
This message was sent by Atlassian JIRA
(v6.3.15#6346)