You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@beam.apache.org by "Nahuel Lofeudo (JIRA)" <ji...@apache.org> on 2019/01/26 00:32:00 UTC
[jira] [Created] (BEAM-6511) AbstractGlobalCombineFn hierarchy is
inconsistent
Nahuel Lofeudo created BEAM-6511:
------------------------------------
Summary: AbstractGlobalCombineFn hierarchy is inconsistent
Key: BEAM-6511
URL: https://issues.apache.org/jira/browse/BEAM-6511
Project: Beam
Issue Type: Bug
Components: runner-dataflow
Affects Versions: 2.9.0
Reporter: Nahuel Lofeudo
Assignee: Tyler Akidau
Subclasses of AbstractGlobalCombineFn seem to be arranged in a way that prevents them from being used with the DataflowRunner.
Subclasses of AbstractGlobalCombineFn are under either CombineFn or CombineFnWithContext, which seems to be in itself a CombineFn which has access to PipelineOptions and Side Inputs.
However, the DataflowRunner casts all combiners passed from user code to CombineFn (see [1]) which prevents combiners that extend CombineFnWithContext from being used there.
For example:
public class CustomCombinerFn extends CombineWithContext.CombineFnWithContext<...> \{...}
final PCollectionView<SomeObject<String>> newCollection = oldCollection
.apply("Custom Combiner", Combine.globally(new CustomCombinerFn(filter))
.withSideInputs(filter)
.withoutDefaults()
.asSingletonView());
IMHO either CombineFnWithContext should be a subclass of CombineFn or DataflowRunner should cast the combiner to AbstractGlobalCombineFn.
[1] https://github.com/apache/beam/blob/b83b302ef97767e4ca245ea24e8bd40a6692e72c/runners/google-cloud-dataflow-java/src/main/java/org/apache/beam/runners/dataflow/DataflowRunner.java#L514
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)