You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@beam.apache.org by "Stas Levin (JIRA)" <ji...@apache.org> on 2016/08/23 12:58:20 UTC
[jira] [Updated] (BEAM-579) Integrate NamedAggregators into Spark's
sink system
[ https://issues.apache.org/jira/browse/BEAM-579?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Stas Levin updated BEAM-579:
----------------------------
Priority: Major (was: Critical)
> Integrate NamedAggregators into Spark's sink system
> ---------------------------------------------------
>
> Key: BEAM-579
> URL: https://issues.apache.org/jira/browse/BEAM-579
> Project: Beam
> Issue Type: Task
> Components: runner-spark
> Reporter: Stas Levin
> Assignee: Amit Sela
>
> At the moment {{NamedAggregators}} is an adapter between Beam's {{Aggregator}} and Spark's {{Accumulator}} and is implemented as a single Spark {{Accumulator}}, holding a map of metrics that can be augmented with new metrics dynamically, after the the pipeline has already started.
> Spark's out-of-the-box metrics mechanism does not support adding metrics to {{Source}} s that have already registered (it pulls their metrics upon registration and never updates them again).
> In light of the above, it would seem that there is a gap to bridge between the dynamic nature of {{NamedAggregators}} and Spark's current metric system so that metrics that are added dynamically are also reported to the defined Spark {{Sink}} s.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)