You are viewing a plain text version of this content. The canonical link for it is here.

Posted to commits@beam.apache.org by "Stas Levin (JIRA)" <ji...@apache.org> on 2016/08/23 09:08:21 UTC

[jira] [Created] (BEAM-579) Integrate NamedAggregators into Spark's sink system

Stas Levin created BEAM-579:
-------------------------------

             Summary: Integrate NamedAggregators into Spark's sink system
                 Key: BEAM-579
                 URL: https://issues.apache.org/jira/browse/BEAM-579
             Project: Beam
          Issue Type: Task
          Components: runner-spark
            Reporter: Stas Levin
            Assignee: Amit Sela
            Priority: Critical


At the moment {{NamedAggregators}} is an adapter between Beam's {{Aggregator}} and Spark's {{Accumulator}} and is implemented as a single Spark {{Accumulator}}, holding a map of metrics that can be augmented with new metrics dynamically, after the the pipeline has already started. 

Spark's out-of-the-box metrics mechanism does not support adding metrics to {{Source}} s that have already registered (it pulls their metrics upon registration and never updates them again).

In light of the above, it would seem that there is a gap to bridge between the dynamic nature of {{NamedAggregators}} and Spark's current metric system so that metrics that are added dynamically are also reported to the defined Spark {{Sink}} s.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)