Posted to commits@beam.apache.org by "Pablo Estrada (JIRA)" <ji...@apache.org> on 2018/08/02 18:32:00 UTC

[jira] [Resolved] (BEAM-3042) Add tracking of bytes read / time spent when reading side inputs

     [ https://issues.apache.org/jira/browse/BEAM-3042?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Pablo Estrada resolved BEAM-3042.
---------------------------------
       Resolution: Fixed
    Fix Version/s: 2.6.0

This has been fixed.

> Add tracking of bytes read / time spent when reading side inputs
> ----------------------------------------------------------------
>
>                 Key: BEAM-3042
>                 URL: https://issues.apache.org/jira/browse/BEAM-3042
>             Project: Beam
>          Issue Type: Bug
>          Components: sdk-py-core
>            Reporter: Pablo Estrada
>            Assignee: Pablo Estrada
>            Priority: Major
>             Fix For: 2.6.0
>
>          Time Spent: 7h 10m
>  Remaining Estimate: 0h
>
> It is difficult for Dataflow users to understand how modifying a pipeline or data set can affect how much inter-transform IO is used in their job. The intent of this feature request is to help users understand how side inputs behave when they are consumed.
> This will allow users to see how much time their pipeline spends, and how much data it transfers, when reading from and writing to inter-transform IO. Users will also be able to modify their pipelines and see how those changes affect these IO metrics.
> For further information, please review the internal Google doc go/insights-transform-io-design-doc.
