You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@beam.apache.org by "Pablo Estrada (JIRA)" <ji...@apache.org> on 2018/08/02 18:32:00 UTC
[jira] [Resolved] (BEAM-3042) Add tracking of bytes read / time
spent when reading side inputs
[ https://issues.apache.org/jira/browse/BEAM-3042?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Pablo Estrada resolved BEAM-3042.
---------------------------------
Resolution: Fixed
Fix Version/s: 2.6.0
This has been fixed.
> Add tracking of bytes read / time spent when reading side inputs
> ----------------------------------------------------------------
>
> Key: BEAM-3042
> URL: https://issues.apache.org/jira/browse/BEAM-3042
> Project: Beam
> Issue Type: Bug
> Components: sdk-py-core
> Reporter: Pablo Estrada
> Assignee: Pablo Estrada
> Priority: Major
> Fix For: 2.6.0
>
> Time Spent: 7h 10m
> Remaining Estimate: 0h
>
> It is difficult for Dataflow users to understand how modifying a pipeline or data set can affect how much inter-transform IO is used in their job. The intent of this feature request is to help users understand how side inputs behave when they are consumed.
> This will allow users to understand how much time and how much data their pipeline uses to read/write to inter-transform IO. Users will also be able to modify their pipelines and understand how their changes affect these IO metrics.
> For further information, please review the internal Google doc go/insights-transform-io-design-doc.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)