Posted to issues@beam.apache.org by "Brian Hulette (Jira)" <ji...@apache.org> on 2022/03/24 16:00:00 UTC

[jira] [Comment Edited] (BEAM-14163) Performance Regressions in streaming python ParDo and GBK Load Tests

    [ https://issues.apache.org/jira/browse/BEAM-14163?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17511931#comment-17511931 ] 

Brian Hulette edited comment on BEAM-14163 at 3/24/22, 3:59 PM:
----------------------------------------------------------------

I think https://ci-beam.apache.org/job/beam_LoadTests_Python_ParDo_Dataflow_Streaming is the job that generates the ParDo numbers. It would be nice to identify the specific first run with the regression. https://ci-beam.apache.org/job/beam_LoadTests_Python_ParDo_Dataflow_Streaming/546/ was run on 03/17 at 1PM UTC, but the regression is reported on 03/17 at 5PM (no timezone given). I'm not sure there's a timezone that reconciles the two.
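
As a quick sanity check on the timezone question, here is a minimal sketch that lists the whole-hour UTC offsets at which 1PM UTC would read as 5PM local time. The year (2022, taken from the post date), the helper name matching_offsets, and the idea that the reported time marks the same instant as the start of run 546 are all assumptions for illustration, not something the dashboard confirms.

```python
from datetime import datetime, timedelta, timezone

# Start of run 546 as stated above: 03/17 at 1PM UTC.
# The year 2022 is assumed from the date of this comment.
run_utc = datetime(2022, 3, 17, 13, 0, tzinfo=timezone.utc)

# Time at which the regression is reported: 03/17 at 5PM, no timezone given.
reported_naive = datetime(2022, 3, 17, 17, 0)


def matching_offsets(run_utc, reported_naive):
    """Return whole-hour UTC offsets at which run_utc reads as reported_naive.

    Hypothetical helper: assumes both times refer to the same instant.
    """
    matches = []
    for hours in range(-12, 15):  # UTC-12 through UTC+14
        local = run_utc.astimezone(timezone(timedelta(hours=hours)))
        if local.replace(tzinfo=None) == reported_naive:
            matches.append(hours)
    return matches


print(matching_offsets(run_utc, reported_naive))
```

Under these assumptions the only match is UTC+4. If the dashboard plots something other than the run's start time (for example when the metrics were published), this comparison doesn't apply.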

If the regression started at 546, the only Python-relevant changes are https://github.com/apache/beam/commit/02d9657f68fc60bae9704eef0cc98810ea2b143f and https://github.com/apache/beam/commit/9c17960d748b56af4915a3fd1c618b470b0521c3


> Performance Regressions in streaming python ParDo and GBK Load Tests
> --------------------------------------------------------------------
>
>                 Key: BEAM-14163
>                 URL: https://issues.apache.org/jira/browse/BEAM-14163
>             Project: Beam
>          Issue Type: Bug
>          Components: community-metrics, sdk-py-core
>    Affects Versions: 2.38.0
>            Reporter: Daniel Oliveira
>            Priority: P0
>             Fix For: 2.38.0
>
>
> As specified in the [Beam Release Guide|https://beam.apache.org/contribute/release-guide/#4-investigate-performance-regressions], I'm investigating performance regressions. The following load test metrics show a clear and persistent performance regression starting around March 17 and affecting version 2.38.0.
> ParDo Load Tests: http://metrics.beam.apache.org/d/MOi-kf3Zk/pardo-load-tests?orgId=1&var-processingType=streaming&var-sdk=python
> GBK Load Tests: http://metrics.beam.apache.org/d/UYZ-oJ3Zk/gbk-load-tests?orgId=1&var-processingType=streaming&var-sdk=python&from=now-30d&to=now


