You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@beam.apache.org by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2022/03/26 00:56:00 UTC

[jira] [Work logged] (BEAM-11779) Dataflow runner v2 tests occasionally fails with misconfigured sharding logic

     [ https://issues.apache.org/jira/browse/BEAM-11779?focusedWorklogId=748109&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-748109 ]

ASF GitHub Bot logged work on BEAM-11779:
-----------------------------------------

                Author: ASF GitHub Bot
            Created on: 26/Mar/22 00:55
            Start Date: 26/Mar/22 00:55
    Worklog Time Spent: 10m 
      Work Description: tvalentyn commented on a change in pull request #13986:
URL: https://github.com/apache/beam/pull/13986#discussion_r835687617



##########
File path: .test-infra/jenkins/job_LoadTests_ParDo_Python.groovy
##########
@@ -131,7 +131,8 @@ def addStreamingOptions(test) {
     // Use the new Dataflow runner, which offers improved efficiency of Dataflow jobs.
     // See https://cloud.google.com/dataflow/docs/guides/deploying-a-pipeline#dataflow-runner-v2
     // for more details.
-    experiments: 'use_runner_v2',
+    // TODO(BEAM-11779) remove shuffle_mode=appliance with runner v2 once issue is resolved.
+    experiments: 'use_runner_v2, shuffle_mode=appliance',

Review comment:
       this has no effect as it should be a separate experiment. should this change be reverted now in it's entirety? 




-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscribe@beam.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


Issue Time Tracking
-------------------

    Worklog Id:     (was: 748109)
    Time Spent: 9h  (was: 8h 50m)

> Dataflow runner v2 tests occasionally fails with misconfigured sharding logic
> -----------------------------------------------------------------------------
>
>                 Key: BEAM-11779
>                 URL: https://issues.apache.org/jira/browse/BEAM-11779
>             Project: Beam
>          Issue Type: Bug
>          Components: runner-dataflow
>            Reporter: Yichi Zhang
>            Priority: P3
>          Time Spent: 9h
>  Remaining Estimate: 0h
>
> Dataflow runner v2 test sometimes mysteriously fails data loss error message. This is due to misconfigured sharding logic from backend with combination of shuffle_mode=service and use_runner_v2 logic.
> Example failing pipeline: https://pantheon.corp.google.com/dataflow/jobs/us-central1/2021-02-08_10_20_51-4474446822883759937?project=apache-beam-testing



--
This message was sent by Atlassian Jira
(v8.20.1#820001)