Posted to github@beam.apache.org by "rafalpotempa (via GitHub)" <gi...@apache.org> on 2023/07/14 07:51:18 UTC

[GitHub] [beam] rafalpotempa opened a new issue, #27501: [Bug]: Direct Runner sometimes rises `RuntimeError: dictionary changed size during iteration`

rafalpotempa opened a new issue, #27501:
URL: https://github.com/apache/beam/issues/27501

   ### What happened?
   
   Apache Beam: 2.46.0
   `Direct Runner`
   
   We run some parts of our pipelines on the Direct Runner, since those parts only process small batches of data. The rest run on the Dataflow runner, which has some auto-recovery. On the Direct Runner, this error fails the run and triggers alerts for our SRE team.
   
   I couldn't reproduce the issue myself; it is non-deterministic.
   Part of our company's proprietary system sometimes (not very often, roughly once in ~100 runs) fails with:
   > apache_beam/runners/worker/sdk_worker.py in shutdown_inactive_bundle_processors at line 585
   ```log
   Exception in thread Thread-37:
   Traceback (most recent call last):
     File "/usr/local/lib/python3.9/threading.py", line 980, in _bootstrap_inner
       self.run()
     File "/usr/local/lib/python3.9/site-packages/sentry_sdk/integrations/threading.py", line 72, in run
       reraise(*_capture_exception())
     File "/usr/local/lib/python3.9/site-packages/sentry_sdk/_compat.py", line 60, in reraise
       raise value
     File "/usr/local/lib/python3.9/site-packages/sentry_sdk/integrations/threading.py", line 70, in run
       return old_run_func(self, *a, **kw)
     File "/usr/local/lib/python3.9/site-packages/apache_beam/runners/worker/data_plane.py", line 228, in run
       self._function(*self._args, **self._kwargs)
     File "/usr/local/lib/python3.9/site-packages/apache_beam/runners/worker/sdk_worker.py", line 585, in shutdown_inactive_bundle_processors
       for descriptor_id, last_access_time in self.last_access_times.items():
   RuntimeError: dictionary changed size during iteration
   ```
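
For context, the message in the traceback above is what CPython raises when a dictionary gains or loses keys while a loop is still iterating over it. The following is a minimal single-threaded sketch of that failure mode, not a reproduction of the Beam code path: the dictionary name mirrors `last_access_times` from the traceback, but the keys and the helper function are hypothetical, and the insert inside the loop only stands in for what a second thread might do.

```python
import time

# Hypothetical stand-in for the `last_access_times` mapping in the traceback.
last_access_times = {"descriptor-a": time.time()}


def iterate_while_inserting(d):
    """Iterate over `d` while adding a key, as a concurrent writer might."""
    try:
        for descriptor_id, last_access_time in d.items():
            # Simulates another thread registering a new descriptor mid-loop.
            d["descriptor-b"] = time.time()
    except RuntimeError as exc:
        # The iterator notices the size change on its next step.
        return str(exc)
    return None


error_message = iterate_while_inserting(last_access_times)
print(error_message)
```

In the reported case the writer would presumably be another thread, which would explain why the failure only shows up occasionally.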
   
   ### Issue Priority
   
   Priority: 2 (default / most bugs should be filed as P2)
   
   ### Issue Components
   
   - [X] Component: Python SDK
   - [ ] Component: Java SDK
   - [ ] Component: Go SDK
   - [ ] Component: Typescript SDK
   - [ ] Component: IO connector
   - [ ] Component: Beam examples
   - [ ] Component: Beam playground
   - [ ] Component: Beam katas
   - [ ] Component: Website
   - [ ] Component: Spark Runner
   - [ ] Component: Flink Runner
   - [ ] Component: Samza Runner
   - [ ] Component: Twister2 Runner
   - [ ] Component: Hazelcast Jet Runner
   - [ ] Component: Google Cloud Dataflow Runner


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscribe@beam.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


Re: [I] [Bug]: Direct Runner sometimes rises `RuntimeError: dictionary changed size during iteration` [beam]

Posted by "lrakla (via GitHub)" <gi...@apache.org>.
lrakla commented on issue #27501:
URL: https://github.com/apache/beam/issues/27501#issuecomment-1774487763

   .take-issue




[GitHub] [beam] Anneysha7 commented on issue #27501: [Bug]: Direct Runner sometimes rises `RuntimeError: dictionary changed size during iteration`

Posted by "Anneysha7 (via GitHub)" <gi...@apache.org>.
Anneysha7 commented on issue #27501:
URL: https://github.com/apache/beam/issues/27501#issuecomment-1646686945

   Hi! It's my first time contributing and I would love to give this a shot!




Re: [I] [Bug]: Direct Runner sometimes rises `RuntimeError: dictionary changed size during iteration` [beam]

Posted by "Mr-Magnificent (via GitHub)" <gi...@apache.org>.
Mr-Magnificent commented on issue #27501:
URL: https://github.com/apache/beam/issues/27501#issuecomment-1753180204

   .take-issue




[GitHub] [beam] tvalentyn commented on issue #27501: [Bug]: Direct Runner sometimes rises `RuntimeError: dictionary changed size during iteration`

Posted by "tvalentyn (via GitHub)" <gi...@apache.org>.
tvalentyn commented on issue #27501:
URL: https://github.com/apache/beam/issues/27501#issuecomment-1721533521

   Hi @Anneysha7, I'd like to have this issue fixed in the next version of Beam, especially since it should be a simple fix. I suggest we unassign it for now in case someone else has time to take a look. No worries, you can always pick up this or any other issue when you have time to work on Beam.




Re: [I] [Bug]: Direct Runner sometimes rises `RuntimeError: dictionary changed size during iteration` [beam]

Posted by "Mr-Magnificent (via GitHub)" <gi...@apache.org>.
Mr-Magnificent commented on issue #27501:
URL: https://github.com/apache/beam/issues/27501#issuecomment-1753179629

   Hey, I would like to give this a go. It might take some time, since this would be my first time contributing to Apache Beam. I'll assign it to myself; feel free to reassign if the criticality requires a faster fix, @tvalentyn. Thanks.




[GitHub] [beam] tvalentyn commented on issue #27501: [Bug]: Direct Runner sometimes rises `RuntimeError: dictionary changed size during iteration`

Posted by "tvalentyn (via GitHub)" <gi...@apache.org>.
tvalentyn commented on issue #27501:
URL: https://github.com/apache/beam/issues/27501#issuecomment-1648743719

   great, thanks a lot, @Anneysha7 




[GitHub] [beam] Anneysha7 commented on issue #27501: [Bug]: Direct Runner sometimes rises `RuntimeError: dictionary changed size during iteration`

Posted by "Anneysha7 (via GitHub)" <gi...@apache.org>.
Anneysha7 commented on issue #27501:
URL: https://github.com/apache/beam/issues/27501#issuecomment-1646687050

   .takeissue




[GitHub] [beam] tvalentyn commented on issue #27501: [Bug]: Direct Runner sometimes rises `RuntimeError: dictionary changed size during iteration`

Posted by "tvalentyn (via GitHub)" <gi...@apache.org>.
tvalentyn commented on issue #27501:
URL: https://github.com/apache/beam/issues/27501#issuecomment-1668811225

   Hey @Anneysha7, just wondering if you are still interested or need help. Thanks!




Re: [I] [Bug]: Direct Runner sometimes rises `RuntimeError: dictionary changed size during iteration` [beam]

Posted by "Mr-Magnificent (via GitHub)" <gi...@apache.org>.
Mr-Magnificent commented on issue #27501:
URL: https://github.com/apache/beam/issues/27501#issuecomment-1763907185

   I realised I am having bandwidth issues. In case the issue is still open in a couple of weeks, I'd like to give it a try.




[GitHub] [beam] tvalentyn commented on issue #27501: [Bug]: Direct Runner sometimes rises `RuntimeError: dictionary changed size during iteration`

Posted by "tvalentyn (via GitHub)" <gi...@apache.org>.
tvalentyn commented on issue #27501:
URL: https://github.com/apache/beam/issues/27501#issuecomment-1636477052

   We should be able to fix this by adding a lock, the same way as here: https://github.com/apache/beam/blob/41e6628498c6ebdfc2932c61b68bcbde4da1a47e/sdks/python/apache_beam/runners/worker/sdk_worker.py#L470
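
As a rough illustration of the lock-based approach suggested here, the sketch below guards every read and write of a shared access-time dictionary with a single `threading.Lock`. It is not Beam's implementation: the class `AccessTimeTracker`, its methods, the `now` parameter, and `run_demo` are all hypothetical names chosen for this example, and only the general idea (hold a lock while iterating and while mutating) comes from the comment above.

```python
import threading
import time


class AccessTimeTracker:
    """Hypothetical stand-in for a cache that tracks last-access times."""

    def __init__(self):
        self._lock = threading.Lock()
        self._last_access_times = {}

    def touch(self, descriptor_id, now=None):
        """Record an access; `now` can be injected to make tests deterministic."""
        with self._lock:
            self._last_access_times[descriptor_id] = (
                time.time() if now is None else now)

    def evict_inactive(self, max_idle_seconds, now=None):
        """Remove and return the ids that have been idle for too long."""
        now = time.time() if now is None else now
        with self._lock:
            # Iteration happens under the lock, so writers cannot resize
            # the dictionary while this loop is running.
            stale = [
                descriptor_id
                for descriptor_id, last_access in self._last_access_times.items()
                if now - last_access > max_idle_seconds
            ]
            for descriptor_id in stale:
                del self._last_access_times[descriptor_id]
        return stale


def run_demo(num_keys=10_000):
    """Insert keys from one thread while the caller's thread scans the dict."""
    tracker = AccessTimeTracker()

    def write_keys():
        for i in range(num_keys):
            tracker.touch(f"descriptor-{i}")

    writer = threading.Thread(target=write_keys)
    writer.start()
    while writer.is_alive():
        tracker.evict_inactive(max_idle_seconds=3600)
    writer.join()
    return len(tracker._last_access_times)
```

An alternative that avoids holding the lock during the scan would be to iterate over a snapshot such as `list(d.items())`; whether that is sufficient depends on how the surrounding code uses the dictionary, which this thread does not spell out.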




Re: [I] [Bug]: Direct Runner sometimes rises `RuntimeError: dictionary changed size during iteration` [beam]

Posted by "lrakla (via GitHub)" <gi...@apache.org>.
lrakla commented on issue #27501:
URL: https://github.com/apache/beam/issues/27501#issuecomment-1774489282

   Hi, I have set up my dev environment and will give it a try. It is my first time contributing too, so I am grateful for any guidance. For now, I'll add a lock in the required section.




Re: [I] [Bug]: Direct Runner sometimes rises `RuntimeError: dictionary changed size during iteration` [beam]

Posted by "tvalentyn (via GitHub)" <gi...@apache.org>.
tvalentyn commented on issue #27501:
URL: https://github.com/apache/beam/issues/27501#issuecomment-1775535529

   @lrakla Sounds like a plan, thanks for your help. We also have a dev wiki at https://s.apache.org/beam-python-dev-wiki, which you might find helpful.
   




[GitHub] [beam] Anneysha7 commented on issue #27501: [Bug]: Direct Runner sometimes rises `RuntimeError: dictionary changed size during iteration`

Posted by "Anneysha7 (via GitHub)" <gi...@apache.org>.
Anneysha7 commented on issue #27501:
URL: https://github.com/apache/beam/issues/27501#issuecomment-1672278132

   Hi, sorry for the late reply. I'm still interested! I'll get back to you ASAP if I need help.
   
   

