You are viewing a plain text version of this content. The canonical link for it is here.
Posted to github@beam.apache.org by GitBox <gi...@apache.org> on 2022/06/04 16:22:50 UTC

[GitHub] [beam] damccorm opened a new issue, #20250: Duplicate PubSub subscriptions with python direct runner in Jupyter/Colab environment

damccorm opened a new issue, #20250:
URL: https://github.com/apache/beam/issues/20250

   When running a streaming pipeline on Colab with direct runner, ReadFromPubSub can retain old subscriptions and cause message duplication. For example, manually killing a cell that is running a streaming pubsub pipeline does not delete the pubsub subscription. If the cell is rerun, the ReadFromPubSub component will actually be subscribed twice which results in duplicate messages.
   
   Manually deleting old subscriptions (e.g. via the GCP dashboard) temporarily fixes the problem.
   
   This Colab notebook: [https://gist.github.com/hgarrereyn/64ce87cbbcbe9c34ccdd13eafe49e3fb](https://gist.github.com/hgarrereyn/64ce87cbbcbe9c34ccdd13eafe49e3fb) contains a runnable example of the bug.
   
   Imported from Jira [BEAM-10192](https://issues.apache.org/jira/browse/BEAM-10192). Original Jira may contain additional context.
   Reported by: hgarrereyn.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscribe@beam.apache.org.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org