You are viewing a plain text version of this content. The canonical link for it is here.
Posted to github@beam.apache.org by "tvalentyn (via GitHub)" <gi...@apache.org> on 2023/09/21 16:33:05 UTC
[GitHub] [beam] tvalentyn commented on a diff in pull request #28564: [Python] Log dependencies at runtime and at submission environment
tvalentyn commented on code in PR #28564:
URL: https://github.com/apache/beam/pull/28564#discussion_r1333326458
##########
sdks/python/apache_beam/runners/portability/stager.py:
##########
@@ -62,6 +62,7 @@
from urllib.parse import urlparse
from packaging import version
+from pip._internal.operations import freeze
Review Comment:
let's not use internal APIs. a future upgrade to pip may break this api call in previously released versions of Beam, this happened before: https://github.com/pypa/pip/issues/5243#issuecomment-381422449 . Running `pip freeze` command line would be more reliable.
Quick search, but feel free to look more:
https://stackoverflow.com/questions/49923671/are-there-any-function-replacement-for-pip-get-installed-distributions-in-pip
##########
sdks/python/apache_beam/runners/portability/stager.py:
##########
@@ -84,6 +85,8 @@
WORKFLOW_TARBALL_FILE = 'workflow.tar.gz'
REQUIREMENTS_FILE = 'requirements.txt'
EXTRA_PACKAGES_FILE = 'extra_packages.txt'
+# Filename that stores the submission environment dependencies.
+SUBMISSION_ENV_DEPENDENCIES_FILENAME = 'submission_environment_dependencies.txt'
Review Comment:
consistency nit:
```suggestion
SUBMISSION_ENV_DEPENDENCIES_FILE = 'submission_environment_dependencies.txt'
```
##########
sdks/python/apache_beam/runners/portability/stager.py:
##########
@@ -365,6 +368,16 @@ def create_job_resources(options, # type: PipelineOptions
Stager._create_file_stage_to_artifact(
pickled_session_file, names.PICKLED_MAIN_SESSION_FILE))
+ # stage the submission environment dependencies
+ local_dependency_file_path = os.path.join(
+ temp_dir, SUBMISSION_ENV_DEPENDENCIES_FILENAME)
+ dependencies = freeze.freeze()
Review Comment:
Let's make this best effort: if for whatever reason this fails, don't fail the job submission.
You could also consider moving this portion of code into a helper since this method keeps growing (no strong opinion).
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: github-unsubscribe@beam.apache.org
For queries about this service, please contact Infrastructure at:
users@infra.apache.org