You are viewing a plain text version of this content. The canonical link for it is here.
Posted to github@beam.apache.org by "tvalentyn (via GitHub)" <gi...@apache.org> on 2023/09/21 16:33:05 UTC

[GitHub] [beam] tvalentyn commented on a diff in pull request #28564: [Python] Log dependencies at runtime and at submission environment

tvalentyn commented on code in PR #28564:
URL: https://github.com/apache/beam/pull/28564#discussion_r1333326458


##########
sdks/python/apache_beam/runners/portability/stager.py:
##########
@@ -62,6 +62,7 @@
 from urllib.parse import urlparse
 
 from packaging import version
+from pip._internal.operations import freeze

Review Comment:
   let's not use internal APIs. a future upgrade to pip may break this  api call in previously released versions of Beam, this happened before: https://github.com/pypa/pip/issues/5243#issuecomment-381422449 .  Running `pip freeze` command line would be more reliable.
   
   Quick search, but feel free to look more:
   
   https://stackoverflow.com/questions/49923671/are-there-any-function-replacement-for-pip-get-installed-distributions-in-pip
   
   



##########
sdks/python/apache_beam/runners/portability/stager.py:
##########
@@ -84,6 +85,8 @@
 WORKFLOW_TARBALL_FILE = 'workflow.tar.gz'
 REQUIREMENTS_FILE = 'requirements.txt'
 EXTRA_PACKAGES_FILE = 'extra_packages.txt'
+# Filename that stores the submission environment dependencies.
+SUBMISSION_ENV_DEPENDENCIES_FILENAME = 'submission_environment_dependencies.txt'

Review Comment:
   consistency nit: 
   ```suggestion
   SUBMISSION_ENV_DEPENDENCIES_FILE = 'submission_environment_dependencies.txt'
   ```



##########
sdks/python/apache_beam/runners/portability/stager.py:
##########
@@ -365,6 +368,16 @@ def create_job_resources(options,  # type: PipelineOptions
             Stager._create_file_stage_to_artifact(
                 pickled_session_file, names.PICKLED_MAIN_SESSION_FILE))
 
+    # stage the submission environment dependencies
+    local_dependency_file_path = os.path.join(
+        temp_dir, SUBMISSION_ENV_DEPENDENCIES_FILENAME)
+    dependencies = freeze.freeze()

Review Comment:
   Let's make this best effort: if for whatever reason this fails, don't fail the job submission.
   
   You could also consider moving this portion of code into a helper since this method keeps growing (no strong opinion).



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscribe@beam.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org