You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@beam.apache.org by "Daniel Lescohier (JIRA)" <ji...@apache.org> on 2019/04/01 16:59:00 UTC
[jira] [Created] (BEAM-6955) Support Dataflow --sdk_location with
modified version number
Daniel Lescohier created BEAM-6955:
--------------------------------------
Summary: Support Dataflow --sdk_location with modified version number
Key: BEAM-6955
URL: https://issues.apache.org/jira/browse/BEAM-6955
Project: Beam
Issue Type: Bug
Components: runner-dataflow
Affects Versions: 2.11.0
Reporter: Daniel Lescohier
Support Dataflow --sdk_location with modified version number
Determine the version tag to use for the Google Container Registry, for the service image versions to use on the Dataflow worker nodes. Users of Dataflow may be using a locally-modified version of Apache Beam, which they submit to Dataflow with the --sdk_location option. Those users would most likely modify the version number of Apache Beam, so they can distinguish it from the public distribution of Apache Beam. However, the remote nodes in Dataflow still need to bootsrap the worker service with a Docker image that a version tag exists for.
The most appropriate way for system integrators to modify the Apache Beam version number would be to add a Local Version Identifier: https://www.python.org/dev/peps/pep-0440/#local-version-identifiers
If people only use Local Version Identifiers, then we could use the "public" attribute of the pkg_resources version object.
If people instead use a post-release version identifier: https://www.python.org/dev/peps/pep-0440/#post-releases then only the "base_version" attribute would work both of these version number changes.
Since Dataflow documentation does not specify how to modify version numbers, I am choosing to use "base_version" attribute.
Will shortly submit a PR with the change.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)