You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@airflow.apache.org by GitBox <gi...@apache.org> on 2020/05/11 02:44:53 UTC

[GitHub] [airflow] mik-laj commented on pull request #8809: [AIRFLOW-6294] Create guide for Dataflow operators

mik-laj commented on pull request #8809:
URL: https://github.com/apache/airflow/pull/8809#issuecomment-626441074


   In my opinion, we need to add some information.
   * new section which will describe the ways to run a pipeline. Currently, the pipeline can be started using a local running executable (DataflowCreateJavaJobOperator, DataflowCreatePythonJobOperator) or Dataflow Template (DataflowTemplatedJobStartOperator) or via KubernetesPodOperator). A description that describes the differences between these methods would be useful.
   * A new section that will describe the differences between asynchronic and blocking execution modes. You should not use blocking pipelines, because it causes a background process to run, which supervises the execution of the job, This is not positive for two reasons - increase consumption of resources and prevents Airflow supervision over the job
   * information that Python operators allow you to specify the Apache Beam version, but if no version is specified, the local version from the environment will be used
   
   


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org