You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@airflow.apache.org by "Omid Vahdaty (JIRA)" <ji...@apache.org> on 2019/08/06 07:22:00 UTC

[jira] [Created] (AIRFLOW-5118) Airflow DataprocClusterCreateOperator does not currently support setting optional components

Omid Vahdaty created AIRFLOW-5118:
-------------------------------------

             Summary: Airflow DataprocClusterCreateOperator does not currently support setting optional components
                 Key: AIRFLOW-5118
                 URL: https://issues.apache.org/jira/browse/AIRFLOW-5118
             Project: Apache Airflow
          Issue Type: New Feature
          Components: operators
    Affects Versions: 1.10.3
            Reporter: Omid Vahdaty


From the source code of the DataprocClusterCreateOperator[1], the only software configs that can be set are the imageVersion and the properties. As the Zeppelin component needs to be set through softwareConfig optionalComponents[2], the DataprocClusterCreateOperator does not currently support setting optional components. 

As a workaround for the time being, you could create your clusters by directly using the gcloud command rather than the DataprocClusterCreateOperator . Using the Airflow BashOperator[4], you can execute gcloud commands that create your Dataproc cluster with the required optional components. 

[1] [https://github.com/apache/airflow/blob/master/airflow/contrib/operators/dataproc_operator.py] 
[2] [https://cloud.google.com/dataproc/docs/reference/rest/v1/ClusterConfig#softwareconfig] 
 
[3] [https://airflow.apache.org/howto/operator/bash.html] 



--
This message was sent by Atlassian JIRA
(v7.6.14#76016)