You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@airflow.apache.org by "Sujay Mansingh (Jira)" <ji...@apache.org> on 2019/09/02 11:48:00 UTC

[jira] [Created] (AIRFLOW-5380) Allow creation of big query dataset that doesn't fail if dataset already exists

Sujay Mansingh created AIRFLOW-5380:
---------------------------------------

             Summary: Allow creation of big query dataset that doesn't fail if dataset already exists
                 Key: AIRFLOW-5380
                 URL: https://issues.apache.org/jira/browse/AIRFLOW-5380
             Project: Apache Airflow
          Issue Type: Improvement
          Components: gcp
    Affects Versions: 1.10.4
            Reporter: Sujay Mansingh


At the moment: BigQueryCreateEmptyDatasetOperator will create a dataset but if a dataset with that id already exists it will fail.

 

This is not ideal. We have a use case where we need to ensure that the dataset exists. I.e. check and if the dataset doesn't exist then create it, otherwise do nothing.

 

At the moment we've had to add our own subclass of BigQueryCreateEmptyDatasetOperator that checks for the dataset first, but it'd be really useful if BigQueryCreateEmptyDatasetOperator supported an 'ignore_existing' argument or something similar.



--
This message was sent by Atlassian Jira
(v8.3.2#803003)