You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@airflow.apache.org by "Sujay Mansingh (Jira)" <ji...@apache.org> on 2019/09/02 11:48:00 UTC
[jira] [Created] (AIRFLOW-5380) Allow creation of big query dataset
that doesn't fail if dataset already exists
Sujay Mansingh created AIRFLOW-5380:
---------------------------------------
Summary: Allow creation of big query dataset that doesn't fail if dataset already exists
Key: AIRFLOW-5380
URL: https://issues.apache.org/jira/browse/AIRFLOW-5380
Project: Apache Airflow
Issue Type: Improvement
Components: gcp
Affects Versions: 1.10.4
Reporter: Sujay Mansingh
At the moment: BigQueryCreateEmptyDatasetOperator will create a dataset but if a dataset with that id already exists it will fail.
This is not ideal. We have a use case where we need to ensure that the dataset exists. I.e. check and if the dataset doesn't exist then create it, otherwise do nothing.
At the moment we've had to add our own subclass of BigQueryCreateEmptyDatasetOperator that checks for the dataset first, but it'd be really useful if BigQueryCreateEmptyDatasetOperator supported an 'ignore_existing' argument or something similar.
--
This message was sent by Atlassian Jira
(v8.3.2#803003)