You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@airflow.apache.org by GitBox <gi...@apache.org> on 2023/01/08 19:04:09 UTC

[GitHub] [airflow] VladaZakharova commented on pull request #28564: GCSToBigQueryOperator allows autodetect None and infers schema

VladaZakharova commented on PR #28564:
URL: https://github.com/apache/airflow/pull/28564#issuecomment-1374904744

   Hi Team :)
   As for the changes in [#28284](https://github.com/apache/airflow/pull/28284) -  all the changes for autodetect=True field were also tested for the JSON format files. 
   I was also trying to reproduce the error from @apallerlamudi, but didn't see any errors. The dag i was using:
   ```
   load_json = GCSToBigQueryOperator(
           task_id="gcs_to_bigquery_example_date_json",
           bucket="cloud-samples-data",
           source_objects="bigquery/us-states/us-states.json",
           source_format="NEWLINE_DELIMITED_JSON",
           destination_project_dataset_table="dataset.table",
           write_disposition="WRITE_TRUNCATE",
           allow_quoted_newlines=True,
           encoding='UTF-8',
           autodetect=True,
       )
   ```
   And the result:
   ![image](https://user-images.githubusercontent.com/80038284/211214045-47e016fa-a3c9-47bc-adb4-1338a0b82664.png)
   
   Can you please provide more information for us about the data you are trying to store? May be check the version of Airflow?


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@airflow.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org