You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@airflow.apache.org by "John Bodley (JIRA)" <ji...@apache.org> on 2016/06/20 16:07:05 UTC
[jira] [Created] (AIRFLOW-259) MySqlToHiveOperator incorrectly
requotes strings due to csv.writer configuration
John Bodley created AIRFLOW-259:
-----------------------------------
Summary: MySqlToHiveOperator incorrectly requotes strings due to csv.writer configuration
Key: AIRFLOW-259
URL: https://issues.apache.org/jira/browse/AIRFLOW-259
Project: Apache Airflow
Issue Type: Bug
Components: operators
Reporter: John Bodley
Priority: Minor
The MySqlToHiveOperator has the same issue as the HiveToMySqlOperator when bulk_load=True, i.e. the both use a csv.writer as an intermediate step, which mangles strings containing quotes due to the quoting logic.
For example a JSON blob in MySQL of the form
{"dashboard_id": "85"}
becomes
"{""dashboard_id"": 85}"
in Hive.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)