Posted to commits@airflow.apache.org by "Yuyin Yang (JIRA)" <ji...@apache.org> on 2018/03/03 14:34:00 UTC
[jira] [Commented] (AIRFLOW-2118) get_pandas_df does always pass a
list of rows to be parsed
[ https://issues.apache.org/jira/browse/AIRFLOW-2118?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16384677#comment-16384677 ]
Yuyin Yang commented on AIRFLOW-2118:
-------------------------------------
[~divy] Hey, I ran into the same problem. Would you mind opening a PR in the repository to fix it?
> get_pandas_df does always pass a list of rows to be parsed
> ----------------------------------------------------------
>
> Key: AIRFLOW-2118
> URL: https://issues.apache.org/jira/browse/AIRFLOW-2118
> Project: Apache Airflow
> Issue Type: Bug
> Components: contrib, hooks
> Affects Versions: 1.9.0
> Environment: pandas-gbq 0.3.1
> Reporter: Diane Ivy
> Priority: Minor
> Labels: easyfix
> Original Estimate: 1h
> Remaining Estimate: 1h
>
> When get_pandas_df parses the pages and only one page is returned, the loop pops off individual rows instead of whole pages, so gbq_parse_data receives a single row rather than a list of rows and works incorrectly.
> {{while len(pages) > 0:}}
> {{ page = pages.pop()}}
> {{ dataframe_list.append(gbq_parse_data(schema, page))}}
> Possible solution:
> {{from google.cloud import bigquery}}
> {{if isinstance(pages[0], bigquery.table.Row):}}
> {{ pages = [pages]}}
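
A minimal sketch of the proposed guard, runnable without BigQuery installed. The names Row, parse_page, normalize_pages and collect are hypothetical stand-ins for illustration: Row plays the role of bigquery.table.Row, and parse_page plays the role of gbq_parse_data, assumed here to expect a list of rows. This is not the hook's actual code, only the same check applied to a toy version of the loop.

```python
from collections import namedtuple

# Hypothetical stand-in for google.cloud.bigquery.table.Row (assumption).
Row = namedtuple("Row", ["name", "value"])


def parse_page(page):
    # Hypothetical stand-in for gbq_parse_data: expects a list of rows.
    return [tuple(row) for row in page]


def normalize_pages(pages, row_type=Row):
    # If the first element is a row rather than a page, the result is a
    # single page that was not wrapped in a list, so wrap it.
    if pages and isinstance(pages[0], row_type):
        return [pages]
    return pages


def collect(pages):
    # Same loop shape as in the report, with the guard applied first.
    pages = normalize_pages(list(pages))
    parsed = []
    while len(pages) > 0:
        page = pages.pop()  # now always a whole page, never a single row
        parsed.append(parse_page(page))
    return parsed


single_page = [Row("a", 1), Row("b", 2)]      # one page, not wrapped
many_pages = [[Row("a", 1)], [Row("b", 2)]]   # already a list of pages

print(collect(single_page))
print(collect(many_pages))
```

Note that pop() takes pages from the end of the list, so the parsed pages come out in reverse order, as in the original loop.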
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)