You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@airflow.apache.org by "Yuyin Yang (JIRA)" <ji...@apache.org> on 2018/03/03 14:34:00 UTC

[jira] [Commented] (AIRFLOW-2118) get_pandas_df does always pass a list of rows to be parsed

    [ https://issues.apache.org/jira/browse/AIRFLOW-2118?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16384677#comment-16384677 ] 

Yuyin Yang commented on AIRFLOW-2118:
-------------------------------------

[~divy] Hey, I met the same problem. Do you mind opening an PR in the repository to fix it?

> get_pandas_df does always pass a list of rows to be parsed
> ----------------------------------------------------------
>
>                 Key: AIRFLOW-2118
>                 URL: https://issues.apache.org/jira/browse/AIRFLOW-2118
>             Project: Apache Airflow
>          Issue Type: Bug
>          Components: contrib, hooks
>    Affects Versions: 1.9.0
>         Environment: pandas-gbp 0.3.1
>            Reporter: Diane Ivy
>            Priority: Minor
>              Labels: easyfix
>   Original Estimate: 1h
>  Remaining Estimate: 1h
>
> While trying to parse the pages in get_pandas_df if only one page is returned it starts popping off each row and then the gbq_parse_data works incorrectly.
> {{while len(pages) > 0:}}
> {{    page = pages.pop()}}
> {{    dataframe_list.append(gbq_parse_data(schema, page))}}
> Possible solution:
> {{from google.cloud import bigquery}}
> {{if isinstance(pages[0], bigquery.table.Row):}}
> {{    pages = [pages]}}



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)