Posted to issues@arrow.apache.org by "Shubham Chaudhary (JIRA)" <ji...@apache.org> on 2018/01/13 07:28:00 UTC

[jira] [Closed] (ARROW-1981) UnicodeEncodeError for column name in pandas_compat.py

     [ https://issues.apache.org/jira/browse/ARROW-1981?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Shubham Chaudhary closed ARROW-1981.
------------------------------------
    Resolution: Duplicate

> UnicodeEncodeError for column name in pandas_compat.py
> ------------------------------------------------------
>
>                 Key: ARROW-1981
>                 URL: https://issues.apache.org/jira/browse/ARROW-1981
>             Project: Apache Arrow
>          Issue Type: Bug
>          Components: Python
>    Affects Versions: 0.8.0
>            Reporter: Shubham Chaudhary
>            Priority: Minor
>              Labels: easyfix
>   Original Estimate: 0.5h
>  Remaining Estimate: 0.5h
>
> [Related code|https://github.com/apache/arrow/blob/ee134f0e4b285e17f33a56e7750836dd82a13735/python/pyarrow/pandas_compat.py#L139-#L179]
> {{
> Traceback (most recent call last):
>   File "project/folder/file.py", line 34, in <module>
>     main()
>   File "/opt/conda/lib/python2.7/site-packages/click/core.py", line 722, in __call__
>     return self.main(*args, **kwargs)
>   File "/opt/conda/lib/python2.7/site-packages/click/core.py", line 697, in main
>     rv = self.invoke(ctx)
>   File "/opt/conda/lib/python2.7/site-packages/click/core.py", line 895, in invoke
>     return ctx.invoke(self.callback, **ctx.params)
>   File "/opt/conda/lib/python2.7/site-packages/click/core.py", line 535, in invoke
>     return callback(*args, **kwargs)
>   File "project/folder/file.py", line 29, in main
>     df.to_parquet(pq_filename)
>   File "/opt/conda/lib/python2.7/site-packages/pandas/core/frame.py", line 1649, in to_parquet
>     compression=compression, **kwargs)
>   File "/opt/conda/lib/python2.7/site-packages/pandas/io/parquet.py", line 178, in to_parquet
>     return impl.write(df, path, compression=compression)
>   File "/opt/conda/lib/python2.7/site-packages/pandas/io/parquet.py", line 74, in write
>     table = self.api.Table.from_pandas(df)
>   File "pyarrow/table.pxi", line 875, in pyarrow.lib.Table.from_pandas (/arrow/python/build/temp.linux-x86_64-2.7/lib.cxx:44927)
>   File "/opt/conda/lib/python2.7/site-packages/pyarrow/pandas_compat.py", line 361, in dataframe_to_arrays
>     df, names, index_columns, preserve_index, types
>   File "/opt/conda/lib/python2.7/site-packages/pyarrow/pandas_compat.py", line 211, in construct_metadata
>     df.columns, column_names, df_types
>   File "/opt/conda/lib/python2.7/site-packages/pyarrow/pandas_compat.py", line 175, in get_column_metadata
>     'field_name': str(field_name),
> UnicodeEncodeError: 'ascii' codec can't encode character u'\xe9' in position 31: ordinal not in range(128)
> }}
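
The traceback ends in str(field_name), which on Python 2 implicitly encodes a unicode column name with the ASCII codec and fails on u'\xe9'. The sketch below reproduces that encoding step explicitly and shows one way a column name could be converted to text without it. It is only an illustration: to_text is a hypothetical helper, not a pyarrow function, the UTF-8 assumption for byte strings is mine, and this is not necessarily the change made under the issue this one duplicates.

```python
# -*- coding: utf-8 -*-
import sys

# Text type: `unicode` on Python 2, `str` on Python 3.
text_type = str if sys.version_info[0] >= 3 else unicode  # noqa: F821


def to_text(name):
    """Hypothetical helper: return a column name as text without ASCII-encoding it."""
    if isinstance(name, bytes):
        # Assumption: byte-string names are UTF-8 encoded.
        return name.decode('utf-8')
    # Non-string names (e.g. integers) are converted to their text form.
    return text_type(name)


column = u'caf\xe9'  # contains u'\xe9', the character named in the traceback

# What str() does implicitly on Python 2: encode with the ASCII codec.
try:
    column.encode('ascii')
    ascii_failed = False
except UnicodeEncodeError:
    ascii_failed = True

# Converting to the text type keeps the non-ASCII character intact.
safe_name = to_text(column)
```

With a helper along these lines, 'field_name': to_text(field_name) would keep the accented character instead of raising, on both Python 2 and Python 3.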


