You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@arrow.apache.org by "Shubham Chaudhary (JIRA)" <ji...@apache.org> on 2018/01/10 10:17:00 UTC
[jira] [Created] (ARROW-1981) UnicodeEncodeError in column name
Shubham Chaudhary created ARROW-1981:
----------------------------------------
Summary: UnicodeEncodeError in column name
Key: ARROW-1981
URL: https://issues.apache.org/jira/browse/ARROW-1981
Project: Apache Arrow
Issue Type: Bug
Components: Python
Affects Versions: 0.8.0
Reporter: Shubham Chaudhary
Priority: Minor
[Related code|https://github.com/apache/arrow/blob/ee134f0e4b285e17f33a56e7750836dd82a13735/python/pyarrow/pandas_compat.py#L139-#L179]
{{Traceback (most recent call last):
File "project/folder/file.py", line 34, in <module>
main()
File "/opt/conda/lib/python2.7/site-packages/click/core.py", line 722, in __call__
return self.main(*args, **kwargs)
File "/opt/conda/lib/python2.7/site-packages/click/core.py", line 697, in main
rv = self.invoke(ctx)
File "/opt/conda/lib/python2.7/site-packages/click/core.py", line 895, in invoke
return ctx.invoke(self.callback, **ctx.params)
File "/opt/conda/lib/python2.7/site-packages/click/core.py", line 535, in invoke
return callback(*args, **kwargs)
File "project/folder/file.py", line 29, in main
df.to_parquet(pq_filename)
File "/opt/conda/lib/python2.7/site-packages/pandas/core/frame.py", line 1649, in to_parquet
compression=compression, **kwargs)
File "/opt/conda/lib/python2.7/site-packages/pandas/io/parquet.py", line 178, in to_parquet
return impl.write(df, path, compression=compression)
File "/opt/conda/lib/python2.7/site-packages/pandas/io/parquet.py", line 74, in write
table = self.api.Table.from_pandas(df)
File "pyarrow/table.pxi", line 875, in pyarrow.lib.Table.from_pandas (/arrow/python/build/temp.linux-x86_64-2.7/lib.cxx:44927)
File "/opt/conda/lib/python2.7/site-packages/pyarrow/pandas_compat.py", line 361, in dataframe_to_arrays
df, names, index_columns, preserve_index, types
File "/opt/conda/lib/python2.7/site-packages/pyarrow/pandas_compat.py", line 211, in construct_metadata
df.columns, column_names, df_types
File "/opt/conda/lib/python2.7/site-packages/pyarrow/pandas_compat.py", line 175, in get_column_metadata
'field_name': str(field_name),
UnicodeEncodeError: 'ascii' codec can't encode character u'\xe9' in position 31: ordinal not in range(128)
}}
--
This message was sent by Atlassian JIRA
(v6.4.14#64029)