Posted to issues@arrow.apache.org by "Shubham Chaudhary (JIRA)" <ji...@apache.org> on 2018/01/10 10:17:00 UTC

[jira] [Created] (ARROW-1981) UnicodeEncodeError in column name

Shubham Chaudhary created ARROW-1981:
----------------------------------------

             Summary: UnicodeEncodeError in column name
                 Key: ARROW-1981
                 URL: https://issues.apache.org/jira/browse/ARROW-1981
             Project: Apache Arrow
          Issue Type: Bug
          Components: Python
    Affects Versions: 0.8.0
            Reporter: Shubham Chaudhary
            Priority: Minor



[Related code|https://github.com/apache/arrow/blob/ee134f0e4b285e17f33a56e7750836dd82a13735/python/pyarrow/pandas_compat.py#L139-#L179]

{{Traceback (most recent call last):
  File "project/folder/file.py", line 34, in <module>
    main()
  File "/opt/conda/lib/python2.7/site-packages/click/core.py", line 722, in __call__
    return self.main(*args, **kwargs)
  File "/opt/conda/lib/python2.7/site-packages/click/core.py", line 697, in main
    rv = self.invoke(ctx)
  File "/opt/conda/lib/python2.7/site-packages/click/core.py", line 895, in invoke
    return ctx.invoke(self.callback, **ctx.params)
  File "/opt/conda/lib/python2.7/site-packages/click/core.py", line 535, in invoke
    return callback(*args, **kwargs)
  File "project/folder/file.py", line 29, in main
    df.to_parquet(pq_filename)
  File "/opt/conda/lib/python2.7/site-packages/pandas/core/frame.py", line 1649, in to_parquet
    compression=compression, **kwargs)
  File "/opt/conda/lib/python2.7/site-packages/pandas/io/parquet.py", line 178, in to_parquet
    return impl.write(df, path, compression=compression)
  File "/opt/conda/lib/python2.7/site-packages/pandas/io/parquet.py", line 74, in write
    table = self.api.Table.from_pandas(df)
  File "pyarrow/table.pxi", line 875, in pyarrow.lib.Table.from_pandas (/arrow/python/build/temp.linux-x86_64-2.7/lib.cxx:44927)
  File "/opt/conda/lib/python2.7/site-packages/pyarrow/pandas_compat.py", line 361, in dataframe_to_arrays
    df, names, index_columns, preserve_index, types
  File "/opt/conda/lib/python2.7/site-packages/pyarrow/pandas_compat.py", line 211, in construct_metadata
    df.columns, column_names, df_types
  File "/opt/conda/lib/python2.7/site-packages/pyarrow/pandas_compat.py", line 175, in get_column_metadata
    'field_name': str(field_name),
UnicodeEncodeError: 'ascii' codec can't encode character u'\xe9' in position 31: ordinal not in range(128)
}}
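
A minimal sketch of the failure mode the traceback points to, assuming the column name is a unicode object containing u'\xe9'. Under Python 2.7, calling str() on such an object implicitly encodes it with the ASCII codec, which is what appears to fail at pandas_compat.py line 175. The sketch mimics that with an explicit .encode('ascii') so it behaves the same on Python 2 and 3. The names column_name, py2_style_str and safe_field_name are hypothetical and not part of pyarrow, and the UTF-8 encode is only one possible workaround, not necessarily the fix adopted upstream.

```python
# Hypothetical column name containing the character from the traceback (u'\xe9').
column_name = u"caf\xe9"


def py2_style_str(value):
    """Mimic Python 2's str(unicode_obj): an implicit encode with the ASCII codec."""
    return value.encode("ascii")


def safe_field_name(value):
    """Hypothetical workaround: encode explicitly as UTF-8 instead of relying on ASCII."""
    return value.encode("utf-8")


# The ASCII encode fails on the non-ASCII character, as in the report.
try:
    py2_style_str(column_name)
    failed = False
    message = ""
except UnicodeEncodeError as exc:
    failed = True
    message = str(exc)

# The explicit UTF-8 encode succeeds and yields bytes.
encoded = safe_field_name(column_name)
```

Until this is fixed, renaming the affected columns to ASCII-only names before calling df.to_parquet() should avoid the error, at the cost of losing the original names.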


