You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@arrow.apache.org by "Marcello (JIRA)" <ji...@apache.org> on 2018/02/28 01:00:59 UTC
[jira] [Commented] (ARROW-2228) [Python] Unsigned int type for
arrow Table not supported
[ https://issues.apache.org/jira/browse/ARROW-2228?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16379599#comment-16379599 ]
Marcello commented on ARROW-2228:
---------------------------------
I am pretty sure it was 0.8.0, but I'll double check tomorrow morning.
I tried to compile the latest version to test it but I have not been able.
> [Python] Unsigned int type for arrow Table not supported
> --------------------------------------------------------
>
> Key: ARROW-2228
> URL: https://issues.apache.org/jira/browse/ARROW-2228
> Project: Apache Arrow
> Issue Type: Bug
> Components: Python
> Environment: Ubuntu 16.04
> python3.6.3
> Reporter: Marcello
> Priority: Major
>
> Running this python one-liner
>
> {code:java}
> // code pa.Table.from_pandas(pd.DataFrame({'foo': [np.array([10000000000000000000], dtype=np.uint64)]}))
> {code}
> I get
> {code:java}
> // code ---------------------------------------------------------------------------
> ArrowInvalid Traceback (most recent call last)
> <ipython-input-115-cbc9670d125f> in <module>()
> ----> 1 pa.Table.from_pandas(pd.DataFrame({'foo': [np.array([10000000000000000000], dtype=np.uint64)]}))
> ~/.virtualenvs/log-archive/lib/python3.6/site-packages/pyarrow/table.pxi in pyarrow.lib.Table.from_pandas (/arrow/python/build/temp.linux-x86_64-3.6/lib.cxx:44927)()
> ~/.virtualenvs/log-archive/lib/python3.6/site-packages/pyarrow/pandas_compat.py in dataframe_to_arrays(df, schema, preserve_index, nthreads)
> 348 arrays = [convert_column(c, t)
> 349 for c, t in zip(columns_to_convert,
> --> 350 convert_types)]
> 351 else:
> 352 from concurrent import futures
> ~/.virtualenvs/log-archive/lib/python3.6/site-packages/pyarrow/pandas_compat.py in <listcomp>(.0)
> 347 if nthreads == 1:
> 348 arrays = [convert_column(c, t)
> --> 349 for c, t in zip(columns_to_convert,
> 350 convert_types)]
> 351 else:
> ~/.virtualenvs/log-archive/lib/python3.6/site-packages/pyarrow/pandas_compat.py in convert_column(col, ty)
> 343
> 344 def convert_column(col, ty):
> --> 345 return pa.array(col, from_pandas=True, type=ty)
> 346
> 347 if nthreads == 1:
> ~/.virtualenvs/log-archive/lib/python3.6/site-packages/pyarrow/array.pxi in pyarrow.lib.array (/arrow/python/build/temp.linux-x86_64-3.6/lib.cxx:29224)()
> ~/.virtualenvs/log-archive/lib/python3.6/site-packages/pyarrow/array.pxi in pyarrow.lib._ndarray_to_array (/arrow/python/build/temp.linux-x86_64-3.6/lib.cxx:28465)()
> ~/.virtualenvs/log-archive/lib/python3.6/site-packages/pyarrow/error.pxi in pyarrow.lib.check_status (/arrow/python/build/temp.linux-x86_64-3.6/lib.cxx:8270)()
> ArrowInvalid: trying to convert NumPy type int64 but got uint64
> {code}
>
> the problem possibly relies on the fact that from_pandas doesn't handle the conversion from python object to unsigned integer.
>
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)