You are viewing a plain text version of this content. The canonical link for it is here.
Posted to jira@arrow.apache.org by "Kazuaki Ishizaki (Jira)" <ji...@apache.org> on 2021/11/12 07:09:00 UTC

[jira] [Updated] (ARROW-14686) [Python] Current endian decision for numpy does not work for built-in data type.

     [ https://issues.apache.org/jira/browse/ARROW-14686?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Kazuaki Ishizaki updated ARROW-14686:
-------------------------------------
    Description: 
Although the current NumPy converter code tries to detect endian against a given object, it does not work for a built-in object. This is because a build-in object may have {{native |}} byte order.

This issue causes test failure as follows:
{code:java}
FAILED pyarrow/tests/test_array.py::test_array_from_numpy_unicode - UnicodeDecodeError: 'utf-32-le' codec can't decode bytes in position 0-3: code point not in range(0x110000)
FAILED pyarrow/tests/test_array.py::test_array_from_strided - UnicodeDecodeError: 'utf-32-le' codec can't decode bytes in position 0-3: code point not in range(0x110000)
FAILED pyarrow/tests/test_array.py::test_array_from_numpy_str_utf8 - UnicodeDecodeError: 'utf-32-le' codec can't decode bytes in position 0-3: code point not in range(0x110000) {code}
 

  was:Although the current NumPy converter code tries to detect endian against a given object, it does not work for a built-in object. This is because a build-in object may have `native |` byte order.


> [Python] Current endian decision for numpy does not work for built-in data type.
> --------------------------------------------------------------------------------
>
>                 Key: ARROW-14686
>                 URL: https://issues.apache.org/jira/browse/ARROW-14686
>             Project: Apache Arrow
>          Issue Type: Improvement
>          Components: Python
>    Affects Versions: 7.0.0
>            Reporter: Kazuaki Ishizaki
>            Assignee: Kazuaki Ishizaki
>            Priority: Major
>             Fix For: 7.0.0
>
>
> Although the current NumPy converter code tries to detect endian against a given object, it does not work for a built-in object. This is because a build-in object may have {{native |}} byte order.
> This issue causes test failure as follows:
> {code:java}
> FAILED pyarrow/tests/test_array.py::test_array_from_numpy_unicode - UnicodeDecodeError: 'utf-32-le' codec can't decode bytes in position 0-3: code point not in range(0x110000)
> FAILED pyarrow/tests/test_array.py::test_array_from_strided - UnicodeDecodeError: 'utf-32-le' codec can't decode bytes in position 0-3: code point not in range(0x110000)
> FAILED pyarrow/tests/test_array.py::test_array_from_numpy_str_utf8 - UnicodeDecodeError: 'utf-32-le' codec can't decode bytes in position 0-3: code point not in range(0x110000) {code}
>  



--
This message was sent by Atlassian Jira
(v8.20.1#820001)