Posted to dev@arrow.apache.org by "Mitar (JIRA)" <ji...@apache.org> on 2018/03/05 19:55:00 UTC

[jira] [Created] (ARROW-2264) Efficiently serialize numpy arrays with dtype of unicode fixed length string

Mitar created ARROW-2264:
----------------------------

             Summary: Efficiently serialize numpy arrays with dtype of unicode fixed length string
                 Key: ARROW-2264
                 URL: https://issues.apache.org/jira/browse/ARROW-2264
             Project: Apache Arrow
          Issue Type: Improvement
    Affects Versions: 0.8.0
            Reporter: Mitar


Looking at the numpy array serialization code, it seems that an array with a dtype like "<U3" goes through the custom ndarray serializer and not through the efficient one.

Example:

>>> np.array(['aaa', 'bbb'])
array(['aaa', 'bbb'], dtype='<U3')

This should be able to work, no? Such an array has fixed offsets and a fixed memory layout.
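
To illustrate the fixed-layout point, here is a minimal sketch using only numpy. It is not Arrow's serializer; the helper names to_raw and from_raw are hypothetical and exist only for this example. It assumes that the dtype string, the shape, and the raw bytes are enough to describe such an array, which is what a fixed item size implies.

```python
import numpy as np

def to_raw(arr):
    """Hypothetical helper: describe an array by dtype string, shape, and raw bytes."""
    arr = np.ascontiguousarray(arr)  # guarantee one contiguous buffer
    return arr.dtype.str, arr.shape, arr.tobytes()

def from_raw(dtype_str, shape, buf):
    """Hypothetical helper: rebuild the array from the pieces produced by to_raw."""
    return np.frombuffer(buf, dtype=np.dtype(dtype_str)).reshape(shape)

arr = np.array(['aaa', 'bbb'])

# Each element occupies the same number of bytes (4 bytes per code point),
# so element i starts at byte offset i * itemsize.
print(arr.dtype.kind, arr.dtype.itemsize)

dtype_str, shape, buf = to_raw(arr)
restored = from_raw(dtype_str, shape, buf)
print(restored)
```

Because every element has the same width, no per-element pickling or offset table is needed for the round trip; whether Arrow's fast path can reuse this layout directly is the open question in this issue.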



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)