You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@arrow.apache.org by "Antoine Pitrou (JIRA)" <ji...@apache.org> on 2018/12/11 15:32:00 UTC

[jira] [Created] (ARROW-3997) [C++] [Doc] Clarify dictionary encoding integer signedness (and width?)

Antoine Pitrou created ARROW-3997:
-------------------------------------

             Summary: [C++] [Doc] Clarify dictionary encoding integer signedness (and width?)
                 Key: ARROW-3997
                 URL: https://issues.apache.org/jira/browse/ARROW-3997
             Project: Apache Arrow
          Issue Type: Improvement
          Components: C++, Documentation, Format
    Affects Versions: 0.11.1
            Reporter: Antoine Pitrou


The Arrow spec states that a dictionary-encoded array uses int32 indices. Signed or unsigned? The spec doesn't say.

Also, the C++ implementation supports all kinds of integers as indices (8- to 64-bit, signed and unsigned). I wonder if we should at least mandate a specific signedness.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)