You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@arrow.apache.org by "Emilio Lahr-Vivaz (JIRA)" <ji...@apache.org> on 2017/02/08 17:40:41 UTC

[jira] [Commented] (ARROW-542) [Java] Implement dictionaries in stream/file encoding

    [ https://issues.apache.org/jira/browse/ARROW-542?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15858295#comment-15858295 ] 

Emilio Lahr-Vivaz commented on ARROW-542:
-----------------------------------------

[~wesmckinn] I'm looking into how dictionary vectors will be encoded in the file format. In the current message definitions, it appears dictionary batches are distinct from regular batches, and have an ID associated with them: https://github.com/apache/arrow/blob/b99d049c3d1894908b7e52774eb657675dc1f439/format/Message.fbs#L284
Wouldn't the dictionary already be defined by the Field? I'm unclear what the ID in the DictionaryBatch is supposed to represent.
Thanks,

> [Java] Implement dictionaries in stream/file encoding
> -----------------------------------------------------
>
>                 Key: ARROW-542
>                 URL: https://issues.apache.org/jira/browse/ARROW-542
>             Project: Apache Arrow
>          Issue Type: Improvement
>          Components: Java - Vectors
>            Reporter: Emilio Lahr-Vivaz
>            Assignee: Emilio Lahr-Vivaz
>




--
This message was sent by Atlassian JIRA
(v6.3.15#6346)