Posted to issues@arrow.apache.org by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2019/09/10 13:14:00 UTC

[jira] [Updated] (ARROW-1741) [C++] Comparison function for DictionaryArray to determine if indices are "compatible"

     [ https://issues.apache.org/jira/browse/ARROW-1741?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

ASF GitHub Bot updated ARROW-1741:
----------------------------------
    Labels: pull-request-available  (was: )

> [C++] Comparison function for DictionaryArray to determine if indices are "compatible"
> --------------------------------------------------------------------------------------
>
>                 Key: ARROW-1741
>                 URL: https://issues.apache.org/jira/browse/ARROW-1741
>             Project: Apache Arrow
>          Issue Type: New Feature
>          Components: C++
>            Reporter: Wes McKinney
>            Assignee: Benjamin Kietzman
>            Priority: Major
>              Labels: pull-request-available
>             Fix For: 0.15.0
>
>
> For example, if one array's dictionary is larger than the other's, but the overlapping leading portion is identical, then the respective dictionary indices refer to the same values. In analytics, one may therefore drop the smaller dictionary in favor of the larger one without incurring any computational overhead beyond comparing the dictionary prefixes (there may be some way to engineer "dictionary lineage" to make this comparison even cheaper).
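
The prefix check described above can be sketched in a few lines of C++. This is only an illustration under stated assumptions: dictionaries are modeled as std::vector<std::string> rather than Arrow arrays, and DictionariesPrefixCompatible is a hypothetical name, not a function from the Arrow C++ API or from the pull request attached to this issue.

```cpp
#include <algorithm>
#include <cstddef>
#include <string>
#include <vector>

// Hypothetical helper (not part of the Arrow API): returns true when the
// shorter dictionary is element-wise equal to the leading portion of the
// longer one. In that case any index valid for the shorter dictionary
// refers to the same value in the longer dictionary.
bool DictionariesPrefixCompatible(const std::vector<std::string>& a,
                                  const std::vector<std::string>& b) {
  // Only the overlapping leading portion needs to be compared.
  const std::size_t n = std::min(a.size(), b.size());
  return std::equal(a.begin(), a.begin() + n, b.begin());
}
```

If the check succeeds, the indices of the array with the smaller dictionary could be reused unchanged against the larger dictionary. The cost is linear in the size of the smaller dictionary, which is the "comparing the dictionary prefixes" overhead mentioned in the description.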


