You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@arrow.apache.org by "Wes McKinney (JIRA)" <ji...@apache.org> on 2019/02/06 04:13:00 UTC

[jira] [Updated] (ARROW-1741) [C++] Comparison function for DictionaryArray to determine if indices are "compatible"

     [ https://issues.apache.org/jira/browse/ARROW-1741?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Wes McKinney updated ARROW-1741:
--------------------------------
    Fix Version/s: 0.14.0

> [C++] Comparison function for DictionaryArray to determine if indices are "compatible"
> --------------------------------------------------------------------------------------
>
>                 Key: ARROW-1741
>                 URL: https://issues.apache.org/jira/browse/ARROW-1741
>             Project: Apache Arrow
>          Issue Type: New Feature
>          Components: C++
>            Reporter: Wes McKinney
>            Priority: Major
>             Fix For: 0.14.0
>
>
> For example, if one array's dictionary is larger than the other, but the overlapping beginning portion is the same, then the respective dictionary indices correspond to the same values. Therefore, in analytics, one may choose to drop the smaller dictionary in favor of the larger dictionary, and this need not incur any computational overhead (beyond comparing the dictionary prefixes -- there may be some way to engineer "dictionary lineage" to make this comparison even cheaper)



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)