You are viewing a plain text version of this content. The canonical link for it is here.
Posted to jira@arrow.apache.org by "Apache Arrow JIRA Bot (Jira)" <ji...@apache.org> on 2022/10/13 17:52:00 UTC

[jira] [Commented] (ARROW-12632) [C++][Dataset][Compute] Add support for dictionary_encode to Expression

    [ https://issues.apache.org/jira/browse/ARROW-12632?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17617227#comment-17617227 ] 

Apache Arrow JIRA Bot commented on ARROW-12632:
-----------------------------------------------

This issue was last updated over 90 days ago, which may be an indication it is no longer being actively worked. To better reflect the current state, the issue is being unassigned per [project policy|https://arrow.apache.org/docs/dev/developers/bug_reports.html#issue-assignment]. Please feel free to re-take assignment of the issue if it is being actively worked, or if you plan to start that work soon.

> [C++][Dataset][Compute] Add support for dictionary_encode to Expression
> -----------------------------------------------------------------------
>
>                 Key: ARROW-12632
>                 URL: https://issues.apache.org/jira/browse/ARROW-12632
>             Project: Apache Arrow
>          Issue Type: New Feature
>          Components: C++
>    Affects Versions: 4.0.0
>            Reporter: Ben Kietzman
>            Assignee: Ben Kietzman
>            Priority: Major
>              Labels: dataset
>
> dictionary_encode should be usable in the context of ExecuteScalarExpression, but is not currently supported because it requires mutable state (the hash table). Currently scanning assumes that Expression state will not be mutated so only one instance is initialized and is shared between all threads of execution. Supporting dictionary_encode will require adding support for multiple states to Expression and usage of that by dataset scans.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)