You are viewing a plain text version of this content. The canonical link for it is here.
Posted to jira@arrow.apache.org by "Ben Kietzman (Jira)" <ji...@apache.org> on 2021/05/03 15:36:00 UTC

[jira] [Created] (ARROW-12632) [C++][Dataset][Compute] Add support for dictionary_encode to Expression

Ben Kietzman created ARROW-12632:
------------------------------------

             Summary: [C++][Dataset][Compute] Add support for dictionary_encode to Expression
                 Key: ARROW-12632
                 URL: https://issues.apache.org/jira/browse/ARROW-12632
             Project: Apache Arrow
          Issue Type: New Feature
          Components: C++
    Affects Versions: 4.0.0
            Reporter: Ben Kietzman
            Assignee: Ben Kietzman
             Fix For: 5.0.0


dictionary_encode should be usable in the context of ExecuteScalarExpression, but is not currently supported because it requires mutable state (the hash table). Currently scanning assumes that Expression state will not be mutated so only one instance is initialized and is shared between all threads of execution. Supporting dictionary_encode will require adding support for multiple states to Expression and usage of that by dataset scans.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)