You are viewing a plain text version of this content. The canonical link for it is here.
Posted to github@arrow.apache.org by GitBox <gi...@apache.org> on 2021/05/12 17:25:02 UTC

[GitHub] [arrow-datafusion] Dandandan opened a new issue #331: Implement vectorized hashing for dictionary types

Dandandan opened a new issue #331:
URL: https://github.com/apache/arrow-datafusion/issues/331


   **Is your feature request related to a problem or challenge? Please describe what you are trying to do.**
   Currently only primitive types and string types are supported as dictionary keys.
   For hash join and hash aggregate we need to hash values.
   
   **Describe the solution you'd like**
   Implement dictionary type in `create_hashes` function.
   
   **Describe alternatives you've considered**
   n/a
   
   **Additional context**
   Dictionary types are supported for group by now via `create_key_for_col`. There is an explicit check on using hash repartitioning for dictionary keys in the physical planner. This check can be removed when dictionaries are supported in  `create_hashes`.
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org