You are viewing a plain text version of this content. The canonical link for it is here.
Posted to github@arrow.apache.org by GitBox <gi...@apache.org> on 2022/04/05 20:03:06 UTC

[GitHub] [arrow-datafusion] alamb opened a new issue, #2165: Optimize InList implementation with native types rather than ScalarValue

alamb opened a new issue, #2165:
URL: https://github.com/apache/arrow-datafusion/issues/2165

   The basic idea is that instead of using `ScalarValue` which has runtime dispatch overhead, we could use native rust types, like `HashSet<u32>`, etc. I think that could be a couple times faster.
   
   
   _Originally posted by @Dandandan in https://github.com/apache/arrow-datafusion/pull/2156#discussion_r841898112_


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscribe@arrow.apache.org.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


[GitHub] [arrow-datafusion] tustvold closed issue #2165: Optimize InList implementation with native types rather than ScalarValue

Posted by GitBox <gi...@apache.org>.
tustvold closed issue #2165: Optimize InList implementation with native types rather than ScalarValue
URL: https://github.com/apache/arrow-datafusion/issues/2165


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscribe@arrow.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


[GitHub] [arrow-datafusion] Dandandan commented on issue #2165: Optimize InList implementation with native types rather than ScalarValue

Posted by GitBox <gi...@apache.org>.
Dandandan commented on issue #2165:
URL: https://github.com/apache/arrow-datafusion/issues/2165#issuecomment-1089292408

   In addition to higher memory usage and dispatching overhead there are two extra sources of overhead:
   
   * Having to convert all values from array items to `ScalarValue`
   * Hashing a `Scalarvalue` is slower than hashing a native type.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscribe@arrow.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org