You are viewing a plain text version of this content. The canonical link for it is here.
Posted to github@arrow.apache.org by GitBox <gi...@apache.org> on 2020/12/07 21:15:11 UTC
[GitHub] [arrow] Dandandan edited a comment on pull request #8863: ARROW-10837: [Rust][DataFusion] Use `Vec` for hash join keys
Dandandan edited a comment on pull request #8863:
URL: https://github.com/apache/arrow/pull/8863#issuecomment-740183272
As a next step after this, I think it would be interesting if we can have a look at calculating the hashes on the columns instead to benefit from the columnar data layout.
Some material I found on this:
https://www.cockroachlabs.com/blog/vectorized-hash-joiner/ (simple explanation)
https://pure.uva.nl/ws/files/4321270/68049_09.pdf
Please add if you know of more/newer material!
----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
For queries about this service, please contact Infrastructure at:
users@infra.apache.org