You are viewing a plain text version of this content. The canonical link for it is here.
Posted to github@arrow.apache.org by GitBox <gi...@apache.org> on 2022/03/01 14:48:14 UTC

[GitHub] [arrow-datafusion] yjshen commented on pull request #1849: Introduce JIT code generation

yjshen commented on pull request #1849:
URL: https://github.com/apache/arrow-datafusion/pull/1849#issuecomment-1055522926


   > I just had a chance to review this module. Very cool stuff @yjshen 🏅 .. I would be (very) interested in helping push the JIT feature along.
   > 
   > I think using this JIT and the row format to speed up sorting / merging would be a very interesting project and quite relevant to IOx (and thus I could justify spending non trivial time on it). Perhaps I can take a swag at creating some benchmarks or something to kick off the process?
   
   Yes, please go ahead. I have just done some code research on DuckDB's radix sort based on the sort keys in raw bytes format. I think it's great to implement and benchmark the performance here.  [1](https://github.com/duckdb/duckdb/blob/master/src/include/duckdb/common/sort/sort.hpp#L27-L49) [2](https://github.com/duckdb/duckdb/blob/master/src/common/sort/radix_sort.cpp#L274) and [3](https://github.com/duckdb/duckdb/blob/master/src/common/radix.cpp#L22-L77) are worth checking if you want to try out the DuckDB way we've discussed in https://github.com/apache/arrow-datafusion/issues/1708#issuecomment-1029014217
   
   
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscribe@arrow.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org