Posted to issues@arrow.apache.org by "felipecrv (via GitHub)" <gi...@apache.org> on 2023/05/24 21:36:44 UTC

[GitHub] [arrow] felipecrv opened a new issue, #35749: [C++] Handle run-end encoded filters in compute kernels

felipecrv opened a new issue, #35749:
URL: https://github.com/apache/arrow/issues/35749

   ### Describe the enhancement requested
   
   Boolean arrays (bitmaps) used to represent filters in Arrow take 1 bit per boolean value. If the filter contains long runs, it can be run-end encoded (REE) to save even more memory.
   
   Using POPCNT, a bitmap can be scanned efficiently one 64-bit word at a time, so each step covers at most 64 logical values. A run-end encoded array gives the length of each run directly, and a single run can go well beyond the word size.
   
   These two observations make the case that, for the right dataset, REE filters can be more efficiently processed in compute kernels.
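   
   As a rough illustration of the difference, the sketch below counts the selected values of a filter in both representations. It does not use Arrow's API: `ReeFilter`, `CountSelectedBitmap`, and `CountSelectedRee` are hypothetical names made up for this example, and the bitmap is assumed to be stored as whole 64-bit words with unused trailing bits set to zero.

```cpp
#include <bitset>
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical stand-in for a run-end encoded boolean filter (not an Arrow type).
// run_ends[i] is the exclusive logical end of run i; values[i] is that run's value.
struct ReeFilter {
  std::vector<int32_t> run_ends;
  std::vector<bool> values;
};

// Bitmap path: one population count per 64-bit word, so every step
// covers at most 64 logical values no matter how long the runs are.
// Assumes unused trailing bits in the last word are zero.
int64_t CountSelectedBitmap(const std::vector<uint64_t>& words) {
  int64_t count = 0;
  for (uint64_t w : words) {
    // std::bitset::count is portable; compilers typically lower it to POPCNT.
    count += static_cast<int64_t>(std::bitset<64>(w).count());
  }
  return count;
}

// REE path: one step per run; the run length is read directly from the
// difference of consecutive run ends and may exceed 64 values.
int64_t CountSelectedRee(const ReeFilter& filter) {
  assert(filter.run_ends.size() == filter.values.size());
  int64_t count = 0;
  int32_t prev_end = 0;
  for (std::size_t i = 0; i < filter.run_ends.size(); ++i) {
    if (filter.values[i]) {
      count += filter.run_ends[i] - prev_end;
    }
    prev_end = filter.run_ends[i];
  }
  return count;
}
```

   For a 128-value filter whose first 100 values are selected, the bitmap path visits two words and the REE path visits two runs. The number of words grows with the filter length, while the number of runs stays the same as long as the runs simply get longer, which is the case the paragraph above has in mind.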
   
   
   ### Component(s)
   
   C++


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscribe@arrow.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


[GitHub] [arrow] pitrou closed issue #35749: [C++] Handle run-end encoded filters in compute kernels

Posted by "pitrou (via GitHub)" <gi...@apache.org>.
pitrou closed issue #35749: [C++] Handle run-end encoded filters in compute kernels
URL: https://github.com/apache/arrow/issues/35749

