You are viewing a plain text version of this content. The canonical link for it is here.
Posted to github@arrow.apache.org by "felipecrv (via GitHub)" <gi...@apache.org> on 2023/04/10 13:50:16 UTC

[GitHub] [arrow] felipecrv opened a new pull request, #35001: GH-32107: [C++] Create Filter Kernels for REE arrays and or filters

felipecrv opened a new pull request, #35001:
URL: https://github.com/apache/arrow/pull/35001

   ### Rationale for this change
   
   Run-end encoded arrays can very compactly represent filters with repeated values and run-end encoded arrays can be filtered efficiently without inflating all the runs for filtering. This PR implements algorithms that make it possible to leverage these possibilities.
   
   ### What changes are included in this PR?
   
   - [x] REE x REE  filters kernels (for the same types supported in `run_end_encode`)
   - [ ] Plain values x REE filter
   - [ ] REE values x Plain filter (boolean bitmap)
   
   ### Are these changes tested?
   
   - [x] Very basic tests
   - [ ] TODO: More tests
   
   ### Are there any user-facing changes?
   
   Users don't control which filter kernels get picked directly, so there isn't a user-facing API area yet.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscribe@arrow.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


[GitHub] [arrow] alippai commented on pull request #35001: GH-32107: [C++] Create Filter Kernels for REE arrays and or filters

Posted by "alippai (via GitHub)" <gi...@apache.org>.
alippai commented on PR #35001:
URL: https://github.com/apache/arrow/pull/35001#issuecomment-1546742869

   @felipecrv do you have any anecdotal benchmark numbers? 


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscribe@arrow.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


[GitHub] [arrow] github-actions[bot] commented on pull request #35001: GH-32107: [C++] Create Filter Kernels for REE arrays and or filters

Posted by "github-actions[bot] (via GitHub)" <gi...@apache.org>.
github-actions[bot] commented on PR #35001:
URL: https://github.com/apache/arrow/pull/35001#issuecomment-1546698630

   :warning: GitHub issue #32107 **has been automatically assigned in GitHub** to PR creator.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscribe@arrow.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


[GitHub] [arrow] github-actions[bot] commented on pull request #35001: GH-32107: [C++] Create Filter Kernels for REE arrays and or filters

Posted by "github-actions[bot] (via GitHub)" <gi...@apache.org>.
github-actions[bot] commented on PR #35001:
URL: https://github.com/apache/arrow/pull/35001#issuecomment-1501834237

   :warning: GitHub issue #32107 **has been automatically assigned in GitHub** to PR creator.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscribe@arrow.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


[GitHub] [arrow] felipecrv commented on pull request #35001: GH-32107: [C++] Create Filter Kernels for REE arrays and or filters

Posted by "felipecrv (via GitHub)" <gi...@apache.org>.
felipecrv commented on PR #35001:
URL: https://github.com/apache/arrow/pull/35001#issuecomment-1548034762

   > @felipecrv do you have any anecdotal benchmark numbers?
   
   For PlainxREE, only the intuition that for a sufficiently run-end-compressed filter it will be faster than scanning a full bitmap.
   
   But I will be writing benchmarks for the 3 combinations when I complete the unit tests.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscribe@arrow.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


[GitHub] [arrow] github-actions[bot] commented on pull request #35001: GH-32107: [C++] Create Filter Kernels for REE arrays and or filters

Posted by "github-actions[bot] (via GitHub)" <gi...@apache.org>.
github-actions[bot] commented on PR #35001:
URL: https://github.com/apache/arrow/pull/35001#issuecomment-1501834205

   * Closes: #32107


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscribe@arrow.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


[GitHub] [arrow] github-actions[bot] commented on pull request #35001: GH-32107: [C++] Create Filter Kernels for REE arrays and or filters

Posted by "github-actions[bot] (via GitHub)" <gi...@apache.org>.
github-actions[bot] commented on PR #35001:
URL: https://github.com/apache/arrow/pull/35001#issuecomment-1504232277

   :warning: GitHub issue #32107 **has been automatically assigned in GitHub** to PR creator.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscribe@arrow.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org