Posted to jira@arrow.apache.org by "Weston Pace (Jira)" <ji...@apache.org> on 2022/03/23 21:51:00 UTC

[jira] [Created] (ARROW-16014) [C++] Create more benchmarks for measuring expression evaluation overhead

Weston Pace created ARROW-16014:
-----------------------------------

             Summary: [C++] Create more benchmarks for measuring expression evaluation overhead
                 Key: ARROW-16014
                 URL: https://issues.apache.org/jira/browse/ARROW-16014
             Project: Apache Arrow
          Issue Type: Improvement
          Components: C++
            Reporter: Weston Pace
            Assignee: Weston Pace


We have a lot of great benchmarks on individual kernels, but I don't know if we have enough benchmarks on function and expression evaluation itself.

There are some benchmarks (function_benchmark.cc) that measure this, but I would like to get a better sense of a "bytes per second" number from the function system for a trivial function (e.g. an identity function that simply returns the values as is).

In addition, we should measure the overhead of common tasks such as preallocation.

I would also like these benchmarks to be parameterized by batch size. Being able to run on small batches enables (in theory) better cache utilization, but it also means more function calls for the same amount of data. I suspect that the per-batch overhead may start to become a bottleneck at small batch sizes.
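
To make the idea concrete, here is a minimal, standard-library-only sketch of a "bytes per second" measurement for an identity function, swept over batch sizes. It does not use Arrow's function system or the Google Benchmark harness; IdentityBatch, MeasureIdentity, SweepBatchSizes and ThroughputResult are hypothetical names made up for illustration, and the plain copy loop only stands in for the real dispatch path whose overhead we want to measure.

```cpp
#include <cassert>
#include <chrono>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical stand-in for a trivial "identity" function: copies one batch
// of input values into an already allocated output of the same length.
void IdentityBatch(const std::int64_t* in, std::int64_t* out, std::size_t length) {
  for (std::size_t i = 0; i < length; ++i) out[i] = in[i];
}

struct ThroughputResult {
  std::size_t batch_size;       // values per call
  std::size_t bytes_processed;  // total input bytes pushed through
  double bytes_per_second;      // 0.0 if the clock measured no elapsed time
  std::int64_t checksum;        // sum of outputs; keeps the copy observable
};

// Pushes `total_values` int64 values through IdentityBatch in chunks of
// `batch_size` and reports the resulting throughput.
ThroughputResult MeasureIdentity(std::size_t total_values, std::size_t batch_size) {
  assert(batch_size > 0 && total_values % batch_size == 0);
  std::vector<std::int64_t> input(total_values, 1);
  // Output is preallocated once, outside the timed region (an assumption of
  // this sketch; timing the allocation would be a separate measurement).
  std::vector<std::int64_t> output(total_values, 0);

  const auto start = std::chrono::steady_clock::now();
  for (std::size_t offset = 0; offset < total_values; offset += batch_size) {
    IdentityBatch(input.data() + offset, output.data() + offset, batch_size);
  }
  const auto stop = std::chrono::steady_clock::now();

  const double seconds = std::chrono::duration<double>(stop - start).count();
  const std::size_t bytes = total_values * sizeof(std::int64_t);

  std::int64_t checksum = 0;
  for (std::int64_t v : output) checksum += v;

  const double bytes_per_second = seconds > 0.0 ? static_cast<double>(bytes) / seconds : 0.0;
  return ThroughputResult{batch_size, bytes, bytes_per_second, checksum};
}

// Runs the measurement for batch sizes min_batch, 2*min_batch, ... up to
// max_batch. `total_values` must be a multiple of every batch size tried.
std::vector<ThroughputResult> SweepBatchSizes(std::size_t total_values,
                                              std::size_t min_batch,
                                              std::size_t max_batch) {
  assert(min_batch > 0);
  std::vector<ThroughputResult> results;
  for (std::size_t batch = min_batch; batch <= max_batch; batch *= 2) {
    results.push_back(MeasureIdentity(total_values, batch));
  }
  return results;
}
```

Holding the total data volume fixed while varying the batch size means any drop in bytes per second at small batches can be attributed to per-call cost rather than to the amount of data. A single timed pass like this is noisy and an optimizing compiler may simplify the copy, so a real benchmark would rely on a proper harness for repetition and for keeping the work from being optimized away.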



--
This message was sent by Atlassian Jira
(v8.20.1#820001)