You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@arrow.apache.org by "Jörn Horstmann (Jira)" <ji...@apache.org> on 2020/10/08 22:01:00 UTC
[jira] [Created] (ARROW-10240) [Rust] [Datafusion] Optionally load
tpch data into memory before running benchmark query
Jörn Horstmann created ARROW-10240:
--------------------------------------
Summary: [Rust] [Datafusion] Optionally load tpch data into memory before running benchmark query
Key: ARROW-10240
URL: https://issues.apache.org/jira/browse/ARROW-10240
Project: Apache Arrow
Issue Type: Improvement
Components: Rust, Rust - DataFusion
Reporter: Jörn Horstmann
The tpch benchmark runtime seems to be dominated by csv parsing code and it is really difficult to see any performance hotspots related to actual query execution in a flamegraph.
With the date in memory and more iterations it should be easier to profile and find bottlenecks.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)