Posted to issues@arrow.apache.org by "Andy Grove (Jira)" <ji...@apache.org> on 2020/10/07 23:06:00 UTC

[jira] [Created] (ARROW-10226) [Rust] [DataFusion] TPC-H query 1 no longer completes for 100GB dataset

Andy Grove created ARROW-10226:
----------------------------------

             Summary: [Rust] [DataFusion] TPC-H query 1 no longer completes for 100GB dataset
                 Key: ARROW-10226
                 URL: https://issues.apache.org/jira/browse/ARROW-10226
             Project: Apache Arrow
          Issue Type: Bug
          Components: Rust, Rust - DataFusion
            Reporter: Andy Grove
            Assignee: Andy Grove
             Fix For: 2.0.0


I re-installed my desktop a few days ago, and when I try to run the TPC-H benchmark, it never completes and eventually uses up all 64 GB of RAM.

I can run Spark against the data set and the query completes in 24 seconds, which IIRC is how long it took before.

It is possible that something is odd in my environment, but it is also possible/likely that this is a real bug.

I am investigating this and will update the Jira once I know more.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)