You are viewing a plain text version of this content. The canonical link for it is here.
Posted to jira@arrow.apache.org by "Andy Grove (Jira)" <ji...@apache.org> on 2020/11/14 16:01:00 UTC
[jira] [Closed] (ARROW-9707) [Rust] [DataFusion] Re-implement
threading model
[ https://issues.apache.org/jira/browse/ARROW-9707?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Andy Grove closed ARROW-9707.
-----------------------------
Resolution: Invalid
Closing this is as invalid because things have changed a lot since this was filed and we have now moved away from explicit thread management (mostly).
> [Rust] [DataFusion] Re-implement threading model
> ------------------------------------------------
>
> Key: ARROW-9707
> URL: https://issues.apache.org/jira/browse/ARROW-9707
> Project: Apache Arrow
> Issue Type: Sub-task
> Components: Rust, Rust - DataFusion
> Reporter: Andy Grove
> Assignee: Andy Grove
> Priority: Major
> Labels: pull-request-available
> Fix For: 3.0.0
>
> Attachments: image-2020-09-24-22-46-46-959.png
>
> Time Spent: 4h
> Remaining Estimate: 0h
>
> The current threading model is very simple and does not scale. We currently use 1-2 dedicated threads per partition and they all run simultaneously, which is a huge problem if you have more partitions than logical or physical cores.
> This task is to re-implement the threading model so that query execution uses a fixed (configurable) number of threads. Work will be broken down into stages and tasks and each in-process executor (running on a dedicated thread) will process its queue of tasks.
> This process will be driven by a scheduler.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)