Posted to jira@arrow.apache.org by "Jonathan Keane (Jira)" <ji...@apache.org> on 2021/11/10 15:53:00 UTC

[jira] [Commented] (ARROW-14663) [R] Expose number of threads in read_csv_arrow() and write_csv_arrow()

    [ https://issues.apache.org/jira/browse/ARROW-14663?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17441802#comment-17441802 ] 

Jonathan Keane commented on ARROW-14663:
----------------------------------------

Could you go into more detail about this:

> Some functionality (not yet the CSV read or write) allows the user to use the global CPU thread pool, but num_threads would offer more granular control.

Are you seeing that CSV reading is not multi-threaded? Or that it is not responding to the option(s) that arrow uses to control threading? Something else?
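
For reference, here is a minimal sketch of the global controls in question. It assumes a recent version of the arrow R package, where set_cpu_count() resizes the global CPU thread pool and the arrow.use_threads option is consulted by the CSV reader by default; the thread count, temp file, and data frame are made up for illustration, and behaviour may differ between versions.

```r
library(arrow)

# Remember the current size of the global CPU thread pool, then shrink it.
old_threads <- cpu_count()
set_cpu_count(2)

# Write and read a small CSV; neither call takes a num_threads argument,
# so both rely on the global settings.
path <- tempfile(fileext = ".csv")
write_csv_arrow(data.frame(x = 1:3, y = c("a", "b", "c")), path)
df <- read_csv_arrow(path)

# Ask arrow not to use multiple threads where this option is respected
# (assumed here to include the CSV reader's default read options).
options(arrow.use_threads = FALSE)
df_single <- read_csv_arrow(path)

# Restore the previous global state.
options(arrow.use_threads = NULL)
set_cpu_count(old_threads)
```

Both settings are process-wide, which is the contrast with a per-call num_threads argument as requested in the issue.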

> [R] Expose number of threads in read_csv_arrow() and write_csv_arrow()
> ----------------------------------------------------------------------
>
>                 Key: ARROW-14663
>                 URL: https://issues.apache.org/jira/browse/ARROW-14663
>             Project: Apache Arrow
>          Issue Type: Improvement
>          Components: R
>            Reporter: Dragoș Moldovan-Grünfeld
>            Priority: Minor
>
> As of {{readr}} 2.0.0 (and the switch to {{vroom}}) both {{read_csv()}} and {{write_csv()}} allow the user to pass the number of threads to be used when processing (the {{num_threads}} argument). Currently this functionality is not exposed in Arrow. Some functionality (not yet the CSV read or write) allows the user to use the global CPU thread pool, but {{num_threads}} would offer more granular control. 


