You are viewing a plain text version of this content. The canonical link for it is here.
Posted to github@arrow.apache.org by "tdhock (via GitHub)" <gi...@apache.org> on 2023/06/27 01:24:39 UTC

[GitHub] [arrow] tdhock commented on pull request #36304: GH-36121: [R] Warn for `set_io_thread_count()` with `num_threads` < 2

tdhock commented on PR #36304:
URL: https://github.com/apache/arrow/pull/36304#issuecomment-1608564686

   hi @paleolimbot thanks writing a PR that partially addresses the issue I created. 
   To fully address that issue, can you please add documentation to (1) clarify the difference between IO threads and CPU threads, and (2) to explain which function should be called to control the CSV reading operation?
   For (1) a link to the C++ doc web page https://arrow.apache.org/docs/cpp/threading.html would be very helpful.
   Could a link to that page be added on the R man pages for arrow::cpu_count and arrow::io_thread_count?
   For (2) I would have expected some mention of how to control number of threads used for CSV reading on the man page for read_csv_arrow, but there is no mention of threads on that man page. Something like "use arrow::set_cpu_count(N_CPUS) to tell arrow to use N_CPUS for reading the CSV file" on that man page would be useful.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscribe@arrow.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org