You are viewing a plain text version of this content. The canonical link for it is here.
Posted to jira@arrow.apache.org by "Neal Richardson (Jira)" <ji...@apache.org> on 2021/02/25 20:56:00 UTC

[jira] [Commented] (ARROW-11789) [R] Support vector of multiple file paths in read_*() functions

    [ https://issues.apache.org/jira/browse/ARROW-11789?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17291217#comment-17291217 ] 

Neal Richardson commented on ARROW-11789:
-----------------------------------------

I think I'd prefer the latter, though either way we'd support this through open_dataset %>% collect.

> [R] Support vector of multiple file paths in read_*() functions
> ---------------------------------------------------------------
>
>                 Key: ARROW-11789
>                 URL: https://issues.apache.org/jira/browse/ARROW-11789
>             Project: Apache Arrow
>          Issue Type: Improvement
>          Components: R
>    Affects Versions: 3.0.0
>            Reporter: Ian Cook
>            Priority: Major
>
> In an upcoming version of readr, {{read_csv()}} and friends will support vectors of multiple file paths. Consider adding support for this to the {{arrow::read_*()}} functions. In the {{as_data_frame = FALSE}} case, consider whether the result should be a Table or a Dataset and whether the user should be offered a choice of which, and if Dataset, consider whether there should be options for partitioning.
> Alternatively we might add support for vectors of multiple file paths to {{arrow::open_dataset()}}, and note in the {{read_*()}} function docs to use {{open_dataset()}} for this purpose.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)