Posted to jira@arrow.apache.org by "Neal Richardson (Jira)" <ji...@apache.org> on 2021/03/10 16:48:00 UTC

[jira] [Updated] (ARROW-9657) [R][Dataset] Expose more FileSystemDatasetFactory options

     [ https://issues.apache.org/jira/browse/ARROW-9657?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Neal Richardson updated ARROW-9657:
-----------------------------------
    Description: 
Among the features:

* ignore_prefixes option
* Pass an explicit list of files + base directory
* Exclude invalid files (boolean) option

An important use case this would enable is calling open_dataset("really_really_big_file.csv") on a single large file so that you can partition it and write it back out.

  was:
Among the features:

* ignore_prefixes option
* Pass an explicit list of files + base directory
* Exclude invalid files (boolean) option


> [R][Dataset] Expose more FileSystemDatasetFactory options
> ---------------------------------------------------------
>
>                 Key: ARROW-9657
>                 URL: https://issues.apache.org/jira/browse/ARROW-9657
>             Project: Apache Arrow
>          Issue Type: New Feature
>          Components: R
>            Reporter: Neal Richardson
>            Assignee: Ian Cook
>            Priority: Major
>              Labels: dataset
>             Fix For: 4.0.0
>
>
> Among the features:
> * ignore_prefixes option
> * Pass an explicit list of files + base directory
> * Exclude invalid files (boolean) option
> An important use case this would enable is calling open_dataset("really_really_big_file.csv") on a single large file so that you can partition it and write it back out.
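
The single-file use case described in the issue might look roughly like the following in R. This is only a sketch, not the implementation tracked by this ticket: it assumes a release of the arrow R package in which open_dataset() accepts a single file path, and the temporary paths and the `year` and `value` columns are made up for illustration.

```r
library(arrow)

# Hypothetical stand-in for a very large CSV; a tiny file keeps the sketch runnable.
csv_path <- tempfile(fileext = ".csv")
write.csv(
  data.frame(year = c(2019L, 2019L, 2020L), value = c(1.5, 2.5, 3.5)),
  csv_path,
  row.names = FALSE
)

# Open the single file as a Dataset instead of reading it into memory.
ds <- open_dataset(csv_path, format = "csv")

# Rewrite it as Parquet, partitioned by the (hypothetical) `year` column.
out_dir <- tempfile()
write_dataset(ds, out_dir, format = "parquet", partitioning = "year")

# The partitioned output can then be reopened as a multi-file dataset.
partitioned <- open_dataset(out_dir)
```

With the default Hive-style layout, the output directory would contain one subdirectory per distinct `year` value.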



--
This message was sent by Atlassian Jira
(v8.3.4#803005)