You are viewing a plain text version of this content. The canonical link for it is here.
Posted to jira@arrow.apache.org by "Neal Richardson (Jira)" <ji...@apache.org> on 2021/03/10 16:48:00 UTC
[jira] [Updated] (ARROW-9657) [R][Dataset] Expose more
FileSystemDatasetFactory options
[ https://issues.apache.org/jira/browse/ARROW-9657?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Neal Richardson updated ARROW-9657:
-----------------------------------
Description:
Among the features:
* ignore_prefixes option
* Pass an explicit list of files + base directory
* Exclude invalid files (boolean) option
An important use case this would allow/fix is being able to open_dataset("really_really_big_file.csv") so you can partition/write it.
was:
Among the features:
* ignore_prefixes option
* Pass an explicit list of files + base directory
* Exclude invalid files (boolean) option
> [R][Dataset] Expose more FileSystemDatasetFactory options
> ---------------------------------------------------------
>
> Key: ARROW-9657
> URL: https://issues.apache.org/jira/browse/ARROW-9657
> Project: Apache Arrow
> Issue Type: New Feature
> Components: R
> Reporter: Neal Richardson
> Assignee: Ian Cook
> Priority: Major
> Labels: dataset
> Fix For: 4.0.0
>
>
> Among the features:
> * ignore_prefixes option
> * Pass an explicit list of files + base directory
> * Exclude invalid files (boolean) option
> An important use case this would allow/fix is being able to open_dataset("really_really_big_file.csv") so you can partition/write it.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)