Posted to jira@arrow.apache.org by "Todd Farmer (Jira)" <ji...@apache.org> on 2022/07/12 14:05:03 UTC

[jira] [Assigned] (ARROW-9657) [R][Dataset] Expose more FileSystemDatasetFactory options

     [ https://issues.apache.org/jira/browse/ARROW-9657?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Todd Farmer reassigned ARROW-9657:
----------------------------------

    Assignee:     (was: Ian Cook)

This issue was last updated over 90 days ago, which may be an indication it is no longer being actively worked. To better reflect the current state, the issue is being unassigned. Please feel free to re-take assignment of the issue if it is being actively worked, or if you plan to start that work soon.

> [R][Dataset] Expose more FileSystemDatasetFactory options
> ---------------------------------------------------------
>
>                 Key: ARROW-9657
>                 URL: https://issues.apache.org/jira/browse/ARROW-9657
>             Project: Apache Arrow
>          Issue Type: New Feature
>          Components: R
>            Reporter: Neal Richardson
>            Priority: Major
>              Labels: dataset
>
> Among the features to expose:
> * ignore_prefixes option
> * Pass an explicit list of files + base directory
> * Exclude invalid files (boolean) option
> An important use case this would enable is calling open_dataset("really_really_big_file.csv") on a single large file so that you can partition it and write it back out.
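
To make the three requested options concrete, here is a minimal stdlib-only Python sketch of how file discovery with such options might behave. It is not Arrow's implementation and not the R API: discover_files, partition_segments, and the is_valid callback are hypothetical names invented for illustration, and the default prefixes (".", "_") are an assumption about the factory's defaults.

```python
import os
import tempfile


def discover_files(base_dir, ignore_prefixes=(".", "_"), is_valid=None):
    """Hypothetical helper (not Arrow code): list files under base_dir.

    ignore_prefixes: skip any file or directory whose name starts with one
        of these prefixes (the defaults here are an assumption).
    is_valid: optional callable(path) -> bool standing in for a
        format-specific "exclude invalid files" check.
    Returns sorted paths relative to base_dir, using "/" separators.
    """
    prefixes = tuple(ignore_prefixes)
    found = []
    for root, dirs, files in os.walk(base_dir):
        # Prune ignored directories in place so os.walk does not descend into them.
        dirs[:] = [d for d in dirs if not d.startswith(prefixes)]
        for name in files:
            if name.startswith(prefixes):
                continue
            path = os.path.join(root, name)
            if is_valid is not None and not is_valid(path):
                continue
            found.append(os.path.relpath(path, base_dir).replace(os.sep, "/"))
    return sorted(found)


def partition_segments(path, base_dir):
    """Hypothetical helper: directory segments of path relative to base_dir.

    Illustrates why an explicit file list needs a base directory: the
    segments left after stripping it (e.g. "year=2020") carry the
    partition information.
    """
    rel = os.path.relpath(path, base_dir).replace(os.sep, "/")
    return rel.split("/")[:-1]


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as base:
        os.makedirs(os.path.join(base, "year=2020"))
        for rel in ("year=2020/part-0.csv", "year=2020/_SUCCESS", "notes.txt"):
            open(os.path.join(base, *rel.split("/")), "w").close()
        print(discover_files(base))
        print(discover_files(base, is_valid=lambda p: p.endswith(".csv")))
        print(partition_segments(os.path.join(base, "year=2020", "part-0.csv"), base))
```

Passing an empty ignore_prefixes disables the name filter entirely, which is the kind of control the issue asks to surface in R.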



--
This message was sent by Atlassian Jira
(v8.20.10#820010)