You are viewing a plain text version of this content. The canonical link for it is here.
Posted to jira@arrow.apache.org by "Nicola Crane (Jira)" <ji...@apache.org> on 2022/03/01 10:54:00 UTC
[jira] [Commented] (ARROW-15812) [R] Allow user to supply col_names argument when reading in a CSV dataset
[ https://issues.apache.org/jira/browse/ARROW-15812?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17499452#comment-17499452 ]
Nicola Crane commented on ARROW-15812:
--------------------------------------
Is supporting logical values here actually going to be useful to users (in which case I need to open a C++ component ticket to enable autogeneration of column names for datasets), or would we be better enabling this argument, but testing the user-supplied value and if it's {{TRUE}} or {{FALSE}}, raise an error saying this isn't supported, but if it's a vector of column names just pass it through to the equivalent parameter (i.e. {{column_names}})?
> [R] Allow user to supply col_names argument when reading in a CSV dataset
> -------------------------------------------------------------------------
>
> Key: ARROW-15812
> URL: https://issues.apache.org/jira/browse/ARROW-15812
> Project: Apache Arrow
> Issue Type: Improvement
> Components: R
> Reporter: Nicola Crane
> Priority: Major
>
> Allow the user to supply the {{col_names}} argument from {{readr}} when reading in a dataset.
> This is already possible when reading in a single CSV file via {{arrow::read_csv_arrow()}} via the {{readr_to_csv_read_options}} function, and so once the C++ functionality to autogenerate column names for Datasets is implemented, we should hook up {{readr_to_csv_read_options}} in {{csv_file_format_read_opts}} just like we have with {{readr_to_csv_parse_options}} in {{csv_file_format_parse_options}}.
--
This message was sent by Atlassian Jira
(v8.20.1#820001)