You are viewing a plain text version of this content. The canonical link for it is here.
Posted to jira@arrow.apache.org by "Roc Granada Verdú (Jira)" <ji...@apache.org> on 2020/11/03 20:01:00 UTC
[jira] [Commented] (ARROW-10372) [C++][Dataset] Read compressed
CSVs
[ https://issues.apache.org/jira/browse/ARROW-10372?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17225643#comment-17225643 ]
Roc Granada Verdú commented on ARROW-10372:
-------------------------------------------
I would also like for this to be implemented.
While the _read_csv_arrow_ function automatically detects compression: "_compression will be detected from the file extension and handled automatically_", the _open_dataset_ function does not. It breaks when trying to read a gzip csv, and there is not a parameter to specify the compression.
> [C++][Dataset] Read compressed CSVs
> ------------------------------------
>
> Key: ARROW-10372
> URL: https://issues.apache.org/jira/browse/ARROW-10372
> Project: Apache Arrow
> Issue Type: New Feature
> Components: C++, R
> Reporter: Martin du Toit
> Assignee: Ben Kietzman
> Priority: Major
> Labels: dataset
> Fix For: 3.0.0
>
>
> It would be nice if arrow can read compressed csv files
--
This message was sent by Atlassian Jira
(v8.3.4#803005)