Posted to jira@arrow.apache.org by "Roc Granada Verdú (Jira)" <ji...@apache.org> on 2020/11/03 20:01:00 UTC

[jira] [Commented] (ARROW-10372) [C++][Dataset] Read compressed CSVs

    [ https://issues.apache.org/jira/browse/ARROW-10372?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17225643#comment-17225643 ] 

Roc Granada Verdú commented on ARROW-10372:
-------------------------------------------

I would also like to see this implemented.

The _read_csv_arrow_ function detects compression automatically ("_compression will be detected from the file extension and handled automatically_"), but the _open_dataset_ function does not. It fails when reading a gzip-compressed CSV, and there is no parameter to specify the compression.
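
To illustrate what extension-based detection means in practice, here is a minimal sketch in Python using only the standard library. It is not Arrow's implementation, and the names (_OPENERS, open_text, read_csv_rows) are hypothetical, chosen only for this example. It picks a decompressor from the file extension and falls back to a plain open for anything else:

```python
import bz2
import csv
import gzip
import lzma
import os

# Hypothetical mapping from file extension to the function that opens it.
# Anything not listed here is treated as an uncompressed file.
_OPENERS = {
    ".gz": gzip.open,
    ".bz2": bz2.open,
    ".xz": lzma.open,
}


def open_text(path):
    """Open `path` as text, choosing a decompressor from its extension."""
    extension = os.path.splitext(path)[1].lower()
    opener = _OPENERS.get(extension, open)
    # newline="" lets the csv module handle line endings itself.
    return opener(path, mode="rt", newline="", encoding="utf-8")


def read_csv_rows(path):
    """Return all rows of a (possibly compressed) CSV file as lists of strings."""
    with open_text(path) as handle:
        return list(csv.reader(handle))
```

With something like this, read_csv_rows("data.csv.gz") and read_csv_rows("data.csv") would go through the same code path, which is roughly the behaviour I would hope for from _open_dataset_ as well.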

 

> [C++][Dataset] Read compressed CSVs 
> ------------------------------------
>
>                 Key: ARROW-10372
>                 URL: https://issues.apache.org/jira/browse/ARROW-10372
>             Project: Apache Arrow
>          Issue Type: New Feature
>          Components: C++, R
>            Reporter: Martin du Toit
>            Assignee: Ben Kietzman
>            Priority: Major
>              Labels: dataset
>             Fix For: 3.0.0
>
>
> It would be nice if Arrow could read compressed CSV files.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)