You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@arrow.apache.org by "Dyfan Jones (Jira)" <ji...@apache.org> on 2020/05/26 11:40:00 UTC

[jira] [Updated] (ARROW-8942) [R] support read gzip csv files

     [ https://issues.apache.org/jira/browse/ARROW-8942?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Dyfan Jones updated ARROW-8942:
-------------------------------
    Description: 
Hi all,

Apologises if this has already been covered by another ticket. Is it possible for arrow to read in compress delimited files (for example gzip)?

Currently I get an error when trying to read in a compressed delimited file:

 
{code:java}
vroom::vroom_write(iris, "iris.csv.gz", delim = ",")
arrow::read_csv_arrow("iris.csv.gz")

# Error in csv__TableReader_Read(self) :
# Invalid: CSV parse error: Expected 1 columns, got 4{code}
however it can be read in by vroom and readr:
{code:java}
vroom::vroom("iris.csv.gz")
readr::read_csv("iris.csv.gz")
{code}
 

 

 

 

  was:
Hi all,

Apologises if this has already been covered by another ticket. Is it possible for arrow to read in compress delimited files (for example gzip)?

Currently I get an error when trying to read in a compressed delimited file:

```
vroom::vroom_write(iris, "iris.csv.gz", delim = ",")

arrow::read_csv_arrow("iris.csv.gz")
# Error in csv___TableReader__Read(self) : 
 # Invalid: CSV parse error: Expected 1 columns, got 4

# however it can be read in by vroom and readr:

vroom::vroom("iris.csv.gz")

readr::read_csv("iris.csv.gz")
```


> [R] support read gzip csv files
> -------------------------------
>
>                 Key: ARROW-8942
>                 URL: https://issues.apache.org/jira/browse/ARROW-8942
>             Project: Apache Arrow
>          Issue Type: New Feature
>            Reporter: Dyfan Jones
>            Priority: Major
>
> Hi all,
> Apologises if this has already been covered by another ticket. Is it possible for arrow to read in compress delimited files (for example gzip)?
> Currently I get an error when trying to read in a compressed delimited file:
>  
> {code:java}
> vroom::vroom_write(iris, "iris.csv.gz", delim = ",")
> arrow::read_csv_arrow("iris.csv.gz")
> # Error in csv__TableReader_Read(self) :
> # Invalid: CSV parse error: Expected 1 columns, got 4{code}
> however it can be read in by vroom and readr:
> {code:java}
> vroom::vroom("iris.csv.gz")
> readr::read_csv("iris.csv.gz")
> {code}
>  
>  
>  
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)