Posted to issues@arrow.apache.org by "Neal Richardson (Jira)" <ji...@apache.org> on 2020/07/31 15:23:00 UTC

[jira] [Created] (ARROW-9609) [C++] CSV datasets don't materialize virtual columns

Neal Richardson created ARROW-9609:
--------------------------------------

             Summary: [C++] CSV datasets don't materialize virtual columns
                 Key: ARROW-9609
                 URL: https://issues.apache.org/jira/browse/ARROW-9609
             Project: Apache Arrow
          Issue Type: Bug
          Components: C++
            Reporter: Neal Richardson
            Assignee: Ben Kietzman
             Fix For: 2.0.0, 1.0.1


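Virtual columns (partition fields inferred from the file path, such as {{c}} from the directory {{c=5}}) are included in the result, but their values are all null.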

Minimal reproducer in R:

{code:r}
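library(arrow)
# Write a single CSV file under a Hive-style partition directory (c=5)
dir.create("testds/c=5", recursive = TRUE)
cat("a,b\n1,2\n", file = "testds/c=5/test.csv")
ds <- open_dataset("testds", format = "csv")
# The schema correctly lists the virtual column c inferred from the path
ds
## FileSystemDataset with 1 csv file
## a: int64
## b: int64
## c: int32
# Bug: when the data is read, c comes back as NA instead of 5
dplyr::collect(ds)
## # A tibble: 1 x 3
##       a     b     c
##   <int> <int> <int>
## 1     1     2    NA
{code}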
{code}
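
For reference, the behaviour one would expect once this is fixed can be written as a check. This is only a sketch, not a test taken from the Arrow test suite: it assumes the partition value is read back as the R integer 5 (matching the int32 schema above), and it writes the dataset under a temporary directory (the {{root}} variable is introduced here for illustration) so that it does not depend on the working directory.

{code:r}
library(arrow)

# Build the same one-file dataset as the reproducer, under tempdir()
root <- file.path(tempdir(), "testds")
dir.create(file.path(root, "c=5"), recursive = TRUE)
cat("a,b\n1,2\n", file = file.path(root, "c=5", "test.csv"))

ds <- open_dataset(root, format = "csv")
df <- dplyr::collect(ds)

# Assumed expectation: c is materialized from the directory name "c=5"
stopifnot(identical(df$c, 5L))
{code}

On an affected build, the {{stopifnot()}} call should fail because {{df$c}} is NA.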