You are viewing a plain text version of this content. The canonical link for it is here.
Posted to jira@arrow.apache.org by "Nic Crane (Jira)" <ji...@apache.org> on 2021/09/03 10:50:00 UTC

[jira] [Created] (ARROW-13887) [R] Various errors using schemas when reading in CSV file

Nic Crane created ARROW-13887:
---------------------------------

             Summary: [R] Various errors using schemas when reading in CSV file
                 Key: ARROW-13887
                 URL: https://issues.apache.org/jira/browse/ARROW-13887
             Project: Apache Arrow
          Issue Type: Bug
          Components: R
            Reporter: Nic Crane


While reporting another bug, I found an error while working with schemas.  It's not just this particular data type - try changing around the various data types specified and similar errors occur.  Unsure if this is at the R or C++ layer
{code:java}
share_data <- tibble::tibble(
  company = c("AMZN", "GOOG", "BKNG", "TSLA"),
  price = c(3463.12, 2884.38, 2300.46, 732.39),
  date = rep(as.Date("2021-09-03"), 4)
)

readr::write_csv(share_data, file = "share_data.csv")

share_schema <- schema(
  company = utf8(),
  price = float64(),
  date = date32()
)

read_csv_arrow("share_data.csv", schema = share_schema)

{code}
{code:java}
Error: Invalid: In CSV column #1: CSV conversion error to double: invalid value 'price'
/home/nic2/arrow/cpp/src/arrow/csv/converter.cc:492 decoder_.Decode(data, size, quoted, &value)
/home/nic2/arrow/cpp/src/arrow/csv/parser.h:84 status
/home/nic2/arrow/cpp/src/arrow/csv/converter.cc:496 parser.VisitColumn(col_index, visit) {code}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)