You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@arrow.apache.org by "SHIMA Tatsuya (Jira)" <ji...@apache.org> on 2022/02/07 13:46:00 UTC
[jira] [Created] (ARROW-15599) [R] can't explicitly convert a column as a typestamp from CSV (or other delimited) file
SHIMA Tatsuya created ARROW-15599:
-------------------------------------
Summary: [R] can't explicitly convert a column as a typestamp from CSV (or other delimited) file
Key: ARROW-15599
URL: https://issues.apache.org/jira/browse/ARROW-15599
Project: Apache Arrow
Issue Type: Bug
Affects Versions: 6.0.1
Environment: R version 4.1.2 (2021-11-01)
Platform: x86_64-pc-linux-gnu (64-bit)
Running under: Ubuntu 20.04.3 LTS
Reporter: SHIMA Tatsuya
I tried to read the csv column type as timestamp, but I could only get it to work well when `col_types` was not specified.
I'm sorry if I missed something and this is the expected behavior. (It would be great if you could add an example with `col_types` in the documentation.)
{code:r}
library(arrow)
#>
#> Attaching package: 'arrow'
#> The following object is masked from 'package:utils':
#>
#> timestamp
t_string <- tibble::tibble(
x = "2018-10-07 19:04:05"
)
write_csv_arrow(t_string, "tmp.csv")
read_csv_arrow(
"tmp.csv",
as_data_frame = FALSE
)
#> Table
#> 1 rows x 1 columns
#> $x <timestamp[s]>
read_csv_arrow(
"tmp.csv",
col_names = "x",
col_types = "?",
as_data_frame = FALSE
)
#> Table
#> 2 rows x 1 columns
#> $x <string>
read_csv_arrow(
"tmp.csv",
col_names = "x",
col_types = "T",
as_data_frame = FALSE
)
#> Error: Invalid: In CSV column #0: CSV conversion error to timestamp[s]: invalid value 'x'
read_csv_arrow(
"tmp.csv",
col_names = "x",
col_types = "T",
as_data_frame = FALSE,
timestamp_parsers = "%Y-%m-%d %H:%M:%S"
)
#> Error: Invalid: In CSV column #0: CSV conversion error to timestamp[s]: invalid value 'x'
{code}
--
This message was sent by Atlassian Jira
(v8.20.1#820001)