Posted to jira@arrow.apache.org by "Neal Richardson (Jira)" <ji...@apache.org> on 2021/01/05 20:59:00 UTC

[jira] [Commented] (ARROW-10414) [R] open_dataset doesn't work with absolute/expanded paths on Windows

    [ https://issues.apache.org/jira/browse/ARROW-10414?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17259207#comment-17259207 ] 

Neal Richardson commented on ARROW-10414:
-----------------------------------------

I can't reproduce this. Our tests use temp dirs, which are absolute paths, and they pass on Windows just fine. To rule out the "C:" drive prefix as the cause, I modified the test setup [here|https://github.com/apache/arrow/blob/master/r/tests/testthat/test-dataset.R#L90] to strip the drive letter, and it still worked:

{code}
> dataset_dir
[1] "C:/Users/npr/AppData/Local/Temp/Rtmp440fTQ/file13806c8a1ef5"
> dir.exists(dataset_dir)
[1] TRUE
> no_drive <- substr(dataset_dir, 3, nchar(dataset_dir))
> no_drive
[1] "/Users/npr/AppData/Local/Temp/Rtmp440fTQ/file13806c8a1ef5"
> dir.exists(no_drive)
[1] TRUE
> ds2 <- open_dataset(no_drive, partitioning = schema(part = uint8()))
> ds2
FileSystemDataset with 2 Parquet files
int: int32
dbl: double
lgl: bool
chr: string
fct: dictionary<values=string, indices=int32>
ts: timestamp[us, tz=UTC]
part: uint8
> collect(ds2)
# A tibble: 20 x 7
...
{code}

Reading your error message, it looks like the problem is that the directory "/data" does not exist.
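
One way to check this before calling open_dataset is sketched below. It is only an illustration, not part of arrow: the helper name check_dataset_dir is hypothetical, and it relies only on base R's normalizePath() and dir.exists(). It fails early with a message that shows what the path resolved to, which may make a missing directory easier to spot than the error coming from the dataset layer.

```r
# Hypothetical helper (not part of arrow): resolve a path and stop early
# if it does not point to an existing directory.
check_dataset_dir <- function(path) {
  # normalizePath() tries to expand the path to an absolute form; with
  # mustWork = FALSE it returns its best attempt instead of failing.
  resolved <- normalizePath(path, winslash = "/", mustWork = FALSE)
  if (!dir.exists(resolved)) {
    stop("Dataset directory not found: ", resolved, call. = FALSE)
  }
  resolved
}

# Usage, assuming the arrow package is installed and "/data" is the
# directory you intended to open:
# ds <- arrow::open_dataset(check_dataset_dir("/data"))
```

If the directory really lives inside the project folder, a relative path such as "data" (no leading slash) is probably what was intended.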

> [R] open_dataset doesn't work with absolute/expanded paths on Windows
> ---------------------------------------------------------------------
>
>                 Key: ARROW-10414
>                 URL: https://issues.apache.org/jira/browse/ARROW-10414
>             Project: Apache Arrow
>          Issue Type: Bug
>          Components: R
>    Affects Versions: 2.0.0
>         Environment: platform       x86_64-w64-mingw32          
> arch           x86_64                      
> os             mingw32                     
> system         x86_64, mingw32             
> status                                     
> major          3                           
> minor          6.3                         
> year           2020                        
> month          02                          
> day            29                          
> svn rev        77875                       
> language       R                           
> version.string R version 3.6.3 (2020-02-29)
> nickname       Holding the Windsock       
>            Reporter: Christian M
>            Priority: Trivial
>             Fix For: 3.0.0
>
>         Attachments: image-2020-10-28-14-57-01-952.png
>
>
> open_dataset does not work inside an R project when a folder is defined with a '/':
>  
> !image-2020-10-28-14-57-01-952.png!



--
This message was sent by Atlassian Jira
(v8.3.4#803005)