Posted to jira@arrow.apache.org by "Dewey Dunnington (Jira)" <ji...@apache.org> on 2022/12/13 15:17:00 UTC

[jira] [Comment Edited] (ARROW-17332) [R] error parsing folder path with accent ('c:/Público') in read_csv_arrow

    [ https://issues.apache.org/jira/browse/ARROW-17332?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17646696#comment-17646696 ] 

Dewey Dunnington edited comment on ARROW-17332 at 12/13/22 3:16 PM:
--------------------------------------------------------------------

Ok, I have a PR up for this: https://github.com/apache/arrow/pull/14930

Since Arrow dev PRs are a little hard to test, I put together the following snippet, which should tell me whether the PR will actually solve this. [~lucasmation], is there any chance you could give it a try?

{code:R}
library(arrow, warn.conflicts = FALSE)
#> Some features are not enabled in this build of Arrow. Run `arrow_info()` for more information.

f <- tempfile(fileext = "Público")
write.csv(mtcars, f, row.names = FALSE)

# this should fail based on the issue report
mtcars2 <- read_csv_arrow(f)

# if the PR interpreted the problem correctly, this should work
# (circumvent any Arrow filename handling)
fs <- LocalFileSystem$create()
marked_path <- enc2utf8(normalizePath(f, winslash = "/", mustWork = FALSE))
file <- arrow:::fs___FileSystem__OpenInputFile(fs, marked_path)
mtcars2 <- read_csv_arrow(file)
nrow(mtcars2)
#> [1] 32
{code}
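
The workaround above relies on enc2utf8() handing Arrow a path that is marked as UTF-8. As a rough companion check, the sketch below uses only base R (no Arrow calls) to show how to inspect the declared encoding of a path and compare its UTF-8 bytes with its native-encoding bytes. The path "c:/Público/data.csv" is a hypothetical example and is never opened; whether the bytes differ depends on the session locale, so this is a diagnostic aid under those assumptions, not a confirmed explanation of the bug.

```r
# Base R only. The path is a hypothetical example; no file is read or written.
p <- "c:/P\u00fablico/data.csv"  # "\u00fa" is the accented "u" in "Público"

# Declared encoding of the string as R sees it
enc <- Encoding(p)

# Re-encode explicitly: once to UTF-8, once to the session's native encoding
p_utf8 <- enc2utf8(p)
p_native <- enc2native(p)

# Raw bytes of each form; in a non-UTF-8 locale these may differ
bytes_utf8 <- charToRaw(p_utf8)
bytes_native <- charToRaw(p_native)

print(enc)
print(bytes_utf8)
print(bytes_native)

# TRUE when the native encoding is already UTF-8; may be FALSE otherwise
print(identical(bytes_utf8, bytes_native))
```

If the last line prints FALSE on the affected machine, the path bytes change depending on which encoding is used, which would be consistent with the idea behind the enc2utf8() call in the snippet above.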




> [R] error parsing folder path with accent ('c:/Público') in read_csv_arrow
> --------------------------------------------------------------------------
>
>                 Key: ARROW-17332
>                 URL: https://issues.apache.org/jira/browse/ARROW-17332
>             Project: Apache Arrow
>          Issue Type: Bug
>          Components: R
>            Reporter: Lucas Mation
>            Priority: Major
>              Labels: pull-request-available
>             Fix For: 11.0.0
>
>          Time Spent: 50m
>  Remaining Estimate: 0h
>
> I am a user trying the R arrow package on a Windows machine.
> To reproduce, create a folder whose name contains a character with a Latin accent:
> ```
> library(arrow)
> p <- 'c:/Público'
> b <- read_csv_arrow(p)
> Error: IOError: Failed to open local file 'c:/Público'. Detail: [Windows error 5] Access is denied.
> ```


