Posted to issues@arrow.apache.org by "rsangole (via GitHub)" <gi...@apache.org> on 2024/03/15 16:56:18 UTC

[I] Change in how Spaces are handled in file names [arrow]

rsangole opened a new issue, #40582:
URL: https://github.com/apache/arrow/issues/40582

   ### Describe the usage question you have. Please include as many useful details as possible.
   
   
   Hello,
   
   Long-time `{arrow}` user here.
   
   I have a production project (using R) which uses `arrow@10.0.0`. I'm performing a migration activity to update packages, and am trying out v15.
   
   One breaking change on my end is how spaces are handled in file names. I'm not sure in which version this changed, but here is what I see for a grouped dataset:
   
   In v10:
   
   <img width="139" alt="image" src="https://github.com/apache/arrow/assets/6344050/e4c602e6-f512-4aef-80fa-29789d6416c1">
   
   In v15:
   
   <img width="134" alt="image" src="https://github.com/apache/arrow/assets/6344050/f27035b0-f48f-479c-921d-074c60ee5242">
   
   Unfortunately, this has become a breaking change, since I have a storage system which cannot accept `%` in the filename.
   
   Would you know if this is a change in `arrow` or a change in some dependent library like `dplyr`? 
   
   Is there a way to _not_ convert the spaces to `%20` in the file names?
   
   Thanks!
   
   ### Component(s)
   
   R


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscribe@arrow.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


Re: [I] Change in how Spaces are handled in file names [arrow]

Posted by "amoeba (via GitHub)" <gi...@apache.org>.
amoeba commented on issue #40582:
URL: https://github.com/apache/arrow/issues/40582#issuecomment-2000099332

   Hi @rsangole, thanks for letting us know. This is caused by a breaking change made in v11 to address https://github.com/apache/arrow/issues/33448. Discussion in a related issue, https://github.com/apache/arrow/issues/34905, led to a proposal to add a toggle for turning the new behavior on and off, which I think would do the trick for you.
   
   Feel free to subscribe to that issue in the meantime. It has had some recent activity, so hopefully a PR will be put up soon.
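
For illustration only, here is a minimal Python sketch of what URI-style percent-encoding does to a partition value containing a space, which is the `%20` behavior described in this thread. It uses only the Python standard library and is not the Arrow API. The helper names `encode_segment` and `sanitize_segment` are hypothetical, and the sanitizing step is an assumed stopgap for storage systems that reject `%`, not something proposed in the linked issues.

```python
from urllib.parse import quote, unquote


def encode_segment(value: str) -> str:
    """Percent-encode a value as a single URI path segment (space -> %20)."""
    # safe="" means no reserved characters are left unescaped.
    return quote(value, safe="")


def sanitize_segment(value: str, replacement: str = "_") -> str:
    """Hypothetical stopgap: replace spaces before the value becomes a path."""
    return value.replace(" ", replacement)


encoded = encode_segment("New York")
print(encoded)                       # New%20York
print(unquote(encoded))              # New York
print(sanitize_segment("New York"))  # New_York
```

Note that sanitizing is lossy: the stored value no longer matches the original, so anything reading the data back would need to map `_` to a space again, whereas percent-encoding round-trips cleanly.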

