You are viewing a plain text version of this content. The canonical link for it is here.
Posted to github@arrow.apache.org by GitBox <gi...@apache.org> on 2022/04/07 14:33:11 UTC

[GitHub] [arrow] thisisnic commented on pull request #12826: ARROW-15260: [R] open_dataset - add file_name as column

thisisnic commented on PR #12826:
URL: https://github.com/apache/arrow/pull/12826#issuecomment-1091812413

   Currently this fails with this error:
   
   ```
   Error in `handle_csv_read_error()` at r/R/dplyr-collect.R:33:6:
   ! Invalid: No match for FieldRef.Name(__filename) in int: int32
   dbl: double
   lgl: bool
   chr: string
   fct: dictionary<values=string, indices=int32, ordered=0>
   ts: timestamp[us, tz=UTC]
   group: int32
   other: string
   ```
   I think it's something to do with the fact that the new column is not in the schema; if I try to print the `arrow_dplyr_query` object before I collect, I get:
   
   ```
   Error in schm$GetFieldByName(name)$type$ToString() : 
     attempt to apply non-function
   ```
   
   Which appears to come from here: https://github.com/apache/arrow/blob/dd4215573349d10f7af9a1d0f316bfcc51b991e0/r/R/dplyr.R#L97


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscribe@arrow.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org