You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@arrow.apache.org by "thisisnic (via GitHub)" <gi...@apache.org> on 2023/03/17 15:07:04 UTC

[GitHub] [arrow] thisisnic opened a new issue, #34609: [R] Unhelpful error message when creating a dataset from datasets of differing file types

thisisnic opened a new issue, #34609:
URL: https://github.com/apache/arrow/issues/34609

   ### Describe the bug, including details regarding any error messages, version, and platform.
   
   I'm fairly sure this isn't supported anyway, and so we should add in some validation so we get a better error message.
   
   ``` r
   library(arrow)
   tf <- tempfile()
   dir.create(tf)
   
   arrow::write_csv_arrow(mtcars, file.path(tf, "mtcars.csv"))
   arrow::write_parquet(mtcars, file.path(tf, "mtcars.parquet"))
   
   schema <- schema(mpg = float64(), cyl = int64(), disp = float64(), hp = int64(), 
       drat = float64(), wt = float64(), qsec = float64(), vs = int64(), 
       am = int64(), gear = int64(), carb = int64())
   
   csv_dataset <-
     open_dataset(tf,
                  format = "csv",
                  factory_options = list(exclude_invalid_files = TRUE),
                  schema = schema)
   
   parquet_dataset <-
     open_dataset(tf,
                  format = "parquet",
                  factory_options = list(exclude_invalid_files = TRUE), schema = schema)
   
   open_dataset(c(csv_dataset, parquet_dataset))
   #> Error: x is not a character vector
   ```
   
   <sup>Created on 2023-03-17 with [reprex v2.0.2](https://reprex.tidyverse.org)</sup>
   
   
   ### Component(s)
   
   R


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscribe@arrow.apache.org.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


[GitHub] [arrow] thisisnic commented on issue #34609: [R] Unhelpful error message when creating a dataset from datasets of differing file types

Posted by "thisisnic (via GitHub)" <gi...@apache.org>.
thisisnic commented on issue #34609:
URL: https://github.com/apache/arrow/issues/34609#issuecomment-1473991897

   This is actually do to the missing `skip` in the CSV dataset; we should see if we can surface the appropriate error message better.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscribe@arrow.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org