You are viewing a plain text version of this content. The canonical link for it is here.

Posted to github@arrow.apache.org by "liningpan (via GitHub)" <gi...@apache.org> on 2023/04/12 19:29:14 UTC

[GitHub] [arrow-datafusion] liningpan commented on issue #1736: Files without `.parquet`, `.csv` extension inferred as having no schema

liningpan commented on issue #1736:
URL: https://github.com/apache/arrow-datafusion/issues/1736#issuecomment-1505811924

   I also ran into this issue. The problem seems to be even if only a single file is provided, we still try to match by extension. 
   
   https://github.com/apache/arrow-datafusion/blob/0e5f6df2c4fb2d647874102a19c74eaaf7f34d98/datafusion/core/src/datasource/listing/url.rs#L143-L175
   
   If my use case always uses a single csv file for each table, would a reasonable workaround be setting file extension to an empty string in `CsvReadOptions`? Are there any problems?
   
   ```rust
   ctx.register_csv(
       name,
       path,
       CsvReadOptions::new().file_extension("")
   ).await?
   ```


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscribe@arrow.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org