You are viewing a plain text version of this content. The canonical link for it is here.

Posted to github@arrow.apache.org by "RTEnzyme (via GitHub)" <gi...@apache.org> on 2023/02/16 13:13:23 UTC

[GitHub] [arrow-datafusion] RTEnzyme opened a new issue, #5301: data file without suffix can't be read correctly

RTEnzyme opened a new issue, #5301:
URL: https://github.com/apache/arrow-datafusion/issues/5301

   **Describe the bug**
   My data files in linux don't have ".csv" suffix. When I use ctx.read_csv("path/to/csv"), It read blank data rather than the data in my files or an error.
   
   **To Reproduce**
   Use the case in the example `csv_sql.rs`. I remove the csv file's suffix and cpy /path/to/aggregate_test_100.csv /path/to/aggregate_test_100.
   ```Rust
     // register csv file with the execution context
     ctx.register_csv(
         "aggregate_test_100",
         &format!("{testdata}/csv/aggregate_test_100"),
         CsvReadOptions::new(),
     )
     .await?;
    // execute the query
       let df = ctx
           .sql(
               "SELECT c1, MIN(c12), MAX(c12) \
           FROM aggregate_test_100 \
           WHERE c11 > 0.1 AND c11 < 0.9 \
           GROUP BY c1",
           )
           .await?;
   
       // print the results
       df.show().await?;
   ```
   
   The result:
   ```
   Error: SchemaError(FieldNotFound { field: Column { relation: None, name: "c11" }, valid_fields: [] })
   ```
   
   
   
   **Expected behavior**
   I think that either the file can be read correctly in this case, or an error that the file suffix does not match will be reported.
   
   **Additional context**
   Add any other context about the problem here.
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscribe@arrow.apache.org.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org

[GitHub] [arrow-datafusion] RTEnzyme closed issue #5301: data file without suffix can't be read correctly

Posted by "RTEnzyme (via GitHub)" <gi...@apache.org>.

RTEnzyme closed issue #5301: data file without suffix can't be read correctly
URL: https://github.com/apache/arrow-datafusion/issues/5301


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscribe@arrow.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org

[GitHub] [arrow-datafusion] RTEnzyme commented on issue #5301: data file without suffix can't be read correctly

Posted by "RTEnzyme (via GitHub)" <gi...@apache.org>.

RTEnzyme commented on issue #5301:
URL: https://github.com/apache/arrow-datafusion/issues/5301#issuecomment-1434033104

   > I think this is a duplicate of #1736
   
   Emm..I think so..


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscribe@arrow.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org

[GitHub] [arrow-datafusion] tustvold commented on issue #5301: data file without suffix can't be read correctly

Posted by "tustvold (via GitHub)" <gi...@apache.org>.

tustvold commented on issue #5301:
URL: https://github.com/apache/arrow-datafusion/issues/5301#issuecomment-1433388237

   I think this is a duplicate of https://github.com/apache/arrow-datafusion/issues/1736


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscribe@arrow.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org