You are viewing a plain text version of this content. The canonical link for it is here.
Posted to github@arrow.apache.org by "Jefffrey (via GitHub)" <gi...@apache.org> on 2023/02/07 09:03:20 UTC

[GitHub] [arrow-datafusion] Jefffrey commented on issue #5187: Dataframe API adds ?table? qualifier

Jefffrey commented on issue #5187:
URL: https://github.com/apache/arrow-datafusion/issues/5187#issuecomment-1420422685

   This seems to be expected behaviour. A Dataframe has a default name of `?table?` if none is specified:
   
   https://github.com/apache/arrow-datafusion/blob/48732b4cb2c8e42fbe5be295429bbc465e5f5491/datafusion/expr/src/logical_plan/builder.rs#L52
   
   In the error in the original issue post, it is due to you attempting to select the column `last_modified` whereas the correct name is `last_modified_date`
   
   For the error in the subsequent comment, it is because you need to quote the identifiers like so:
   
   ```rust
   let df = df.select(vec![
           col("\"?table?\".key").alias("key"),
           col("\"?table?\".size").alias("size"),
           col("\"?table?\".last_modified").alias("last_modified"),
       ])?;
   ```
   
   Otherwise the parser will assume the entire string is the name of the column, instead of being able to detect there are two identifiers delimited by the period (where first identifier is quoted by double quotes `"`).


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscribe@arrow.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org