You are viewing a plain text version of this content. The canonical link for it is here.

Posted to github@arrow.apache.org by GitBox <gi...@apache.org> on 2021/05/12 13:27:14 UTC

[GitHub] [arrow-datafusion] Dandandan commented on issue #133: Add support for reading partitioned Parquet files

Dandandan commented on issue #133:
URL: https://github.com/apache/arrow-datafusion/issues/133#issuecomment-839771888


   @jorgecarleitao yes
   
   I am also not aware of any standard - also implementations do differ in some subtle ways. I think we have to compare to hive / spark / etc.
   
   On the types - it depends if the type already is set in the schema or if some inference is used for the paths. I think we can first start with adding partition columns to the table schema so we can actually parse the locations based on the type - and add automatic detection of types (like CSV) later.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org