You are viewing a plain text version of this content. The canonical link for it is here.
Posted to github@arrow.apache.org by GitBox <gi...@apache.org> on 2021/12/02 05:12:02 UTC

[GitHub] [arrow-datafusion] houqp commented on issue #1383: Reading nested parquet files results in `index out of bounds`

houqp commented on issue #1383:
URL: https://github.com/apache/arrow-datafusion/issues/1383#issuecomment-984299178


   I think this can be fixed with a quick and dirty workaround when we iterate through `row_group_meta.columns()` and only count top level fields by looking at repetition level.
   
   > There is another fact: DataFusion has its own parquet reader - it does NOT use the Arrow-RS/Parquet native implementation. I have no idea why it is so.
   
   We are using the parquet crate you linked right now.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscribe@arrow.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org