Posted to github@arrow.apache.org by "pitrou (via GitHub)" <gi...@apache.org> on 2023/09/04 15:09:42 UTC

[GitHub] [arrow] pitrou commented on issue #37389: [Python] Reading parquet file with map without annotated map keys

pitrou commented on issue #37389:
URL: https://github.com/apache/arrow/issues/37389#issuecomment-1705425494

   > Technically this can be fixed by postponing the exception until a real null map key data has been caught.
   
   That sounds like quite a bit of additional maintenance. Also, I'm not sure delaying the error until actual nulls are read is really user-friendly...
   
   > I think whether we can regard that as a `List<Struct<Key, Value>>` and `Cast` to a Map
   
   This has the same problem of additional maintenance and delaying the error until much later.
   
   This might not be an immediate solution, but perhaps a third-party utility could be created to "fix" non-conforming Parquet files by rewriting them? It could address a growing number of non-conformity issues without forcing mainline implementations to reimplement compatibility hacks.
   

