You are viewing a plain text version of this content. The canonical link for it is here.

Posted to issues@iceberg.apache.org by GitBox <gi...@apache.org> on 2019/10/03 17:05:12 UTC

[GitHub] [incubator-iceberg] rdblue commented on issue #510: Cannot update an Iceberg dataset from a Parquet file due to "field should be required, but is optional"

rdblue commented on issue #510: Cannot update an Iceberg dataset from a Parquet file due to "field should be required, but is optional"
URL: https://github.com/apache/incubator-iceberg/issues/510#issuecomment-538035616
 
 
   > Please note that both have the same schema.
   
   The problem is that these do not have the same schema. If the Parquet schema does not have field IDs, then the IDs are assigned using the fallback logic. The fallback logic matches columns by position, not by name.
   
   If you want to read this file, you need to create a name-to-id schema mapping. Those were added in #338. You'd also need to update the fallback strategy in Parquet to use the schema mapping to assign IDs when one is present, instead of the default position-based approach.
   
   @rdsr, FYI

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
users@infra.apache.org


With regards,
Apache Git Services

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org
For additional commands, e-mail: issues-help@iceberg.apache.org