You are viewing a plain text version of this content. The canonical link for it is here.

Posted to commits@hudi.apache.org by GitBox <gi...@apache.org> on 2022/01/07 14:57:37 UTC

[GitHub] [hudi] xiarixiaoyao commented on pull request #4533: [HUDI-2682] Spark schema not updated with new columns on hive sync

xiarixiaoyao commented on pull request #4533:
URL: https://github.com/apache/hudi/pull/4533#issuecomment-1007472831


   @parisni    we want sparksql tread hudi as DataSource table to have a better performace. 
   when spark read dataSource table, spark will restore table metadata from table properties(include table schema )
   you can see the original code in spark HiveExternalCatalog.restoreTableMetadata 
      * It reads table schema, provider, partition column names and bucket specification from table
      * properties, and filter out these special entries from table properties.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@hudi.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org