Posted to commits@hudi.apache.org by "nsivabalan (via GitHub)" <gi...@apache.org> on 2023/03/03 19:06:17 UTC

[GitHub] [hudi] nsivabalan commented on issue #8036: [SUPPORT] Migration of hudi tables encountered issue related to metadata column

nsivabalan commented on issue #8036:
URL: https://github.com/apache/hudi/issues/8036#issuecomment-1453987238

   Good question.
   
   Depending on which SQL tool you use, you can look into how to select all columns except a few. You can then explicitly exclude the hoodie meta columns (the `_hoodie_*` columns) from the SELECT that feeds your INSERT INTO statement.
   
   
   For example, in Spark SQL you can do the following:
   
   # Treat backtick-quoted names in SELECT as regular expressions
   spark.sql("SET spark.sql.parser.quotedRegexColumnNames=true")
   
   # Select all columns except a and b
   spark.sql("select `(a|b)?+.+` from tmp").show()
   #+---+---+
   #| id|  c|
   #+---+---+
   #|  1|  4|
   #+---+---+
   
   Ref: https://stackoverflow.com/questions/63127263/how-to-select-all-columns-except-2-of-them-from-a-large-table-on-pyspark-sql
   
   Hive: https://stackoverflow.com/questions/51227890/hive-how-to-select-all-but-one-column
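
Applied to the hoodie meta columns, the same idea could look roughly like the sketch below. It only builds the SQL string, so it runs without a Spark session. The helper names (`exclude_columns_pattern`, `build_insert_sql`) and the table names (`old_hudi_table`, `new_hudi_table`) are hypothetical and used for illustration only; the five column names are the standard Hudi meta fields. It assumes the source and target tables otherwise have the same columns in the same order.

```python
# The five metadata columns Hudi adds to every record.
HOODIE_META_COLUMNS = [
    "_hoodie_commit_time",
    "_hoodie_commit_seqno",
    "_hoodie_record_key",
    "_hoodie_partition_path",
    "_hoodie_file_name",
]


def exclude_columns_pattern(columns):
    """Build a backtick-quoted regex that selects every column except `columns`.

    Hypothetical helper: it reproduces the `(a|b)?+.+` form shown above.
    """
    return "`(" + "|".join(columns) + ")?+.+`"


def build_insert_sql(source_table, target_table, excluded=HOODIE_META_COLUMNS):
    """Build an INSERT INTO ... SELECT statement that skips the excluded columns."""
    pattern = exclude_columns_pattern(excluded)
    return f"INSERT INTO {target_table} SELECT {pattern} FROM {source_table}"


# Hypothetical table names, for illustration only.
insert_sql = build_insert_sql("old_hudi_table", "new_hudi_table")
print(insert_sql)

# With a live SparkSession named `spark`, the statement could then be run as:
#   spark.sql("SET spark.sql.parser.quotedRegexColumnNames=true")
#   spark.sql(insert_sql)
```

Note that the pattern excludes only exact name matches: a column whose name merely starts with one of the listed names, followed by more characters, would still be selected.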
   

