You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@gobblin.apache.org by "Zihan Li (Jira)" <ji...@apache.org> on 2021/08/13 01:24:00 UTC

[jira] [Created] (GOBBLIN-1515) Fix hive schema not updated issue in streaming and GMIP

Zihan Li created GOBBLIN-1515:
---------------------------------

             Summary: Fix hive schema not updated issue in streaming and GMIP
                 Key: GOBBLIN-1515
                 URL: https://issues.apache.org/jira/browse/GOBBLIN-1515
             Project: Apache Gobblin
          Issue Type: Bug
            Reporter: Zihan Li


There are two issues need to be fixed:

1. Fix the issue that Hive schema is using avro schema even for orc table: Now, since we directly use avro schema literal to calculate the column and column types for orc table, it sometimes is not correct, so we need to change avro schema to orc schema and compute column types accordingly

2. Change the behavior that we only evolve avro schema literal but not the hive schema when schema evolution happen: In Streaming, when we do hive registration, we compare the current schema with schema registry and when there is schema evolution, we only update avro schema literal and leave hive schema to be lag. 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)