Posted to commits@hudi.apache.org by GitBox <gi...@apache.org> on 2022/10/23 13:05:38 UTC
[GitHub] [hudi] MihawkZoro opened a new issue, #7040: [SUPPORT] spark-sql schema_evolution
MihawkZoro opened a new issue, #7040:
URL: https://github.com/apache/hudi/issues/7040
**Environment Description**
* Hudi version : 0.11.1
* Spark version : 3.2.2
* Hive version : 2.3.9
* Hadoop version : 2.7.3
* Storage : HDFS
**Describe the problem you faced**
I have a Hudi table:
```
create table ddl_test_t2 (
col1 string,
col2 string,
col3 string,
ts bigint
) using hudi
tblproperties (
type = 'mor',
primaryKey = 'col1',
preCombineField = 'ts'
);
```
I ran some DML and DDL statements to test schema evolution:
```
insert into ddl_test_t2 values('1','col2','col3',1),('2','col2','col3',2),('3','col2','col3',3);
ALTER TABLE ddl_test_t2 DROP COLUMN col3;
ALTER TABLE ddl_test_t2 RENAME COLUMN col2 to col3;
insert into ddl_test_t2 values('4','col2',4);
```
Then I queried column col3 from table ddl_test_t2:
```
select col3 from ddl_test_t2;
```
The result I expected is:
```
col2
col2
col2
col2
```
The actual result was:
```
col3
col3
col3
col2
```
<img width="1218" alt="image" src="https://user-images.githubusercontent.com/32875366/197393752-8b690175-2c3c-4f0a-bef5-d6e604eca03a.png">
I would like to know what the problem is and whether this is a bug.
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: commits-unsubscribe@hudi.apache.org
For queries about this service, please contact Infrastructure at:
users@infra.apache.org
[GitHub] [hudi] xiarixiaoyao commented on issue #7040: [SUPPORT] spark-sql schema_evolution
Posted by GitBox <gi...@apache.org>.
xiarixiaoyao commented on issue #7040:
URL: https://github.com/apache/hudi/issues/7040#issuecomment-1288365585
@MihawkZoro Schema evolution support for Hive and Presto (MOR tables) can be found in https://github.com/apache/hudi/pull/6989
[GitHub] [hudi] xiarixiaoyao commented on issue #7040: [SUPPORT] spark-sql schema_evolution
Posted by GitBox <gi...@apache.org>.
xiarixiaoyao commented on issue #7040:
URL: https://github.com/apache/hudi/issues/7040#issuecomment-1290252404
> @xiarixiaoyao Thank you. When will the repaired official spark bundle jar be released?
It is expected in 0.13.0.
[GitHub] [hudi] xiarixiaoyao commented on issue #7040: [SUPPORT] spark-sql schema_evolution
Posted by GitBox <gi...@apache.org>.
xiarixiaoyao commented on issue #7040:
URL: https://github.com/apache/hudi/issues/7040#issuecomment-1288348142
`rewriteRecordWithNewSchema` fails to handle the rename; it should handle renames first.
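
As a rough illustration of why the order matters, here is a minimal sketch in plain Python (so it runs without Spark or Hudi). It is not taken from the Hudi code base: the helpers `rewrite_by_name` and `rewrite_renames_first` and the `renamed_from` map are hypothetical names made up for this example. It assumes old records are rewritten to the new schema field by field, and compares a pure name lookup with a lookup that resolves renames first.

```python
# Illustrative model only: these helpers are hypothetical, not Hudi's code.

def rewrite_by_name(record, new_fields):
    """Fill each field of the new schema by matching names in the old record."""
    return {name: record.get(name) for name in new_fields}


def rewrite_renames_first(record, new_fields, renamed_from):
    """Map renamed columns back to their old names before looking up values."""
    return {name: record.get(renamed_from.get(name, name)) for name in new_fields}


# Rows written before the DDL, under the schema (col1, col2, col3, ts).
old_rows = [
    {"col1": "1", "col2": "col2", "col3": "col3", "ts": 1},
    {"col1": "2", "col2": "col2", "col3": "col3", "ts": 2},
    {"col1": "3", "col2": "col2", "col3": "col3", "ts": 3},
]

# Schema after DROP COLUMN col3 followed by RENAME COLUMN col2 TO col3.
new_fields = ["col1", "col3", "ts"]
renamed_from = {"col3": "col2"}  # new column name -> name used in old files

# The row inserted after the DDL is already written with the new schema.
new_row = {"col1": "4", "col3": "col2", "ts": 4}

by_name = [rewrite_by_name(r, new_fields)["col3"] for r in old_rows]
by_name.append(new_row["col3"])

renames_first = [
    rewrite_renames_first(r, new_fields, renamed_from)["col3"] for r in old_rows
]
renames_first.append(new_row["col3"])

print(by_name)        # ['col3', 'col3', 'col3', 'col2']
print(renames_first)  # ['col2', 'col2', 'col2', 'col2']
```

In this model the name-only lookup picks up the values of the dropped col3 for the old rows, which matches the result reported in the issue, while resolving the rename first yields the expected output.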
[GitHub] [hudi] MihawkZoro commented on issue #7040: [SUPPORT] spark-sql schema_evolution
Posted by GitBox <gi...@apache.org>.
MihawkZoro commented on issue #7040:
URL: https://github.com/apache/hudi/issues/7040#issuecomment-1288357272
@xiarixiaoyao When will this bug be fixed? We are using this feature, so it is urgent.
[GitHub] [hudi] MihawkZoro commented on issue #7040: [SUPPORT] spark-sql schema_evolution
Posted by GitBox <gi...@apache.org>.
MihawkZoro commented on issue #7040:
URL: https://github.com/apache/hudi/issues/7040#issuecomment-1288368315
@xiarixiaoyao Thank you. When will the repaired official spark bundle jar be released?
[GitHub] [hudi] xiarixiaoyao commented on issue #7040: [SUPPORT] spark-sql schema_evolution
Posted by GitBox <gi...@apache.org>.
xiarixiaoyao commented on issue #7040:
URL: https://github.com/apache/hudi/issues/7040#issuecomment-1288359886
I have already fixed this locally; let me raise a PR.
spark.sql("set hoodie.schema.on.read.enable=true")
spark.sql("""create table ddl_test_t2 (
| col1 string,
| col2 string,
| col3 string,
| ts bigint
|) using hudi
|tblproperties (
| type = 'mor',
| primaryKey = 'col1',
| preCombineField = 'ts'
|)""".stripMargin)
spark.sql("insert into ddl_test_t2 values('1','col2','col3',1),('2','col2','col3',2),('3','col2','col3',3)")
spark.sql("""ALTER TABLE ddl_test_t2 DROP COLUMN col3""")
spark.sql("ALTER TABLE ddl_test_t2 RENAME COLUMN col2 to col3")
spark.sql("insert into ddl_test_t2 values('4','col2',4)")
spark.sql("select col3 from ddl_test_t2").show(false)
+----+
|col3|
+----+
|col2|
|col2|
|col2|
|col2|
+----+
[GitHub] [hudi] xiarixiaoyao commented on issue #7040: [SUPPORT] spark-sql schema_evolution
Posted by GitBox <gi...@apache.org>.
xiarixiaoyao commented on issue #7040:
URL: https://github.com/apache/hudi/issues/7040#issuecomment-1288334741
@MihawkZoro
Thank you for testing.
This is indeed a bug. It is triggered by the final write, `insert into ddl_test_t2 values('4','col2',4);`. I will fix it as soon as possible.
[GitHub] [hudi] xiarixiaoyao closed issue #7040: [SUPPORT] spark-sql schema_evolution
Posted by GitBox <gi...@apache.org>.
xiarixiaoyao closed issue #7040: [SUPPORT] spark-sql schema_evolution
URL: https://github.com/apache/hudi/issues/7040