You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user-zh@flink.apache.org by 刘 家锹 <LJ...@outlook.com> on 2022/03/04 02:53:48 UTC

CDC 分表同步快照顺序

Hi, all

我们在使用Flink CDC同步多张表,然后合并slink到一张es表中。但表之间有数据流转关系,比如有table_1, table_2, table2,  一条数据A之前table_1,但后续可能更新到table_2。
想请教下,如果使用正则表达式匹配同步分表,是否可以保证数据有序无误呢? 也就是全部分表同时快照,且等待所有分表快照同步完后才开始处理binlog。
从文档[1]中看到对于单表这种模式是可以保证的,但不确定多表且有数据流转是否也一样。



[1] https://ververica.github.io/flink-cdc-connectors/master/content/connectors/mysql-cdc.html
MySQL CDC Connector — Flink CDC documentation<https://ververica.github.io/flink-cdc-connectors/master/content/connectors/mysql-cdc.html>
SQL Client JAR¶. Download link is available only for stable releases.. Download flink-sql-connector-mysql-cdc-2.2-SNAPSHOT.jar and put it under <FLINK_HOME>/lib/.. Setup MySQL server¶. You have to define a MySQL user with appropriate permissions on all databases that the Debezium MySQL connector monitors.
ververica.github.io