You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@flink.apache.org by "Fu Kai (Jira)" <ji...@apache.org> on 2021/04/06 07:37:00 UTC

[jira] [Created] (FLINK-22113) UniqueKey constraint is lost with multiple sources join in SQL

Fu Kai created FLINK-22113:
------------------------------

             Summary: UniqueKey constraint is lost with multiple sources join in SQL
                 Key: FLINK-22113
                 URL: https://issues.apache.org/jira/browse/FLINK-22113
             Project: Flink
          Issue Type: Bug
          Components: Table SQL / Planner
    Affects Versions: 1.13.0
            Reporter: Fu Kai


Hi team,
 
We have a use case to join multiple data sources to generate a continuous updated view. We defined primary key constraint on all the input sources and all the keys are the subsets in the join condition. All joins are left join.
 
In our case, the first two inputs can produce *JoinKeyContainsUniqueKey* input sepc, which is good and performant. While when it comes to the third input source, it's joined with the intermediate output table of the first two input tables, and the intermediate table does not carry key constraint information(although the thrid source input table does), so it results in a *NoUniqueKey* input sepc. Given NoUniqueKey inputs has dramatic performance implications per the[ Force Join Unique Key|http://apache-flink-user-mailing-list-archive.2336050.n4.nabble.com/Force-Join-Unique-Key-td39521.html#a39651] email thread, we want to know if there is any mitigation solution for this.
 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)