You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@calcite.apache.org by "yanjing.wang (Jira)" <ji...@apache.org> on 2022/01/26 10:19:00 UTC
[jira] [Updated] (CALCITE-4683) In-list to join causes field datatypes not matched
[ https://issues.apache.org/jira/browse/CALCITE-4683?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
yanjing.wang updated CALCITE-4683:
----------------------------------
Summary: In-list to join causes field datatypes not matched (was: Conversion to relational algebra failed to preserve datatypes when executing a sql query having in clause)
> In-list to join causes field datatypes not matched
> --------------------------------------------------
>
> Key: CALCITE-4683
> URL: https://issues.apache.org/jira/browse/CALCITE-4683
> Project: Calcite
> Issue Type: Bug
> Components: core
> Affects Versions: 1.26.0, 1.27.0
> Environment: jdk8
>
> Reporter: yanjing.wang
> Priority: Major
> Labels: pull-request-available
> Time Spent: 10m
> Remaining Estimate: 0h
>
> The sql query is
> {code:java}
> SELECT * FROM (SELECT '20210101' AS dt, deptno, max(cast(deptno2 as
> varchar(200))) as m FROM (SELECT emp.deptno as deptno, dept.deptno as deptno2 FROM emp
> JOIN dept on emp.deptno = dept.deptno) tmp GROUP BY deptno) WHERE cast(deptno as
> varchar) in ('1', '3', '5')
> {code}
> When Calcite converts the in list to a join, the original relational algebra root will be replaced by a new project, for example
> When we set InSubQueryThreshold value to 2, then the relational algebra tree will be converted from
> {code:java}
> LogicalProject(DT=['20210101'], DEPTNO=[$0], M=[$1])
> LogicalAggregate(group=[{0}], M=[MAX($1)])
> LogicalProject(DEPTNO=[$7], $f1=[CAST($9):VARCHAR(200) NOT NULL])
> LogicalJoin(condition=[=($7, $9)], joinType=[inner])
> LogicalTableScan(table=[[CATALOG, SALES, EMP]])
> LogicalTableScan(table=[[CATALOG, SALES, DEPT]])
> {code}
> to
> {code:java}
> LogicalJoin(condition=[=($3, $4)], joinType=[inner])
> LogicalProject(DT=['20210101'], DEPTNO=[$0], M=[$1], DEPTNO0=[CAST($0):VARCHAR NOT NULL])
> LogicalAggregate(group=[{0}], M=[MAX($1)])
> LogicalProject(DEPTNO=[$7], $f1=[CAST($9):VARCHAR(200) NOT NULL])
> LogicalJoin(condition=[=($7, $9)], joinType=[inner])
> LogicalTableScan(table=[[CATALOG, SALES, EMP]])
> LogicalTableScan(table=[[CATALOG, SALES, DEPT]])
> LogicalAggregate(group=[{0}])
> LogicalValues(tuples=[[{ '1' }, { '3' }, { '5' }]])
> {code}
> We can see that LogicalProject(DT=['20210101'], DEPTNO=[$0], M=[$1]) with leaves being true has been replaced by LogicalProject(DT=['20210101'], DEPTNO=[$0], M=[$1], DEPTNO0=[CAST($0):VARCHAR NOT NULL]) with leaves being false.
> Finally, the query results java.lang.AssertionError: Conversion to relational algebra failed to preserve datatypes.
--
This message was sent by Atlassian Jira
(v8.20.1#820001)