You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@calcite.apache.org by "yanjing.wang (Jira)" <ji...@apache.org> on 2022/01/26 10:19:00 UTC

[jira] [Updated] (CALCITE-4683) In-list to join causes field datatypes not matched

     [ https://issues.apache.org/jira/browse/CALCITE-4683?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

yanjing.wang updated CALCITE-4683:
----------------------------------
    Summary: In-list to join causes field datatypes not matched  (was: Conversion to relational algebra failed to preserve datatypes when executing a sql query having in clause)

> In-list to join causes field datatypes not matched
> --------------------------------------------------
>
>                 Key: CALCITE-4683
>                 URL: https://issues.apache.org/jira/browse/CALCITE-4683
>             Project: Calcite
>          Issue Type: Bug
>          Components: core
>    Affects Versions: 1.26.0, 1.27.0
>         Environment: jdk8
>  
>            Reporter: yanjing.wang
>            Priority: Major
>              Labels: pull-request-available
>          Time Spent: 10m
>  Remaining Estimate: 0h
>
> The sql query is 
> {code:java}
> SELECT * FROM (SELECT '20210101' AS dt, deptno, max(cast(deptno2 as
> varchar(200))) as m FROM (SELECT emp.deptno as deptno, dept.deptno as deptno2 FROM emp
> JOIN dept on emp.deptno = dept.deptno) tmp GROUP BY deptno) WHERE cast(deptno as
> varchar) in ('1', '3', '5')
> {code}
> When Calcite converts the in list to a join, the original relational algebra root will be replaced by a new project, for example
> When we set InSubQueryThreshold value to 2, then the relational algebra tree will be converted from 
> {code:java}
> LogicalProject(DT=['20210101'], DEPTNO=[$0], M=[$1])
>   LogicalAggregate(group=[{0}], M=[MAX($1)])
>     LogicalProject(DEPTNO=[$7], $f1=[CAST($9):VARCHAR(200) NOT NULL])
>       LogicalJoin(condition=[=($7, $9)], joinType=[inner])
>         LogicalTableScan(table=[[CATALOG, SALES, EMP]])
>         LogicalTableScan(table=[[CATALOG, SALES, DEPT]])
> {code}
> to
> {code:java}
> LogicalJoin(condition=[=($3, $4)], joinType=[inner])
>   LogicalProject(DT=['20210101'], DEPTNO=[$0], M=[$1], DEPTNO0=[CAST($0):VARCHAR NOT NULL])
>     LogicalAggregate(group=[{0}], M=[MAX($1)])
>       LogicalProject(DEPTNO=[$7], $f1=[CAST($9):VARCHAR(200) NOT NULL])
>         LogicalJoin(condition=[=($7, $9)], joinType=[inner])
>           LogicalTableScan(table=[[CATALOG, SALES, EMP]])
>           LogicalTableScan(table=[[CATALOG, SALES, DEPT]])
>   LogicalAggregate(group=[{0}])
>     LogicalValues(tuples=[[{ '1' }, { '3' }, { '5' }]])
> {code}
> We can see that LogicalProject(DT=['20210101'], DEPTNO=[$0], M=[$1]) with leaves being true has been replaced by LogicalProject(DT=['20210101'], DEPTNO=[$0], M=[$1], DEPTNO0=[CAST($0):VARCHAR NOT NULL]) with leaves being false. 
> Finally, the query results java.lang.AssertionError: Conversion to relational algebra failed to preserve datatypes. 



--
This message was sent by Atlassian Jira
(v8.20.1#820001)