Posted to issues@calcite.apache.org by "Vineet Garg (JIRA)" <ji...@apache.org> on 2016/11/08 05:36:59 UTC

[jira] [Commented] (CALCITE-1483) Suboptimal plan for NOT IN query

    [ https://issues.apache.org/jira/browse/CALCITE-1483?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15646579#comment-15646579 ] 

Vineet Garg commented on CALCITE-1483:
--------------------------------------

Quoting Julian: "Note that there are two scans of DEPT, but one is sufficient because DEPTNO is never null. In the JdbcAggregate, c always equals ck, and therefore the CASE can be simplified, and therefore the scan of DEPT that produces c and ck can be dropped, but Calcite rules cannot deduce that fact."

I'll try to find more examples of such queries.
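
To make the "CASE can be simplified" step concrete, here is a minimal Python sketch of expr#20 from the plan quoted below. It is only an illustration, not Calcite code: the function names are hypothetical, None stands in for SQL NULL, and the field mapping ($t8 = c, $t9 = ck, $t11 = i, $t7 = EMP.DEPTNO) is my reading of the plan. It checks only the narrow claim that when ck equals c the `ck < c` branch can never fire; it does not show that the aggregate scan can be removed altogether.

```python
from itertools import product


def case_from_plan(c, ck, i, deptno):
    """Transcription of expr#20; None stands in for SQL NULL."""
    if c == 0:              # expr#13: =($t8, 0)       -> false
        return False
    if i is not None:       # expr#15: IS NOT NULL($t11) -> true
        return True
    if deptno is None:      # expr#17: IS NULL($t7)    -> null
        return None
    if ck < c:              # expr#19: <($t9, $t8)     -> true
        return True
    return False            # else branch              -> false


def case_simplified(c, i, deptno):
    """The same CASE with the `ck < c` branch removed (assumes ck == c)."""
    if c == 0:
        return False
    if i is not None:
        return True
    if deptno is None:
        return None
    return False


# Compare both forms over a small grid of inputs where ck == c.
agree = all(
    case_from_plan(c, c, i, d) == case_simplified(c, i, d)
    for c, i, d in product(range(4), (None, True), (None, 10, 20))
)
```

The two forms differ only when ck < c, which requires at least one NULL in the subquery column, and a NOT NULL constraint on DEPTNO rules that out.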

> Suboptimal plan for NOT IN query
> --------------------------------
>
>                 Key: CALCITE-1483
>                 URL: https://issues.apache.org/jira/browse/CALCITE-1483
>             Project: Calcite
>          Issue Type: Bug
>          Components: core
>            Reporter: Vineet Garg
>            Assignee: Julian Hyde
>
> The following query generates a suboptimal plan:
> {code} explain plan for select * from scott.emp where deptno not in (select deptno from scott.dept where deptno = 20); {code}
> The resulting plan is:
> {code}
> EnumerableCalc(expr#0..11=[{inputs}], expr#12=[0], expr#13=[=($t8, $t12)], expr#14=[false], expr#15=[IS NOT NULL($t11)], expr#16=[true], expr#17=[IS NULL($t7)], expr#18=[null], expr#19=[<($t9, $t8)], expr#20=[CASE($t13, $t14, $t15, $t16, $t17, $t18, $t19, $t16, $t14)], expr#21=[NOT($t20)], proj#0..7=[{exprs}], $condition=[$t21])
>   EnumerableJoin(condition=[=($7, $10)], joinType=[left])
>     EnumerableCalc(expr#0..9=[{inputs}], EMPNO=[$t2], ENAME=[$t3], JOB=[$t4], MGR=[$t5], HIREDATE=[$t6], SAL=[$t7], COMM=[$t8], DEPTNO=[$t9], c=[$t0], ck=[$t1])
>       EnumerableJoin(condition=[true], joinType=[inner])
>         JdbcToEnumerableConverter
>           JdbcAggregate(group=[{}], c=[COUNT()], ck=[COUNT($0)])
>             JdbcFilter(condition=[=(CAST($0):INTEGER NOT NULL, 20)])
>               JdbcTableScan(table=[[SCOTT, DEPT]])
>         JdbcToEnumerableConverter
>           JdbcTableScan(table=[[SCOTT, EMP]])
>     JdbcToEnumerableConverter
>       JdbcAggregate(group=[{0, 1}])
>         JdbcProject(DEPTNO=[$0], i=[true])
>           JdbcFilter(condition=[=(CAST($0):INTEGER NOT NULL, 20)])
>             JdbcTableScan(table=[[SCOTT, DEPT]])
> {code}
> As Julian pointed out in the mailing-list discussion, one scan of DEPT is sufficient instead of two, because DEPTNO is never null.
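
The reason nullability matters here can be seen in a small, self-contained sketch. It uses Python's built-in sqlite3 module rather than Calcite, and the tables `dept_nullable` and `dept_notnull` and their rows are hypothetical stand-ins, not the real SCOTT schema. A single NULL in the subquery column makes NOT IN reject every row, which is the case the c and ck counts in the plan guard against; with a NOT NULL column that case cannot arise.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    create table emp (empno integer, deptno integer);
    create table dept_nullable (deptno integer);          -- NULLs allowed
    create table dept_notnull (deptno integer not null);  -- like DEPT.DEPTNO
    insert into emp values (1, 10), (2, 20), (3, 30);
    insert into dept_nullable values (20), (NULL);
    insert into dept_notnull values (20);
""")


def not_in(dept_table):
    """Return the EMPNOs kept by a NOT IN filter against the given table."""
    sql = (
        "select empno from emp "
        f"where deptno not in (select deptno from {dept_table}) "
        "order by empno"
    )
    return [row[0] for row in conn.execute(sql)]


# With a NULL in the subquery, the predicate is never true for any row.
rows_nullable = not_in("dept_nullable")
# Without NULLs, the non-matching employees are returned as expected.
rows_notnull = not_in("dept_notnull")
```

Here `rows_nullable` is empty while `rows_notnull` holds employees 1 and 3, so the extra null bookkeeping only changes the answer when the column can actually contain NULL.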



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)