Posted to issues-all@impala.apache.org by "Quanlong Huang (Jira)" <ji...@apache.org> on 2023/01/17 02:39:00 UTC

[jira] [Updated] (IMPALA-11843) IndexOutOfBoundsException in analytic limit pushdown

     [ https://issues.apache.org/jira/browse/IMPALA-11843?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Quanlong Huang updated IMPALA-11843:
------------------------------------
    Target Version: Impala 4.1.2

> IndexOutOfBoundsException in analytic limit pushdown
> ----------------------------------------------------
>
>                 Key: IMPALA-11843
>                 URL: https://issues.apache.org/jira/browse/IMPALA-11843
>             Project: IMPALA
>          Issue Type: Bug
>          Components: Frontend
>    Affects Versions: Impala 4.0.0, Impala 4.1.0, Impala 4.2.0, Impala 4.1.1
>            Reporter: Quanlong Huang
>            Assignee: Quanlong Huang
>            Priority: Critical
>
> The following query fails with IndexOutOfBoundsException:
> {code:sql}
> create table tbl (id int);
> select id from (
>   select id, 
>     row_number() over (order by id) rn, 
>     max(id) over () max_id
>   from tbl
> ) t
> where id = max_id and rn < 10;
> ERROR: IndexOutOfBoundsException: Index: 0, Size: 0
> {code}
> The stacktrace in logs:
> {noformat}
> I0116 15:55:46.766265 23944 Frontend.java:2062] be402cb92ecc5490:11cbe79000000000] Analysis and authorization finished.
> I0116 15:55:46.803608 23944 jni-util.cc:288] be402cb92ecc5490:11cbe79000000000] java.lang.IndexOutOfBoundsException: Index: 0, Size: 0
>         at java.util.ArrayList.rangeCheck(ArrayList.java:659)
>         at java.util.ArrayList.get(ArrayList.java:435)
>         at org.apache.impala.planner.AnalyticPlanner.inferPartitionLimits(AnalyticPlanner.java:914)
>         at org.apache.impala.planner.AnalyticPlanner.createSingleNodePlan(AnalyticPlanner.java:115)
>         at org.apache.impala.planner.SingleNodePlanner.createQueryPlan(SingleNodePlanner.java:295)
>         at org.apache.impala.planner.SingleNodePlanner.createInlineViewPlan(SingleNodePlanner.java:1244)
>         at org.apache.impala.planner.SingleNodePlanner.createTableRefNode(SingleNodePlanner.java:2208)
>         at org.apache.impala.planner.SingleNodePlanner.createTableRefsPlan(SingleNodePlanner.java:931)
>         at org.apache.impala.planner.SingleNodePlanner.createSelectPlan(SingleNodePlanner.java:750)
>         at org.apache.impala.planner.SingleNodePlanner.createQueryPlan(SingleNodePlanner.java:278)
>         at org.apache.impala.planner.SingleNodePlanner.createSingleNodePlan(SingleNodePlanner.java:170)
>         at org.apache.impala.planner.Planner.createPlanFragments(Planner.java:120)
>         at org.apache.impala.planner.Planner.createPlans(Planner.java:249)
>         at org.apache.impala.service.Frontend.createExecRequest(Frontend.java:1733)
>         at org.apache.impala.service.Frontend.getPlannedExecRequest(Frontend.java:2344)
>         at org.apache.impala.service.Frontend.doCreateExecRequest(Frontend.java:2181)
>         at org.apache.impala.service.Frontend.getTExecRequest(Frontend.java:1967)
>         at org.apache.impala.service.Frontend.createExecRequest(Frontend.java:1789)
>         at org.apache.impala.service.JniFrontend.createExecRequest(JniFrontend.java:164)
> {noformat}
> There is a predicate "rn < 10" on the row_number() results, so the planner considers pushing the limit down into the inline view to turn it into a TopN query. While the conjuncts are being examined, the other predicate "id = max_id" is also checked. Planning then fails in the following code in AnalyticPlanner.inferPartitionLimits():
> {code:java}
>       List<Expr> lhsSourceExprs = ((SlotRef) lhs).getDesc().getSourceExprs();
>       if (lhsSourceExprs.size() > 1 ||
>             !(lhsSourceExprs.get(0) instanceof AnalyticExpr)) {
>         continue;
>       }
> {code}
> [https://github.com/apache/impala/blob/f2f6b4b5804df036a5a7dc8ff23f8a0537b5bf97/fe/src/main/java/org/apache/impala/planner/AnalyticPlanner.java#L912-L916]
> 'lhsSourceExprs' is empty since "id" is a slot ref for a source table column. Thus lhsSourceExprs.get(0) throws the IndexOutOfBoundsException.
> To work around this bug, users can disable this optimization with a query option:
> {code:sql}
> set ANALYTIC_RANK_PUSHDOWN_THRESHOLD=0;
> {code}
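
The failure mode described above can be reproduced in isolation. The following is a minimal sketch, not the actual Impala code or its fix: Expr, AnalyticExpr, skipUnguarded and skipGuarded are hypothetical stand-ins invented for illustration. It assumes only what the description states, namely that the source-expr list is empty for a slot that refers to a base table column. The unguarded check has the same shape as the quoted snippet (size() > 1 || !(get(0) instanceof ...)), so get(0) is reached when the list is empty. One possible guard is to require exactly one element before reading it.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;

public class SourceExprCheckSketch {
  // Hypothetical stand-ins for the planner's expression types; not Impala's classes.
  interface Expr {}
  static final class AnalyticExpr implements Expr {}

  // Same shape as the quoted check: get(0) is evaluated whenever size() <= 1,
  // which includes size() == 0.
  static boolean skipUnguarded(List<Expr> sourceExprs) {
    return sourceExprs.size() > 1 || !(sourceExprs.get(0) instanceof AnalyticExpr);
  }

  // Guarded variant (an assumption, not the committed fix): only read element 0
  // when the list holds exactly one expression.
  static boolean skipGuarded(List<Expr> sourceExprs) {
    return sourceExprs.size() != 1 || !(sourceExprs.get(0) instanceof AnalyticExpr);
  }

  public static void main(String[] args) {
    // Like "id": a base table column with no source exprs.
    List<Expr> baseColumn = new ArrayList<>();
    // Like "rn": a slot produced by a single analytic expression.
    List<Expr> rowNumber = Collections.<Expr>singletonList(new AnalyticExpr());

    try {
      skipUnguarded(baseColumn);
      System.out.println("unguarded check: no exception");
    } catch (IndexOutOfBoundsException e) {
      System.out.println("unguarded check threw " + e.getClass().getSimpleName());
    }
    System.out.println("guarded, base column skipped: " + skipGuarded(baseColumn));
    System.out.println("guarded, row_number skipped: " + skipGuarded(rowNumber));
  }
}
```

With the guard in place, a conjunct such as "id = max_id" would simply be skipped by this step, while "rn < 10" would still be considered for the pushdown.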


