You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues-all@impala.apache.org by "Xianqing He (Jira)" <ji...@apache.org> on 2021/07/15 11:53:00 UTC

[jira] [Updated] (IMPALA-10799) Analysis slowdown with inline views and thousands of column

     [ https://issues.apache.org/jira/browse/IMPALA-10799?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Xianqing He updated IMPALA-10799:
---------------------------------
    Description: 
If there are thousands of columns in the inlineview, it‘s very slow in analysis. For example, this sql will take almost 4s in analysis if the inline view has tens of thousands of column
{code:java}
select c1 from (select c1, c2... c10001 from T) T


   Query Compilation: 3s880ms
       - Translate start: 968.000ns (968.000ns)
       - Translate finished: 4.318ms (4.317ms)
       - Metadata of all 1 tables cached: 42.219ms (37.900ms)
       - Analysis finished: 3s776ms (3s734ms)
       - Value transfer graph computed: 3s806ms (30.163ms)
       - Single node plan created: 3s869ms (62.556ms)
       - Runtime filters computed: 3s874ms (5.603ms)
       - Distributed plan created: 3s874ms (128.086us)
       - Planning finished: 3s880ms (5.836ms)
{code}

  was:
If there are thousands of columns in the inlineview, it‘s very slow in analysis. For example, this sql will take almost 4s in analysis.
{code:java}
select c1 from (select c1, c2... c10000 from T) T


   Query Compilation: 3s880ms
       - Translate start: 968.000ns (968.000ns)
       - Translate finished: 4.318ms (4.317ms)
       - Metadata of all 1 tables cached: 42.219ms (37.900ms)
       - Analysis finished: 3s776ms (3s734ms)
       - Value transfer graph computed: 3s806ms (30.163ms)
       - Single node plan created: 3s869ms (62.556ms)
       - Runtime filters computed: 3s874ms (5.603ms)
       - Distributed plan created: 3s874ms (128.086us)
       - Planning finished: 3s880ms (5.836ms)
{code}


> Analysis slowdown with inline views and thousands of column
> -----------------------------------------------------------
>
>                 Key: IMPALA-10799
>                 URL: https://issues.apache.org/jira/browse/IMPALA-10799
>             Project: IMPALA
>          Issue Type: Improvement
>          Components: Frontend
>            Reporter: Xianqing He
>            Assignee: Xianqing He
>            Priority: Major
>
> If there are thousands of columns in the inlineview, it‘s very slow in analysis. For example, this sql will take almost 4s in analysis if the inline view has tens of thousands of column
> {code:java}
> select c1 from (select c1, c2... c10001 from T) T
>    Query Compilation: 3s880ms
>        - Translate start: 968.000ns (968.000ns)
>        - Translate finished: 4.318ms (4.317ms)
>        - Metadata of all 1 tables cached: 42.219ms (37.900ms)
>        - Analysis finished: 3s776ms (3s734ms)
>        - Value transfer graph computed: 3s806ms (30.163ms)
>        - Single node plan created: 3s869ms (62.556ms)
>        - Runtime filters computed: 3s874ms (5.603ms)
>        - Distributed plan created: 3s874ms (128.086us)
>        - Planning finished: 3s880ms (5.836ms)
> {code}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-all-unsubscribe@impala.apache.org
For additional commands, e-mail: issues-all-help@impala.apache.org