Posted to issues@drill.apache.org by "Sorabh Hamirwasia (JIRA)" <ji...@apache.org> on 2019/04/10 17:13:00 UTC

[jira] [Updated] (DRILL-7028) Reduce the planning time of queries on large Parquet tables with large metadata cache files

     [ https://issues.apache.org/jira/browse/DRILL-7028?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Sorabh Hamirwasia updated DRILL-7028:
-------------------------------------
    Fix Version/s: 1.17.0

> Reduce the planning time of queries on large Parquet tables with large metadata cache files
> -------------------------------------------------------------------------------------------
>
>                 Key: DRILL-7028
>                 URL: https://issues.apache.org/jira/browse/DRILL-7028
>             Project: Apache Drill
>          Issue Type: Improvement
>          Components: Metadata
>            Reporter: Venkata Jyothsna Donapati
>            Assignee: Venkata Jyothsna Donapati
>            Priority: Major
>              Labels: performance
>             Fix For: 1.16.0, 1.17.0
>
>
> If a Parquet table consists of a large number of small files, its metadata cache file grows large, and the planner has to read that large file, which adds planning-time overhead. As a result, most of the query's execution time is spent in the planning phase.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)