Posted to issues@impala.apache.org by "Joe McDonnell (Jira)" <ji...@apache.org> on 2020/12/23 20:47:00 UTC

[jira] [Resolved] (IMPALA-8509) Data load schema generation should lazily evaluate shell substitutions

     [ https://issues.apache.org/jira/browse/IMPALA-8509?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Joe McDonnell resolved IMPALA-8509.
-----------------------------------
    Fix Version/s: Impala 3.4.0
       Resolution: Fixed

Most of the eval blocks were converted to use LOAD DATA statements here:

[https://github.com/apache/impala/commit/90ab610d3432a93248486ee38e10f58ec47908e6]

That significantly reduced the run time of generate-schema-statements.py, so I'm going to close this as resolved.

> Data load schema generation should lazily evaluate shell substitutions
> ----------------------------------------------------------------------
>
>                 Key: IMPALA-8509
>                 URL: https://issues.apache.org/jira/browse/IMPALA-8509
>             Project: IMPALA
>          Issue Type: Improvement
>          Components: Infrastructure
>            Reporter: Todd Lipcon
>            Assignee: Todd Lipcon
>            Priority: Major
>             Fix For: Impala 3.4.0
>
>
> Some of the data loading commands (the testescape_* tables in particular) rely on shell statements to execute. These are evaluated by the 'eval_section' function in generate-schema-statements.py. However, the 'eval_section' call is evaluated eagerly even if the table already exists, which makes 'generate-schema-statements' quite slow on every run.
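
For illustration, here is a minimal sketch of the lazy evaluation the description asks for. It is not the code from generate-schema-statements.py: the names LazySection, run_shell and build_load_statement are hypothetical, and it assumes the set of already-existing tables is known before any shell statement would need to run.

```python
import subprocess


def run_shell(command):
    """Run a shell command and return its stdout as text (hypothetical helper)."""
    return subprocess.check_output(command, shell=True, universal_newlines=True)


class LazySection(object):
    """Holds a shell command and runs it only when its output is first requested."""

    def __init__(self, command, runner=run_shell):
        self._command = command
        self._runner = runner      # injectable so callers can avoid a real shell
        self._evaluated = False
        self._value = None

    def get(self):
        # Run the command at most once and cache the result.
        if not self._evaluated:
            self._value = self._runner(self._command)
            self._evaluated = True
        return self._value


def build_load_statement(table_name, existing_tables, section):
    """Return the load statement for a table, or None if the table already exists.

    The (possibly slow) shell substitution is skipped entirely for existing tables.
    """
    if table_name in existing_tables:
        return None
    return section.get()
```

With this shape, a call such as build_load_statement("testescape_a", existing_tables, LazySection("some-shell-command")) spawns a shell only when "testescape_a" is missing from existing_tables; the table name and command here are placeholders.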


