Posted to issues-all@impala.apache.org by "bharath v (JIRA)" <ji...@apache.org> on 2018/11/15 19:37:00 UTC

[jira] [Commented] (IMPALA-7854) Slow ALTER TABLE and LOAD DATA statements for tables with large number of partitions

    [ https://issues.apache.org/jira/browse/IMPALA-7854?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16688556#comment-16688556 ] 

bharath v commented on IMPALA-7854:
-----------------------------------

This looks similar to IMPALA-7330. Could you try it on the latest version, if possible?

> Slow ALTER TABLE and LOAD DATA statements for tables with large number of partitions
> ------------------------------------------------------------------------------------
>
>                 Key: IMPALA-7854
>                 URL: https://issues.apache.org/jira/browse/IMPALA-7854
>             Project: IMPALA
>          Issue Type: Improvement
>          Components: Catalog
>    Affects Versions: Impala 2.12.0
>         Environment: 14 Nodes
> Table in question has 20 columns, 3 partition columns, and 57,475 partitions
>            Reporter: vietn
>            Priority: Critical
>              Labels: impala, performance
>
> ALTER TABLE and LOAD DATA statements take minutes (9 minutes for ALTER TABLE and 6 minutes for LOAD DATA) for tables with a large number of partitions.
> Our workaround was to use Hive to perform the LOAD DATA and then perform a REFRESH PARTITION using Impala.
>  * 14 Nodes
>  * Table in question has 20 columns, 3 partition columns, and 57,475 partitions
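
The workaround quoted above (run LOAD DATA through Hive, then refresh only the affected partition in Impala) could be sketched roughly as below. This is a minimal illustration, not the reporter's actual script: the helper names, the table name, the staging path, and the partition columns are all hypothetical, and the helpers only build statement text to run in the respective Hive and Impala clients.

```python
# Hypothetical helpers that build the two statements of the workaround.
# Nothing here connects to Hive or Impala; it only renders SQL text.

def partition_spec(partition):
    """Render {"year": 2018, "region": "us"} as: year=2018, region='us'.

    Assumes trusted column names and values (no escaping is done).
    """
    parts = []
    for col, val in partition.items():
        if isinstance(val, str):
            parts.append(f"{col}='{val}'")  # string partition values are quoted
        else:
            parts.append(f"{col}={val}")    # numeric values are left bare
    return ", ".join(parts)


def hive_load_statement(table, path, partition):
    """Statement to run in Hive: moves the files at `path` into one partition."""
    return (
        f"LOAD DATA INPATH '{path}' INTO TABLE {table} "
        f"PARTITION ({partition_spec(partition)})"
    )


def impala_refresh_statement(table, partition):
    """Statement to run in Impala afterwards: reloads metadata for one partition."""
    return f"REFRESH {table} PARTITION ({partition_spec(partition)})"


if __name__ == "__main__":
    # Made-up example values; the issue does not name the table or its columns.
    spec = {"year": 2018, "month": 11, "region": "us"}
    print(hive_load_statement("sales.events", "/staging/events", spec))
    print(impala_refresh_statement("sales.events", spec))
```

Refreshing a single partition rather than the whole table is the point of the second statement: with tens of thousands of partitions, a table-wide metadata reload is likely to be far more expensive than reloading only the partition that received new files.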


