You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@hive.apache.org by "Stamatis Zampetakis (Jira)" <ji...@apache.org> on 2022/10/21 07:21:01 UTC

[jira] [Updated] (HIVE-25698) Hive column update performance too low when table partition over 700

     [ https://issues.apache.org/jira/browse/HIVE-25698?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Stamatis Zampetakis updated HIVE-25698:
---------------------------------------
    Fix Version/s:     (was: All Versions)

I cleared the fixVersion field since this ticket is still open. Please review this ticket and if the fix is already committed to a specific version please set the version accordingly and mark the ticket as RESOLVED.

According to the [JIRA guidelines|https://cwiki.apache.org/confluence/display/Hive/HowToContribute] the fixVersion should be set only when the issue is resolved/closed.

> Hive column update performance too low when table partition over 700
> --------------------------------------------------------------------
>
>                 Key: HIVE-25698
>                 URL: https://issues.apache.org/jira/browse/HIVE-25698
>             Project: Hive
>          Issue Type: Bug
>          Components: Clients, Server Infrastructure
>    Affects Versions: 3.1.1
>         Environment: CentOS 7.8 
> Hadoop 3.1.1
> Impala 3.4.0
>            Reporter: JungHyun An
>            Priority: Minor
>
> Now we using hive 3.1.1
>  
> Currently in our hive we have tables with hundreds of partitions and hundreds of gigabytes of data.
>  
> When updating the column information of the corresponding table, it was confirmed that the performance was several tens of times slower than the Hive 1.1 version of the existing CDH.
>  
> I would like to ask if there is any architectural change that makes column updates slower than Hive 1 in Hive 3 and later versions.
>  
> Thank you.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)