You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@hive.apache.org by "JungHyun An (Jira)" <ji...@apache.org> on 2021/12/26 07:28:00 UTC
[jira] [Commented] (HIVE-25698) Hive column update performance too low when table partition over 700
[ https://issues.apache.org/jira/browse/HIVE-25698?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17465304#comment-17465304 ]
JungHyun An commented on HIVE-25698:
------------------------------------
[~kgyrtkirk]
Currently, our HIVE has tables containing hundreds of TB of data. The "alter table 'table_name' change 'cloumn' " statement was executed on the corresponding table, and the execution time was much slower than the execution time of the existing HIVE version 1. I would like to know if there are any architectural changes in HIVE 1 and HIVE 3 related to that feature.
> Hive column update performance too low when table partition over 700
> --------------------------------------------------------------------
>
> Key: HIVE-25698
> URL: https://issues.apache.org/jira/browse/HIVE-25698
> Project: Hive
> Issue Type: Bug
> Components: Clients, Server Infrastructure
> Affects Versions: 3.1.1
> Environment: CentOS 7.8
> Hadoop 3.1.1
> Impala 3.4.0
> Reporter: JungHyun An
> Priority: Minor
> Fix For: All Versions
>
>
> Now we using hive 3.1.1
>
> Currently in our hive we have tables with hundreds of partitions and hundreds of gigabytes of data.
>
> When updating the column information of the corresponding table, it was confirmed that the performance was several tens of times slower than the Hive 1.1 version of the existing CDH.
>
> I would like to ask if there is any architectural change that makes column updates slower than Hive 1 in Hive 3 and later versions.
>
> Thank you.
--
This message was sent by Atlassian Jira
(v8.20.1#820001)