Posted to issues@hive.apache.org by "Stamatis Zampetakis (Jira)" <ji...@apache.org> on 2022/10/21 07:21:01 UTC
[jira] [Updated] (HIVE-25698) Hive column update performance too low when table partition over 700
[ https://issues.apache.org/jira/browse/HIVE-25698?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Stamatis Zampetakis updated HIVE-25698:
---------------------------------------
Fix Version/s: (was: All Versions)
I cleared the fixVersion field since this ticket is still open. Please review this ticket and, if the fix is already committed to a specific version, set the version accordingly and mark the ticket as RESOLVED.
According to the [JIRA guidelines|https://cwiki.apache.org/confluence/display/Hive/HowToContribute] the fixVersion should be set only when the issue is resolved/closed.
> Hive column update performance too low when table partition over 700
> --------------------------------------------------------------------
>
> Key: HIVE-25698
> URL: https://issues.apache.org/jira/browse/HIVE-25698
> Project: Hive
> Issue Type: Bug
> Components: Clients, Server Infrastructure
> Affects Versions: 3.1.1
> Environment: CentOS 7.8
> Hadoop 3.1.1
> Impala 3.4.0
> Reporter: JungHyun An
> Priority: Minor
>
> We are currently using Hive 3.1.1.
>
> Our Hive deployment has tables with hundreds of partitions and hundreds of gigabytes of data.
>
> When we update the column information of such a table, we have confirmed that it is several tens of times slower than on the Hive 1.1 version in our existing CDH.
>
> I would like to ask whether an architectural change in Hive 3 and later versions makes column updates slower than in Hive 1.
>
> Thank you.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)