You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@hive.apache.org by "Wei Zheng (Jira)" <ji...@apache.org> on 2022/12/05 18:08:00 UTC
[jira] [Updated] (HIVE-26685) Improve Path name escaping / unescaping performance
[ https://issues.apache.org/jira/browse/HIVE-26685?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Wei Zheng updated HIVE-26685:
-----------------------------
Fix Version/s: 4.0.0
> Improve Path name escaping / unescaping performance
> ---------------------------------------------------
>
> Key: HIVE-26685
> URL: https://issues.apache.org/jira/browse/HIVE-26685
> Project: Hive
> Issue Type: Improvement
> Components: Hive
> Affects Versions: All Versions
> Reporter: James Petty
> Priority: Minor
> Labels: pull-request-available
> Fix For: 4.0.0
>
> Attachments: HIVE-26685.1.patch
>
> Time Spent: 1h 10m
> Remaining Estimate: 0h
>
> When escaping / unescaping partition path part names, the existing logic incurs significant avoidable overhead by copying each character sequentially into a new StringBuilder even when no escaping/unescaping is necessary as well as using String.format to escape characters inside of the inner loop.
>
> The included patch to improve the performance of these operations refactors two static method implementations, but requires no external API surface or user-visible behavior changes. This change is applicable and portable to a wide range of Hive versions from branch-0.6 onward when the initial method implementations were added.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)