You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@hive.apache.org by "Wei Zheng (Jira)" <ji...@apache.org> on 2022/12/05 18:08:00 UTC

[jira] [Updated] (HIVE-26685) Improve Path name escaping / unescaping performance

     [ https://issues.apache.org/jira/browse/HIVE-26685?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Wei Zheng updated HIVE-26685:
-----------------------------
    Fix Version/s: 4.0.0

> Improve Path name escaping / unescaping performance
> ---------------------------------------------------
>
>                 Key: HIVE-26685
>                 URL: https://issues.apache.org/jira/browse/HIVE-26685
>             Project: Hive
>          Issue Type: Improvement
>          Components: Hive
>    Affects Versions: All Versions
>            Reporter: James Petty
>            Priority: Minor
>              Labels: pull-request-available
>             Fix For: 4.0.0
>
>         Attachments: HIVE-26685.1.patch
>
>          Time Spent: 1h 10m
>  Remaining Estimate: 0h
>
> When escaping / unescaping partition path part names, the existing logic incurs significant avoidable overhead by copying each character sequentially into a new StringBuilder even when no escaping/unescaping is necessary as well as using String.format to escape characters inside of the inner loop.
>  
> The included patch to improve the performance of these operations refactors two static method implementations, but requires no external API surface or user-visible behavior changes. This change is applicable and portable to a wide range of Hive versions from branch-0.6 onward when the initial method implementations were added.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)