Posted to issues@hive.apache.org by "James Petty (Jira)" <ji...@apache.org> on 2022/11/02 17:22:00 UTC

[jira] [Commented] (HIVE-26685) Improve Path name escaping / unescaping performance

    [ https://issues.apache.org/jira/browse/HIVE-26685?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17627846#comment-17627846 ] 

James Petty commented on HIVE-26685:
------------------------------------

Since this is my first contribution to the Hive project, I'm not sure what the preferred process should be for targeting this change at the different applicable branches. Please let me know whether targeting the pull request at the master branch is sufficient or whether I need to take some extra steps to have this patch included in the other release branches.

> Improve Path name escaping / unescaping performance
> ---------------------------------------------------
>
>                 Key: HIVE-26685
>                 URL: https://issues.apache.org/jira/browse/HIVE-26685
>             Project: Hive
>          Issue Type: Improvement
>          Components: Hive
>    Affects Versions: All Versions
>            Reporter: James Petty
>            Priority: Minor
>              Labels: pull-request-available
>         Attachments: HIVE-26685.1.patch
>
>          Time Spent: 10m
>  Remaining Estimate: 0h
>
> When escaping or unescaping partition path part names, the existing logic incurs significant avoidable overhead in two ways: it copies each character sequentially into a new StringBuilder even when no escaping or unescaping is necessary, and it calls String.format inside the inner loop to escape each character.
>  
> The included patch improves the performance of these operations by refactoring two static method implementations, and it requires no changes to the external API surface or to user-visible behavior. The change is applicable and portable to a wide range of Hive versions, from branch-0.6 (where the initial method implementations were added) onward.
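
For readers who have not opened the attachment, the two optimizations described above could look roughly like the sketch below. This is not the contents of HIVE-26685.1.patch. The class name PathEscapeSketch, the method names, and the small set of escaped characters are hypothetical, and the "%XX" output format (percent sign plus two uppercase hex digits) is an assumption about what the existing methods produce. The idea is to return the input string unchanged when nothing needs escaping, and to append hex digits from a lookup table instead of calling String.format per character.

```java
import java.util.BitSet;

// Illustrative sketch only; names and the escaped-character set are hypothetical.
final class PathEscapeSketch {
    // Assumed subset of characters that must be escaped in a path part name.
    private static final BitSet NEEDS_ESCAPE = new BitSet(128);
    private static final char[] HEX = "0123456789ABCDEF".toCharArray();

    static {
        for (char c : new char[] {'%', '/', ':', '=', '#', '?', '\\'}) {
            NEEDS_ESCAPE.set(c);
        }
    }

    private static boolean needsEscape(char c) {
        return c < 128 && NEEDS_ESCAPE.get(c);
    }

    static String escape(String name) {
        // Scan for the first character that needs escaping.
        int first = -1;
        for (int i = 0; i < name.length(); i++) {
            if (needsEscape(name.charAt(i))) {
                first = i;
                break;
            }
        }
        if (first < 0) {
            return name; // fast path: nothing to escape, no copy made
        }
        StringBuilder sb = new StringBuilder(name.length() + 16);
        sb.append(name, 0, first); // bulk-copy the clean prefix
        for (int i = first; i < name.length(); i++) {
            char c = name.charAt(i);
            if (needsEscape(c)) {
                // Table lookup replaces String.format in the inner loop.
                sb.append('%').append(HEX[(c >> 4) & 0xF]).append(HEX[c & 0xF]);
            } else {
                sb.append(c);
            }
        }
        return sb.toString();
    }

    static String unescape(String path) {
        int first = path.indexOf('%');
        if (first < 0) {
            return path; // fast path: no escape sequences present
        }
        StringBuilder sb = new StringBuilder(path.length());
        sb.append(path, 0, first);
        for (int i = first; i < path.length(); i++) {
            char c = path.charAt(i);
            if (c == '%' && i + 2 < path.length()) {
                int hi = Character.digit(path.charAt(i + 1), 16);
                int lo = Character.digit(path.charAt(i + 2), 16);
                if (hi >= 0 && lo >= 0) {
                    sb.append((char) ((hi << 4) | lo));
                    i += 2; // skip the two hex digits just consumed
                    continue;
                }
            }
            sb.append(c); // not a valid escape sequence: keep the character
        }
        return sb.toString();
    }
}
```

Under these assumptions, a partition value such as "ds=2022-11-02" is escaped to "ds%3D2022-11-02", while a name with no special characters is returned as the same String instance without allocating a StringBuilder.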



--
This message was sent by Atlassian Jira
(v8.20.10#820010)