You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@hive.apache.org by "David Mollitor (Jira)" <ji...@apache.org> on 2020/06/11 17:03:00 UTC
[jira] [Assigned] (HIVE-21636) Performance cost when using
replaceAll() vs replace()
[ https://issues.apache.org/jira/browse/HIVE-21636?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
David Mollitor reassigned HIVE-21636:
-------------------------------------
Assignee: bd2019us
> Performance cost when using replaceAll() vs replace()
> ------------------------------------------------------
>
> Key: HIVE-21636
> URL: https://issues.apache.org/jira/browse/HIVE-21636
> Project: Hive
> Issue Type: Improvement
> Components: Accumulo Storage Handler, HCatalog, Vectorization
> Reporter: bd2019us
> Assignee: bd2019us
> Priority: Trivial
> Labels: pull-request-available
> Attachments: HVIE-21636.patch
>
> Time Spent: 10m
> Remaining Estimate: 0h
>
> Use String.replace() instead of String.replaceAll() when replaceAll() does not use a regex
> replace() does not need extra compilation / performance overhead when a non-regex string is used. Thus changing replaceAll() to replace() can remove the associated performance overhead.
> Affected files:
> # accumulo-handler/src/java/org/apache/hadoop/hive/accumulo/predicate/compare/StringCompare.java
> # hcatalog/core/src/main/java/org/apache/hive/hcatalog/mapreduce/FileOutputCommitterContainer.java
> # vector-code-gen/src/org/apache/hadoop/hive/tools/GenVectorCode.java
--
This message was sent by Atlassian Jira
(v8.3.4#803005)