Posted to issues@spark.apache.org by "Dongjoon Hyun (JIRA)" <ji...@apache.org> on 2019/07/16 16:42:15 UTC

[jira] [Updated] (SPARK-27841) Improve UTF8String fromString()/toString()/numChars() performance when strings are ASCII

     [ https://issues.apache.org/jira/browse/SPARK-27841?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Dongjoon Hyun updated SPARK-27841:
----------------------------------
    Affects Version/s:     (was: 2.4.0)
                       3.0.0

> Improve UTF8String fromString()/toString()/numChars() performance when strings are ASCII
> ----------------------------------------------------------------------------------------
>
>                 Key: SPARK-27841
>                 URL: https://issues.apache.org/jira/browse/SPARK-27841
>             Project: Spark
>          Issue Type: Improvement
>          Components: SQL
>    Affects Versions: 3.0.0
>            Reporter: Josh Rosen
>            Assignee: Josh Rosen
>            Priority: Major
>
> UTF8String's fromString(), toString(), and numChars() methods are performance hotspots. For strings that consist entirely of ASCII characters, we can apply optimizations that significantly reduce memory allocation and copying, which greatly improves performance for many common workloads.
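
The description above does not say how the ASCII case would be optimized, so the following is only a minimal sketch of the general idea, not the change made in Spark. The class AsciiFastPath and its methods encode(), decode(), and numChars() are hypothetical names standing in for fromString(), toString(), and numChars(); the knownAscii flag is assumed to be computed once and cached by the caller.

```java
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;

// Hypothetical helper for illustration only; this is NOT Spark's UTF8String.
final class AsciiFastPath {

    // True if every byte has its high bit clear, i.e. is 7-bit ASCII.
    static boolean isAscii(byte[] utf8) {
        for (byte b : utf8) {
            if (b < 0) {
                return false;
            }
        }
        return true;
    }

    // Stand-in for fromString(): for pure-ASCII input each char maps to
    // exactly one byte, so the bytes are written directly. On the first
    // non-ASCII char, fall back to the general UTF-8 encoder.
    static byte[] encode(String s) {
        int n = s.length();
        byte[] out = new byte[n];
        for (int i = 0; i < n; i++) {
            char c = s.charAt(i);
            if (c >= 0x80) {
                return s.getBytes(StandardCharsets.UTF_8);
            }
            out[i] = (byte) c;
        }
        return out;
    }

    // Stand-in for toString(): ASCII bytes have the same values in
    // ISO-8859-1, which decodes one byte to one char with no multi-byte
    // handling. Non-ASCII data goes through the UTF-8 decoder.
    static String decode(byte[] utf8, boolean knownAscii) {
        Charset cs = knownAscii ? StandardCharsets.ISO_8859_1 : StandardCharsets.UTF_8;
        return new String(utf8, cs);
    }

    // Stand-in for numChars(): with ASCII data the code point count equals
    // the byte count. Otherwise count every byte that is not a UTF-8
    // continuation byte (bit pattern 10xxxxxx).
    static int numChars(byte[] utf8, boolean knownAscii) {
        if (knownAscii) {
            return utf8.length;
        }
        int count = 0;
        for (byte b : utf8) {
            if ((b & 0xC0) != 0x80) {
                count++;
            }
        }
        return count;
    }
}
```

A caller would run isAscii() once per value and pass the result to decode() and numChars(). Whether this is faster in practice depends on the JVM version and the workload, so it would need to be benchmarked.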



--
This message was sent by Atlassian JIRA
(v7.6.14#76016)
