You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@hive.apache.org by "Matt McCline (JIRA)" <ji...@apache.org> on 2016/09/01 02:54:21 UTC

[jira] [Comment Edited] (HIVE-14451) Vectorization: Add byRef mode for borrowed Strings in VectorDeserializeRow

    [ https://issues.apache.org/jira/browse/HIVE-14451?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15454101#comment-15454101 ] 

Matt McCline edited comment on HIVE-14451 at 9/1/16 2:53 AM:
-------------------------------------------------------------

This should improve performance for TEXT (LazySimple) and joins (BinarySortable/LazyBinary) [~gopalv] [~ndembla]

Thank you Gopal for the observing this improvement possibility.




was (Author: mmccline):
This should improve performance for TEXT (LazySimple) and non-TEXT (BinarySortable) [~gopalv] [~ndembla]

Thank you Gopal for the observing this improvement possibility.



> Vectorization: Add byRef mode for borrowed Strings in VectorDeserializeRow
> --------------------------------------------------------------------------
>
>                 Key: HIVE-14451
>                 URL: https://issues.apache.org/jira/browse/HIVE-14451
>             Project: Hive
>          Issue Type: Improvement
>          Components: Vectorization
>            Reporter: Gopal V
>            Assignee: Matt McCline
>         Attachments: HIVE-14451.01.patch, HIVE-14451.02.patch
>
>
> In a majority of cases, when using the OptimizedHashMap, the references to the byte[] are immutable. 
> The hashmap result always allocates on boundary conditions, but never mutates a previous buffer.
> Copying Strings out of the hashtable is entirely wasteful and it would be easy to know when the currentBytes is a borrowed slice from the original input.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)