Posted to issues@hive.apache.org by "Eric Wohlstadter (JIRA)" <ji...@apache.org> on 2018/05/02 20:44:00 UTC

[jira] [Comment Edited] (HIVE-19306) Arrow batch serializer

    [ https://issues.apache.org/jira/browse/HIVE-19306?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16461599#comment-16461599 ] 

Eric Wohlstadter edited comment on HIVE-19306 at 5/2/18 8:43 PM:
-----------------------------------------------------------------

[~teddy.choi]

I don't think it is necessary or useful to use FieldWriter for non-complex types. 

Should we change it so that non-complex types use "set" and "setNull" directly on the vector?

I believe this would remove some of the indirection that is making the code harder to debug.

If we are very close to making it work with FieldWriter on non-complex types, let's go ahead with that.

I'm just concerned that we will get stuck trying to fix something that isn't necessary.
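
To make the trade-off concrete, here is a minimal, self-contained sketch of the two write paths for a primitive column. ToyIntVector and ToyIntWriter are hypothetical stand-ins written only for illustration; they are not Arrow's classes and not code from the HIVE-19306 patch. The sketch only assumes, as suggested above, that a vector offers "set" and "setNull" by row index, and that a writer layers a current-position cursor on top of it.

```java
import java.util.BitSet;

public class DirectVsWriterSketch {

    /** Hypothetical stand-in for a fixed-width vector: a value buffer plus a validity bitmap. */
    static final class ToyIntVector {
        private final int[] values;
        private final BitSet validity = new BitSet();

        ToyIntVector(int capacity) {
            values = new int[capacity];
        }

        void set(int index, int value) {
            values[index] = value;
            validity.set(index);      // mark the slot as non-null
        }

        void setNull(int index) {
            validity.clear(index);    // mark the slot as null
        }

        boolean isNull(int index) {
            return !validity.get(index);
        }

        int get(int index) {
            if (isNull(index)) {
                throw new IllegalStateException("null at index " + index);
            }
            return values[index];
        }
    }

    /** Hypothetical stand-in for a writer that hides the row index in mutable state. */
    static final class ToyIntWriter {
        private final ToyIntVector vector;
        private int position;

        ToyIntWriter(ToyIntVector vector) {
            this.vector = vector;
        }

        void setPosition(int position) {
            this.position = position;
        }

        void writeInt(int value) {
            vector.set(position, value);
        }

        void writeNull() {
            vector.setNull(position);
        }
    }

    /** Direct path: the row index is explicit at every call site. */
    static ToyIntVector fillDirect(Integer[] column) {
        ToyIntVector vector = new ToyIntVector(column.length);
        for (int i = 0; i < column.length; i++) {
            if (column[i] == null) {
                vector.setNull(i);
            } else {
                vector.set(i, column[i]);
            }
        }
        return vector;
    }

    /** Writer path: same result, but each write depends on the writer's current position. */
    static ToyIntVector fillViaWriter(Integer[] column) {
        ToyIntVector vector = new ToyIntVector(column.length);
        ToyIntWriter writer = new ToyIntWriter(vector);
        for (int i = 0; i < column.length; i++) {
            writer.setPosition(i);
            if (column[i] == null) {
                writer.writeNull();
            } else {
                writer.writeInt(column[i]);
            }
        }
        return vector;
    }

    static String render(ToyIntVector vector, int index) {
        return vector.isNull(index) ? "null" : Integer.toString(vector.get(index));
    }

    public static void main(String[] args) {
        Integer[] column = {7, null, 42};
        ToyIntVector direct = fillDirect(column);
        ToyIntVector viaWriter = fillViaWriter(column);
        for (int i = 0; i < column.length; i++) {
            System.out.println(i + ": direct=" + render(direct, i)
                + " writer=" + render(viaWriter, i));
        }
    }
}
```

Both paths fill the vector identically. The difference is that the writer path keeps the target row in hidden state, so a missed setPosition call silently writes to the wrong slot, which is the kind of indirection that is hard to trace in a debugger.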


was (Author: ewohlstadter):
[~teddy.choi]

I don't think it is necessary or useful to use FieldWriter for non-complex types. 

Should we change it so that non-complex types use "set" and "setNull" directly on the vector?

I believe this would remove some of the indirection which is making the code harder to debug.

> Arrow batch serializer
> ----------------------
>
>                 Key: HIVE-19306
>                 URL: https://issues.apache.org/jira/browse/HIVE-19306
>             Project: Hive
>          Issue Type: Task
>          Components: Serializers/Deserializers
>            Reporter: Eric Wohlstadter
>            Assignee: Teddy Choi
>            Priority: Major
>         Attachments: HIVE-19306.2.patch
>
>
> Leverage the ThriftJDBCBinarySerDe code path that already exists in SemanticAnalyzer/FileSinkOperator to create a serializer that batches rows into Arrow vector batches.


