You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@hive.apache.org by "Vaibhav Gumashta (JIRA)" <ji...@apache.org> on 2016/08/16 22:22:21 UTC
[jira] [Created] (HIVE-14551) HiveServer2: Use vectorized data
whenever available for writing final results
Vaibhav Gumashta created HIVE-14551:
---------------------------------------
Summary: HiveServer2: Use vectorized data whenever available for writing final results
Key: HIVE-14551
URL: https://issues.apache.org/jira/browse/HIVE-14551
Project: Hive
Issue Type: Sub-task
Components: HiveServer2
Affects Versions: 2.1.0
Reporter: Vaibhav Gumashta
In ThriftJDBCBinarySerde, which we are using in FileSinkOperator to write final results, we buffer rows and store them into typed columns before writing a batch of rows to the result file. However, when vectorized rows batches are available from higher level operators, we should try to use them and avoid the extra penalty of converting from vector --> non-vector single row --> buffered thrift columns (equivalent to vector).
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)