You are viewing a plain text version of this content. The canonical link for it is here.

Posted to dev@hive.apache.org by "Vaibhav Gumashta (JIRA)" <ji...@apache.org> on 2016/08/16 22:22:21 UTC

[jira] [Created] (HIVE-14551) HiveServer2: Use vectorized data whenever available for writing final results

Vaibhav Gumashta created HIVE-14551:
---------------------------------------

             Summary: HiveServer2: Use vectorized data whenever available for writing final results 
                 Key: HIVE-14551
                 URL: https://issues.apache.org/jira/browse/HIVE-14551
             Project: Hive
          Issue Type: Sub-task
          Components: HiveServer2
    Affects Versions: 2.1.0
            Reporter: Vaibhav Gumashta


In ThriftJDBCBinarySerde, which we are using in FileSinkOperator to write final results, we buffer rows and store them into typed columns before writing a batch of rows to the result file. However, when vectorized rows batches are available from higher level operators, we should try to use them and avoid the extra penalty of converting from vector --> non-vector single row --> buffered thrift columns (equivalent to vector).



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)