You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@hive.apache.org by "BELUGA BEHR (JIRA)" <ji...@apache.org> on 2018/01/22 19:20:00 UTC

[jira] [Created] (HIVE-18512) Get Results ReadAhead

BELUGA BEHR created HIVE-18512:
----------------------------------

             Summary: Get Results ReadAhead
                 Key: HIVE-18512
                 URL: https://issues.apache.org/jira/browse/HIVE-18512
             Project: Hive
          Issue Type: Improvement
          Components: HiveServer2
    Affects Versions: 3.0.0, 2.4.0
            Reporter: BELUGA BEHR


I don't have any data to back this up, but I wanted to put it on the radar.

It may be possible to improve performance of HS2 with an HDFS read-ahead reader for result data.  This would require adding a cache (configurable size) to the Driver/Context object and adding a separate thread for loading results asynchronously while the client is processing its current batch of results.  It seems that currently, results are loaded on demand.

https://github.com/apache/hive/blob/6d890faf22fd1ede3658a5eed097476eab3c67e9/ql/src/java/org/apache/hadoop/hive/ql/Driver.java#L2298



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)