You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@hive.apache.org by "Grant Henke (Jira)" <ji...@apache.org> on 2019/10/17 14:15:00 UTC

[jira] [Created] (HIVE-22362) Support key-range splitting by size the HiveKuduInputFormat

Grant Henke created HIVE-22362:
----------------------------------

             Summary: Support key-range splitting by size the HiveKuduInputFormat
                 Key: HIVE-22362
                 URL: https://issues.apache.org/jira/browse/HIVE-22362
             Project: Hive
          Issue Type: Sub-task
            Reporter: Grant Henke


In order to allow for more parallelism and predictable task sizes we should support Kudu key range splitting to allow more parallel tasks per tablet. Without this the parallelism is limited by the number of tablets to scan. 

The implementation is like similar to the Spark implementation here:
https://github.com/apache/kudu/commit/22a6faa44364dec3a171ec79c15b814ad9277d8f



--
This message was sent by Atlassian Jira
(v8.3.4#803005)