Posted to dev@hive.apache.org by "Grant Henke (Jira)" <ji...@apache.org> on 2019/10/17 14:15:00 UTC
[jira] [Created] (HIVE-22362) Support key-range splitting by size in the HiveKuduInputFormat
Grant Henke created HIVE-22362:
----------------------------------
Summary: Support key-range splitting by size in the HiveKuduInputFormat
Key: HIVE-22362
URL: https://issues.apache.org/jira/browse/HIVE-22362
Project: Hive
Issue Type: Sub-task
Reporter: Grant Henke
To allow more parallelism and more predictable task sizes, the HiveKuduInputFormat should support Kudu key-range splitting by size, so that a single tablet can be scanned by several parallel tasks. Without this, parallelism is limited by the number of tablets to scan.
The implementation would be similar to the Spark implementation here:
https://github.com/apache/kudu/commit/22a6faa44364dec3a171ec79c15b814ad9277d8f
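To make the idea concrete, here is a minimal sketch of size-based key-range splitting within one tablet. It is an illustration only and is not taken from the Hive or Kudu code: the class KeyRangeSplitSketch, the KeyRange type, the splitBySize method and the per-chunk size estimates are all hypothetical names and inputs. The sketch assumes the tablet can report an ordered list of contiguous key-range chunks with estimated sizes, and greedily merges adjacent chunks until each split reaches a target size.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical illustration; none of these names come from Hive or Kudu.
class KeyRangeSplitSketch {

  // A contiguous primary-key range inside one tablet, with an estimated size.
  static final class KeyRange {
    final String startKey;  // inclusive; "" means unbounded
    final String endKey;    // exclusive; "" means unbounded
    final long sizeBytes;   // assumed size estimate for this range

    KeyRange(String startKey, String endKey, long sizeBytes) {
      this.startKey = startKey;
      this.endKey = endKey;
      this.sizeBytes = sizeBytes;
    }

    @Override
    public String toString() {
      return "[" + startKey + ", " + endKey + ") " + sizeBytes + " bytes";
    }
  }

  // Greedily merges adjacent chunks (already sorted by key) into splits.
  // A split is closed as soon as its accumulated size reaches targetBytes,
  // so every split except possibly the last is at least targetBytes large.
  static List<KeyRange> splitBySize(List<KeyRange> chunks, long targetBytes) {
    if (targetBytes <= 0) {
      throw new IllegalArgumentException("targetBytes must be positive");
    }
    List<KeyRange> splits = new ArrayList<>();
    String start = null;
    String end = null;
    long accumulated = 0;
    for (KeyRange chunk : chunks) {
      if (start == null) {
        start = chunk.startKey;  // open a new split at this chunk
      }
      end = chunk.endKey;
      accumulated += chunk.sizeBytes;
      if (accumulated >= targetBytes) {
        splits.add(new KeyRange(start, end, accumulated));
        start = null;
        accumulated = 0;
      }
    }
    if (start != null) {
      splits.add(new KeyRange(start, end, accumulated));  // trailing remainder
    }
    return splits;
  }

  public static void main(String[] args) {
    // One tablet described as four chunks with made-up size estimates.
    List<KeyRange> chunks = Arrays.asList(
        new KeyRange("", "c", 40),
        new KeyRange("c", "h", 70),
        new KeyRange("h", "p", 30),
        new KeyRange("p", "", 90));
    for (KeyRange split : splitBySize(chunks, 100)) {
      System.out.println(split);
    }
  }
}
```

With the made-up input above and a target of 100 bytes, the single tablet yields two splits instead of one, so two tasks could scan it in parallel. A real input format would map each resulting range to its own input split rather than printing it.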