You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@impala.apache.org by "Thomas Tauber-Marshall (JIRA)" <ji...@apache.org> on 2017/04/24 22:58:04 UTC
[jira] [Created] (IMPALA-5255) Support representing multilevel
partitioning schemes
Thomas Tauber-Marshall created IMPALA-5255:
----------------------------------------------
Summary: Support representing multilevel partitioning schemes
Key: IMPALA-5255
URL: https://issues.apache.org/jira/browse/IMPALA-5255
Project: IMPALA
Issue Type: Improvement
Components: Frontend
Affects Versions: Impala 2.9.0
Reporter: Thomas Tauber-Marshall
Currently, our DataPartition functionality allows only for a single partition specification (e.g. HASH and the corresponding columns).
Kudu allows tables to have multilevel partitioning schemes (eg. first HASH then RANGE). In the short term, we plan to represent Kudu partitioning schemes with a new TPartitionType of KUDU along with the id of the table to take the partitioning scheme from.
In the long run, we should allow for representing these multilevel partitioning schemes directly. This will allow us to make smarter choices about when to repartition data, e.g. during a grouping aggregation if the input is from Kudu and already partitioned correctly.
--
This message was sent by Atlassian JIRA
(v6.3.15#6346)