You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@impala.apache.org by "Thomas Tauber-Marshall (JIRA)" <ji...@apache.org> on 2017/04/24 22:58:04 UTC

[jira] [Created] (IMPALA-5255) Support representing multilevel partitioning schemes

Thomas Tauber-Marshall created IMPALA-5255:
----------------------------------------------

             Summary: Support representing multilevel partitioning schemes
                 Key: IMPALA-5255
                 URL: https://issues.apache.org/jira/browse/IMPALA-5255
             Project: IMPALA
          Issue Type: Improvement
          Components: Frontend
    Affects Versions: Impala 2.9.0
            Reporter: Thomas Tauber-Marshall


Currently, our DataPartition functionality allows only for a single partition specification (e.g. HASH and the corresponding columns).

Kudu allows tables to have multilevel partitioning schemes (eg. first HASH then RANGE). In the short term, we plan to represent Kudu partitioning schemes with a new TPartitionType of KUDU along with the id of the table to take the partitioning scheme from.

In the long run, we should allow for representing these multilevel partitioning schemes directly. This will allow us to make smarter choices about when to repartition data, e.g. during a grouping aggregation if the input is from Kudu and already partitioned correctly.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)