You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@impala.apache.org by "Thomas Tauber-Marshall (JIRA)" <ji...@apache.org> on 2017/12/18 17:47:00 UTC

[jira] [Resolved] (IMPALA-6297) Remove partition/sort from Kudu INSERT for unpartitioned tables

     [ https://issues.apache.org/jira/browse/IMPALA-6297?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Thomas Tauber-Marshall resolved IMPALA-6297.
--------------------------------------------
       Resolution: Fixed
    Fix Version/s: Impala 2.12.0

commit 3f1f7063935f8a1217ced0bb15c0f33f7f4931b6
Author: Thomas Tauber-Marshall <tm...@cloudera.com>
Date:   Fri Dec 8 14:34:56 2017 -0800

    IMPALA-6297: Don't partition/sort for DML on unpartitioned Kudu table
    
    Impala partitions and sorts rows according to the target table's
    partitioning scheme before inserting them into Kudu in order to
    improve the performance of large inserts.
    
    A recent change added the ability to create unpartitioned Kudu tables,
    but Impala still does the partitioning/sorting for them even though
    its wasted work.
    
    This patch modifies the planner to not add the partition/sort for Kudu
    inserts if the table is unpartitioned, unless the clustered/shuffle
    hints are used.
    
    It also removes the exchange in the case where the partition exprs are
    all constant.
    
    Testing:
    - Added planner tests for inserting into an unpartitioned Kudu table,
      with and without hints, and for when the partition exprs are
      constant.
    - Ran the existing correctness tests for inserts into unpartitioned
      Kudu tables in kudu_create.test
    
    Change-Id: I3e01a7dd5284767a25df3218656746a5d0ee4632
    Reviewed-on: http://gerrit.cloudera.org:8080/8810
    Reviewed-by: Alex Behm <al...@cloudera.com>
    Tested-by: Impala Public Jenkins

> Remove partition/sort from Kudu INSERT for unpartitioned tables
> ---------------------------------------------------------------
>
>                 Key: IMPALA-6297
>                 URL: https://issues.apache.org/jira/browse/IMPALA-6297
>             Project: IMPALA
>          Issue Type: Bug
>          Components: Frontend
>    Affects Versions: Impala 2.11.0
>            Reporter: Thomas Tauber-Marshall
>            Assignee: Thomas Tauber-Marshall
>             Fix For: Impala 2.12.0
>
>
> Impala partitions and sorts rows according to the target table's partitioning scheme before inserting them into Kudu to improve the performance of large inserts.
> IMPALA-5546 added the ability to create unpartitioned Kudu tables.We still add the partition and sort when inserting into these tables even though its just wasted work.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)