You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@impala.apache.org by "Baike Xia (Jira)" <ji...@apache.org> on 2022/09/21 03:00:00 UTC

[jira] [Created] (IMPALA-11600) Support Bucketed Table And Related Optimizations

Baike Xia created IMPALA-11600:
----------------------------------

             Summary: Support Bucketed Table And Related Optimizations
                 Key: IMPALA-11600
                 URL: https://issues.apache.org/jira/browse/IMPALA-11600
             Project: IMPALA
          Issue Type: New Feature
          Components: Backend, Distributed Exec, Frontend
            Reporter: Baike Xia


In Hive, we can create bucket tables, divide data in fine-grained ways, and publish data to different files based on bucket columns. Like this, we can make specific optimizations to the Query to speed up the Query. 

I think it would be exciting for Impala to have support for bucket table creation and related optimizations.

The following document is a design document that supports the creation of bucket tables. If you are interested,  welcome to give some suggestions.

[Support Bucketed Table And Related Optimizations|https://docs.google.com/document/d/1-hvGK-Ng-GtPqxbgB7rTPfrkCtYLVFDehn9ybL-mGUc/edit#heading=h.3y9ae6d7rbnq]

 



--
This message was sent by Atlassian Jira
(v8.20.10#820010)