Posted to issues@impala.apache.org by "Baike Xia (Jira)" <ji...@apache.org> on 2022/09/21 03:00:00 UTC
[jira] [Created] (IMPALA-11600) Support Bucketed Table And Related Optimizations
Baike Xia created IMPALA-11600:
----------------------------------
Summary: Support Bucketed Table And Related Optimizations
Key: IMPALA-11600
URL: https://issues.apache.org/jira/browse/IMPALA-11600
Project: IMPALA
Issue Type: New Feature
Components: Backend, Distributed Exec, Frontend
Reporter: Baike Xia
In Hive, we can create bucketed tables, which divide data at a finer granularity by writing rows to different files based on the bucket columns. With this layout, specific optimizations can be applied to speed up queries.
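As a rough illustration of the idea above (not Hive's or Impala's actual implementation), the sketch below assigns rows to buckets by hashing the bucket column and taking the result modulo the bucket count. The function names, the `user_id` column, and the use of `zlib.crc32` as the hash are all assumptions made for the example; Hive uses its own hash function.

```python
from collections import defaultdict
import zlib


def bucket_for(value, num_buckets):
    """Map a bucket-column value to a bucket index.

    Hypothetical hash: crc32 stands in for the engine's real hash function.
    It is deterministic across runs, unlike Python's built-in hash() for str.
    """
    return zlib.crc32(str(value).encode("utf-8")) % num_buckets


def write_bucketed(rows, bucket_column, num_buckets):
    """Group rows into per-bucket 'files' (plain lists here), keyed by bucket index."""
    buckets = defaultdict(list)
    for row in rows:
        buckets[bucket_for(row[bucket_column], num_buckets)].append(row)
    return buckets


def lookup(files, bucket_column, value, num_buckets):
    """Point lookup: scan only the single bucket that can contain the value."""
    candidates = files.get(bucket_for(value, num_buckets), [])
    return [r for r in candidates if r[bucket_column] == value]


# Illustrative data: eight rows spread over four buckets by user_id.
rows = [{"user_id": i, "amount": i * 10} for i in range(8)]
files = write_bucketed(rows, "user_id", num_buckets=4)
match = lookup(files, "user_id", 5, num_buckets=4)
```

Because every row with the same bucket-column value lands in the same bucket, an equality predicate on that column only needs to read one bucket, which is the kind of optimization the proposal has in mind.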
I think it would be exciting for Impala to support bucketed table creation and the related optimizations.
The following design document describes support for creating bucketed tables. If you are interested, suggestions are welcome.
[Support Bucketed Table And Related Optimizations|https://docs.google.com/document/d/1-hvGK-Ng-GtPqxbgB7rTPfrkCtYLVFDehn9ybL-mGUc/edit#heading=h.3y9ae6d7rbnq]
--
This message was sent by Atlassian Jira
(v8.20.10#820010)