You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@hive.apache.org by "Gang Tim Liu (JIRA)" <ji...@apache.org> on 2012/05/31 20:57:23 UTC
[jira] [Created] (HIVE-3073) Hive List Bucketing - DML support
(single column/manual load)
Gang Tim Liu created HIVE-3073:
----------------------------------
Summary: Hive List Bucketing - DML support (single column/manual load)
Key: HIVE-3073
URL: https://issues.apache.org/jira/browse/HIVE-3073
Project: Hive
Issue Type: New Feature
Components: SQL
Reporter: Gang Tim Liu
Assignee: Gang Tim Liu
If a hive table column has skewed keys, query performance on non-skewed key is always impacted. Hive List Bucketing feature will address it:
https://cwiki.apache.org/Hive/listbucketing.html
This jira issue will track DML change for the feature:
1. single skewed column
2. manual load data
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-3073) Hive List Bucketing - DML support
Posted by "Gang Tim Liu (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/HIVE-3073?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Gang Tim Liu updated HIVE-3073:
-------------------------------
Attachment: HIVE-3073.patch.12
> Hive List Bucketing - DML support
> ----------------------------------
>
> Key: HIVE-3073
> URL: https://issues.apache.org/jira/browse/HIVE-3073
> Project: Hive
> Issue Type: New Feature
> Components: SQL
> Reporter: Gang Tim Liu
> Assignee: Gang Tim Liu
> Attachments: HIVE-3073.patch.12
>
>
> If a hive table column has skewed keys, query performance on non-skewed key is always impacted. Hive List Bucketing feature will address it:
> https://cwiki.apache.org/Hive/listbucketing.html
> This jira issue will track DML change for the feature:
> 1. single skewed column
> 2. manual load data
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-3073) Hive List Bucketing - DML support
Posted by "Gang Tim Liu (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/HIVE-3073?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Gang Tim Liu updated HIVE-3073:
-------------------------------
Attachment: HIVE-3073.patch.24
> Hive List Bucketing - DML support
> ----------------------------------
>
> Key: HIVE-3073
> URL: https://issues.apache.org/jira/browse/HIVE-3073
> Project: Hive
> Issue Type: New Feature
> Components: SQL
> Affects Versions: 0.10.0
> Reporter: Gang Tim Liu
> Assignee: Gang Tim Liu
> Attachments: HIVE-3073.patch.12, HIVE-3073.patch.13, HIVE-3073.patch.15, HIVE-3073.patch.18, HIVE-3073.patch.19, HIVE-3073.patch.21, HIVE-3073.patch.22, HIVE-3073.patch.24
>
>
> If a hive table column has skewed keys, query performance on non-skewed key is always impacted. Hive List Bucketing feature will address it:
> https://cwiki.apache.org/Hive/listbucketing.html
> This jira issue will track DML change for the feature:
> 1. single skewed column
> 2. manual load data
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-3073) Hive List Bucketing - DML support
Posted by "Gang Tim Liu (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/HIVE-3073?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Gang Tim Liu updated HIVE-3073:
-------------------------------
Status: Patch Available (was: Open)
Another patch. thanks
> Hive List Bucketing - DML support
> ----------------------------------
>
> Key: HIVE-3073
> URL: https://issues.apache.org/jira/browse/HIVE-3073
> Project: Hive
> Issue Type: New Feature
> Components: SQL
> Affects Versions: 0.10.0
> Reporter: Gang Tim Liu
> Assignee: Gang Tim Liu
> Attachments: HIVE-3073.patch.12, HIVE-3073.patch.13, HIVE-3073.patch.15
>
>
> If a hive table column has skewed keys, query performance on non-skewed key is always impacted. Hive List Bucketing feature will address it:
> https://cwiki.apache.org/Hive/listbucketing.html
> This jira issue will track DML change for the feature:
> 1. single skewed column
> 2. manual load data
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-3073) Hive List Bucketing - DML support
Posted by "Gang Tim Liu (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/HIVE-3073?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Gang Tim Liu updated HIVE-3073:
-------------------------------
Attachment: HIVE-3073.patch.18
> Hive List Bucketing - DML support
> ----------------------------------
>
> Key: HIVE-3073
> URL: https://issues.apache.org/jira/browse/HIVE-3073
> Project: Hive
> Issue Type: New Feature
> Components: SQL
> Affects Versions: 0.10.0
> Reporter: Gang Tim Liu
> Assignee: Gang Tim Liu
> Attachments: HIVE-3073.patch.12, HIVE-3073.patch.13, HIVE-3073.patch.15, HIVE-3073.patch.18
>
>
> If a hive table column has skewed keys, query performance on non-skewed key is always impacted. Hive List Bucketing feature will address it:
> https://cwiki.apache.org/Hive/listbucketing.html
> This jira issue will track DML change for the feature:
> 1. single skewed column
> 2. manual load data
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-3073) Hive List Bucketing - DML support
Posted by "Gang Tim Liu (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/HIVE-3073?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Gang Tim Liu updated HIVE-3073:
-------------------------------
Attachment: HIVE-3073.patch.15
> Hive List Bucketing - DML support
> ----------------------------------
>
> Key: HIVE-3073
> URL: https://issues.apache.org/jira/browse/HIVE-3073
> Project: Hive
> Issue Type: New Feature
> Components: SQL
> Affects Versions: 0.10.0
> Reporter: Gang Tim Liu
> Assignee: Gang Tim Liu
> Attachments: HIVE-3073.patch.12, HIVE-3073.patch.13, HIVE-3073.patch.15
>
>
> If a hive table column has skewed keys, query performance on non-skewed key is always impacted. Hive List Bucketing feature will address it:
> https://cwiki.apache.org/Hive/listbucketing.html
> This jira issue will track DML change for the feature:
> 1. single skewed column
> 2. manual load data
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-3073) Hive List Bucketing - DML support
Posted by "Gang Tim Liu (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/HIVE-3073?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Gang Tim Liu updated HIVE-3073:
-------------------------------
Attachment: HIVE-3073.patch.27
> Hive List Bucketing - DML support
> ----------------------------------
>
> Key: HIVE-3073
> URL: https://issues.apache.org/jira/browse/HIVE-3073
> Project: Hive
> Issue Type: New Feature
> Components: SQL
> Affects Versions: 0.10.0
> Reporter: Gang Tim Liu
> Assignee: Gang Tim Liu
> Attachments: HIVE-3073.patch.12, HIVE-3073.patch.13, HIVE-3073.patch.15, HIVE-3073.patch.18, HIVE-3073.patch.19, HIVE-3073.patch.21, HIVE-3073.patch.22, HIVE-3073.patch.24, HIVE-3073.patch.26, HIVE-3073.patch.27
>
>
> If a hive table column has skewed keys, query performance on non-skewed key is always impacted. Hive List Bucketing feature will address it:
> https://cwiki.apache.org/Hive/listbucketing.html
> This jira issue will track DML change for the feature:
> 1. single skewed column
> 2. manual load data
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Work started] (HIVE-3073) Hive List Bucketing - DML support
Posted by "Gang Tim Liu (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/HIVE-3073?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Work on HIVE-3073 started by Gang Tim Liu.
> Hive List Bucketing - DML support
> ----------------------------------
>
> Key: HIVE-3073
> URL: https://issues.apache.org/jira/browse/HIVE-3073
> Project: Hive
> Issue Type: New Feature
> Components: SQL
> Affects Versions: 0.10.0
> Reporter: Gang Tim Liu
> Assignee: Gang Tim Liu
> Attachments: HIVE-3073.patch.12, HIVE-3073.patch.13, HIVE-3073.patch.15, HIVE-3073.patch.18, HIVE-3073.patch.19, HIVE-3073.patch.21
>
>
> If a hive table column has skewed keys, query performance on non-skewed key is always impacted. Hive List Bucketing feature will address it:
> https://cwiki.apache.org/Hive/listbucketing.html
> This jira issue will track DML change for the feature:
> 1. single skewed column
> 2. manual load data
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Work started] (HIVE-3073) Hive List Bucketing - DML support
Posted by "Gang Tim Liu (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/HIVE-3073?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Work on HIVE-3073 started by Gang Tim Liu.
> Hive List Bucketing - DML support
> ----------------------------------
>
> Key: HIVE-3073
> URL: https://issues.apache.org/jira/browse/HIVE-3073
> Project: Hive
> Issue Type: New Feature
> Components: SQL
> Affects Versions: 0.10.0
> Reporter: Gang Tim Liu
> Assignee: Gang Tim Liu
> Attachments: HIVE-3073.patch.12, HIVE-3073.patch.13, HIVE-3073.patch.15, HIVE-3073.patch.18, HIVE-3073.patch.19, HIVE-3073.patch.21, HIVE-3073.patch.22, HIVE-3073.patch.24, HIVE-3073.patch.26
>
>
> If a hive table column has skewed keys, query performance on non-skewed key is always impacted. Hive List Bucketing feature will address it:
> https://cwiki.apache.org/Hive/listbucketing.html
> This jira issue will track DML change for the feature:
> 1. single skewed column
> 2. manual load data
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-3073) Hive List Bucketing - DML support
Posted by "Gang Tim Liu (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/HIVE-3073?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Gang Tim Liu updated HIVE-3073:
-------------------------------
Attachment: HIVE-3073.patch.26
> Hive List Bucketing - DML support
> ----------------------------------
>
> Key: HIVE-3073
> URL: https://issues.apache.org/jira/browse/HIVE-3073
> Project: Hive
> Issue Type: New Feature
> Components: SQL
> Affects Versions: 0.10.0
> Reporter: Gang Tim Liu
> Assignee: Gang Tim Liu
> Attachments: HIVE-3073.patch.12, HIVE-3073.patch.13, HIVE-3073.patch.15, HIVE-3073.patch.18, HIVE-3073.patch.19, HIVE-3073.patch.21, HIVE-3073.patch.22, HIVE-3073.patch.24, HIVE-3073.patch.26
>
>
> If a hive table column has skewed keys, query performance on non-skewed key is always impacted. Hive List Bucketing feature will address it:
> https://cwiki.apache.org/Hive/listbucketing.html
> This jira issue will track DML change for the feature:
> 1. single skewed column
> 2. manual load data
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HIVE-3073) Hive List Bucketing - DML support
Posted by "Gang Tim Liu (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/HIVE-3073?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13510536#comment-13510536 ]
Gang Tim Liu commented on HIVE-3073:
------------------------------------
Namit thank you so much Tim
> Hive List Bucketing - DML support
> ----------------------------------
>
> Key: HIVE-3073
> URL: https://issues.apache.org/jira/browse/HIVE-3073
> Project: Hive
> Issue Type: New Feature
> Components: SQL
> Affects Versions: 0.10.0
> Reporter: Gang Tim Liu
> Assignee: Gang Tim Liu
> Attachments: HIVE-3073.patch.12, HIVE-3073.patch.13, HIVE-3073.patch.15, HIVE-3073.patch.18, HIVE-3073.patch.19, HIVE-3073.patch.21, HIVE-3073.patch.22, HIVE-3073.patch.24, HIVE-3073.patch.26, HIVE-3073.patch.27
>
>
> If a hive table column has skewed keys, query performance on non-skewed key is always impacted. Hive List Bucketing feature will address it:
> https://cwiki.apache.org/Hive/listbucketing.html
> This jira issue will track DML change for the feature:
> 1. single skewed column
> 2. manual load data
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-3073) Hive List Bucketing - DML support
Posted by "Gang Tim Liu (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/HIVE-3073?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Gang Tim Liu updated HIVE-3073:
-------------------------------
Attachment: HIVE-3073.patch.21
> Hive List Bucketing - DML support
> ----------------------------------
>
> Key: HIVE-3073
> URL: https://issues.apache.org/jira/browse/HIVE-3073
> Project: Hive
> Issue Type: New Feature
> Components: SQL
> Affects Versions: 0.10.0
> Reporter: Gang Tim Liu
> Assignee: Gang Tim Liu
> Attachments: HIVE-3073.patch.12, HIVE-3073.patch.13, HIVE-3073.patch.15, HIVE-3073.patch.18, HIVE-3073.patch.19, HIVE-3073.patch.21
>
>
> If a hive table column has skewed keys, query performance on non-skewed key is always impacted. Hive List Bucketing feature will address it:
> https://cwiki.apache.org/Hive/listbucketing.html
> This jira issue will track DML change for the feature:
> 1. single skewed column
> 2. manual load data
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HIVE-3073) Hive List Bucketing - DML support
(single column/manual load)
Posted by "Gang Tim Liu (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/HIVE-3073?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13287757#comment-13287757 ]
Gang Tim Liu commented on HIVE-3073:
------------------------------------
Hi Carl, yes, i have sent out email to dev@. thank you very much for your reminder. thanks Tim
> Hive List Bucketing - DML support (single column/manual load)
> -------------------------------------------------------------
>
> Key: HIVE-3073
> URL: https://issues.apache.org/jira/browse/HIVE-3073
> Project: Hive
> Issue Type: New Feature
> Components: SQL
> Reporter: Gang Tim Liu
> Assignee: Gang Tim Liu
>
> If a hive table column has skewed keys, query performance on non-skewed key is always impacted. Hive List Bucketing feature will address it:
> https://cwiki.apache.org/Hive/listbucketing.html
> This jira issue will track DML change for the feature:
> 1. single skewed column
> 2. manual load data
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-3073) Hive List Bucketing - DML support
Posted by "Namit Jain (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/HIVE-3073?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Namit Jain updated HIVE-3073:
-----------------------------
Resolution: Fixed
Hadoop Flags: Reviewed
Status: Resolved (was: Patch Available)
Committed. Thanks Tim
> Hive List Bucketing - DML support
> ----------------------------------
>
> Key: HIVE-3073
> URL: https://issues.apache.org/jira/browse/HIVE-3073
> Project: Hive
> Issue Type: New Feature
> Components: SQL
> Affects Versions: 0.10.0
> Reporter: Gang Tim Liu
> Assignee: Gang Tim Liu
> Attachments: HIVE-3073.patch.12, HIVE-3073.patch.13, HIVE-3073.patch.15, HIVE-3073.patch.18, HIVE-3073.patch.19, HIVE-3073.patch.21, HIVE-3073.patch.22, HIVE-3073.patch.24, HIVE-3073.patch.26, HIVE-3073.patch.27
>
>
> If a hive table column has skewed keys, query performance on non-skewed key is always impacted. Hive List Bucketing feature will address it:
> https://cwiki.apache.org/Hive/listbucketing.html
> This jira issue will track DML change for the feature:
> 1. single skewed column
> 2. manual load data
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HIVE-3073) Hive List Bucketing - DML support
Posted by "Hudson (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/HIVE-3073?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13511203#comment-13511203 ]
Hudson commented on HIVE-3073:
------------------------------
Integrated in Hive-trunk-h0.21 #1835 (See [https://builds.apache.org/job/Hive-trunk-h0.21/1835/])
HIVE-3073 Hive List Bucketing - DML support
(Gang Tim Liu via namit) (Revision 1417374)
Result = FAILURE
namit : http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1417374
Files :
* /hive/trunk/common/src/java/org/apache/hadoop/hive/common/FileUtils.java
* /hive/trunk/common/src/java/org/apache/hadoop/hive/conf/HiveConf.java
* /hive/trunk/conf/hive-default.xml.template
* /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/Driver.java
* /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/ErrorMsg.java
* /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/DDLTask.java
* /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/FileSinkOperator.java
* /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/MoveTask.java
* /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/StatsTask.java
* /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/io/rcfile/merge/BlockMergeTask.java
* /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/io/rcfile/merge/MergeWork.java
* /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/io/rcfile/merge/RCFileMergeMapper.java
* /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/metadata/Hive.java
* /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/metadata/Partition.java
* /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/metadata/formatting/MetaDataFormatUtils.java
* /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/optimizer/GenMRFileSink1.java
* /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/optimizer/PrunerOperatorFactory.java
* /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/optimizer/listbucketingpruner/ListBucketingPruner.java
* /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/optimizer/listbucketingpruner/ListBucketingPrunerUtils.java
* /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/AlterTablePartMergeFilesDesc.java
* /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/BaseSemanticAnalyzer.java
* /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/DDLSemanticAnalyzer.java
* /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/SemanticAnalyzer.java
* /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/plan/ConditionalResolverMergeFiles.java
* /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/plan/FileSinkDesc.java
* /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/plan/ListBucketingCtx.java
* /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/plan/LoadTableDesc.java
* /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/plan/MapredWork.java
* /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/plan/PlanUtils.java
* /hive/trunk/ql/src/test/queries/clientnegative/column_change_skewedcol_type1.q
* /hive/trunk/ql/src/test/queries/clientnegative/column_rename5.q
* /hive/trunk/ql/src/test/queries/clientnegative/create_skewed_table_col_name_value_no_mismatch.q
* /hive/trunk/ql/src/test/queries/clientnegative/create_skewed_table_dup_col_name.q
* /hive/trunk/ql/src/test/queries/clientnegative/create_skewed_table_failure_invalid_col_name.q
* /hive/trunk/ql/src/test/queries/clientnegative/invalid_config1.q
* /hive/trunk/ql/src/test/queries/clientnegative/load_stored_as_dirs.q
* /hive/trunk/ql/src/test/queries/clientpositive/alter_skewed_table.q
* /hive/trunk/ql/src/test/queries/clientpositive/create_alter_list_bucketing_table1.q
* /hive/trunk/ql/src/test/queries/clientpositive/create_skewed_table1.q
* /hive/trunk/ql/src/test/queries/clientpositive/list_bucket_dml_1.q
* /hive/trunk/ql/src/test/queries/clientpositive/list_bucket_dml_2.q
* /hive/trunk/ql/src/test/queries/clientpositive/list_bucket_dml_3.q
* /hive/trunk/ql/src/test/queries/clientpositive/list_bucket_dml_4.q
* /hive/trunk/ql/src/test/queries/clientpositive/list_bucket_dml_5.q
* /hive/trunk/ql/src/test/queries/clientpositive/list_bucket_dml_6.q
* /hive/trunk/ql/src/test/queries/clientpositive/list_bucket_dml_7.q
* /hive/trunk/ql/src/test/queries/clientpositive/list_bucket_dml_8.q
* /hive/trunk/ql/src/test/queries/clientpositive/list_bucket_dml_9.q
* /hive/trunk/ql/src/test/queries/clientpositive/list_bucket_query_multiskew_1.q
* /hive/trunk/ql/src/test/queries/clientpositive/list_bucket_query_multiskew_2.q
* /hive/trunk/ql/src/test/queries/clientpositive/list_bucket_query_multiskew_3.q
* /hive/trunk/ql/src/test/queries/clientpositive/list_bucket_query_oneskew_1.q
* /hive/trunk/ql/src/test/queries/clientpositive/list_bucket_query_oneskew_2.q
* /hive/trunk/ql/src/test/queries/clientpositive/list_bucket_query_oneskew_3.q
* /hive/trunk/ql/src/test/results/clientnegative/invalid_config1.q.out
* /hive/trunk/ql/src/test/results/clientpositive/list_bucket_dml_1.q.out
* /hive/trunk/ql/src/test/results/clientpositive/list_bucket_dml_2.q.out
* /hive/trunk/ql/src/test/results/clientpositive/list_bucket_dml_3.q.out
* /hive/trunk/ql/src/test/results/clientpositive/list_bucket_dml_4.q.out
* /hive/trunk/ql/src/test/results/clientpositive/list_bucket_dml_5.q.out
* /hive/trunk/ql/src/test/results/clientpositive/list_bucket_dml_6.q.out
* /hive/trunk/ql/src/test/results/clientpositive/list_bucket_dml_7.q.out
* /hive/trunk/ql/src/test/results/clientpositive/list_bucket_dml_8.q.out
* /hive/trunk/ql/src/test/results/clientpositive/list_bucket_dml_9.q.out
* /hive/trunk/ql/src/test/results/clientpositive/list_bucket_query_multiskew_1.q.out
* /hive/trunk/ql/src/test/results/clientpositive/list_bucket_query_multiskew_2.q.out
* /hive/trunk/ql/src/test/results/clientpositive/list_bucket_query_multiskew_3.q.out
* /hive/trunk/ql/src/test/results/compiler/plan/case_sensitivity.q.xml
* /hive/trunk/ql/src/test/results/compiler/plan/groupby1.q.xml
* /hive/trunk/ql/src/test/results/compiler/plan/input1.q.xml
* /hive/trunk/ql/src/test/results/compiler/plan/input2.q.xml
* /hive/trunk/ql/src/test/results/compiler/plan/input3.q.xml
* /hive/trunk/ql/src/test/results/compiler/plan/input4.q.xml
* /hive/trunk/ql/src/test/results/compiler/plan/input5.q.xml
* /hive/trunk/ql/src/test/results/compiler/plan/input6.q.xml
* /hive/trunk/ql/src/test/results/compiler/plan/input7.q.xml
* /hive/trunk/ql/src/test/results/compiler/plan/input9.q.xml
* /hive/trunk/ql/src/test/results/compiler/plan/input_testsequencefile.q.xml
* /hive/trunk/ql/src/test/results/compiler/plan/join1.q.xml
* /hive/trunk/ql/src/test/results/compiler/plan/join2.q.xml
* /hive/trunk/ql/src/test/results/compiler/plan/join3.q.xml
* /hive/trunk/ql/src/test/results/compiler/plan/sample2.q.xml
* /hive/trunk/ql/src/test/results/compiler/plan/sample3.q.xml
* /hive/trunk/ql/src/test/results/compiler/plan/sample4.q.xml
* /hive/trunk/ql/src/test/results/compiler/plan/sample5.q.xml
* /hive/trunk/ql/src/test/results/compiler/plan/sample6.q.xml
* /hive/trunk/ql/src/test/results/compiler/plan/sample7.q.xml
* /hive/trunk/serde/src/java/org/apache/hadoop/hive/serde2/objectinspector/ObjectInspectorUtils.java
> Hive List Bucketing - DML support
> ----------------------------------
>
> Key: HIVE-3073
> URL: https://issues.apache.org/jira/browse/HIVE-3073
> Project: Hive
> Issue Type: New Feature
> Components: SQL
> Affects Versions: 0.10.0
> Reporter: Gang Tim Liu
> Assignee: Gang Tim Liu
> Attachments: HIVE-3073.patch.12, HIVE-3073.patch.13, HIVE-3073.patch.15, HIVE-3073.patch.18, HIVE-3073.patch.19, HIVE-3073.patch.21, HIVE-3073.patch.22, HIVE-3073.patch.24, HIVE-3073.patch.26, HIVE-3073.patch.27
>
>
> If a hive table column has skewed keys, query performance on non-skewed key is always impacted. Hive List Bucketing feature will address it:
> https://cwiki.apache.org/Hive/listbucketing.html
> This jira issue will track DML change for the feature:
> 1. single skewed column
> 2. manual load data
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-3073) Hive List Bucketing - DML support
Posted by "Namit Jain (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/HIVE-3073?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Namit Jain updated HIVE-3073:
-----------------------------
Status: Open (was: Patch Available)
minor comments
> Hive List Bucketing - DML support
> ----------------------------------
>
> Key: HIVE-3073
> URL: https://issues.apache.org/jira/browse/HIVE-3073
> Project: Hive
> Issue Type: New Feature
> Components: SQL
> Affects Versions: 0.10.0
> Reporter: Gang Tim Liu
> Assignee: Gang Tim Liu
> Attachments: HIVE-3073.patch.12, HIVE-3073.patch.13, HIVE-3073.patch.15, HIVE-3073.patch.18, HIVE-3073.patch.19, HIVE-3073.patch.21, HIVE-3073.patch.22, HIVE-3073.patch.24
>
>
> If a hive table column has skewed keys, query performance on non-skewed key is always impacted. Hive List Bucketing feature will address it:
> https://cwiki.apache.org/Hive/listbucketing.html
> This jira issue will track DML change for the feature:
> 1. single skewed column
> 2. manual load data
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-3073) Hive List Bucketing - DML support
Posted by "Gang Tim Liu (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/HIVE-3073?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Gang Tim Liu updated HIVE-3073:
-------------------------------
Attachment: HIVE-3073.patch.13
> Hive List Bucketing - DML support
> ----------------------------------
>
> Key: HIVE-3073
> URL: https://issues.apache.org/jira/browse/HIVE-3073
> Project: Hive
> Issue Type: New Feature
> Components: SQL
> Affects Versions: 0.10.0
> Reporter: Gang Tim Liu
> Assignee: Gang Tim Liu
> Attachments: HIVE-3073.patch.12, HIVE-3073.patch.13
>
>
> If a hive table column has skewed keys, query performance on non-skewed key is always impacted. Hive List Bucketing feature will address it:
> https://cwiki.apache.org/Hive/listbucketing.html
> This jira issue will track DML change for the feature:
> 1. single skewed column
> 2. manual load data
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-3073) Hive List Bucketing - DML support
Posted by "Gang Tim Liu (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/HIVE-3073?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Gang Tim Liu updated HIVE-3073:
-------------------------------
Attachment: HIVE-3073.patch.22
> Hive List Bucketing - DML support
> ----------------------------------
>
> Key: HIVE-3073
> URL: https://issues.apache.org/jira/browse/HIVE-3073
> Project: Hive
> Issue Type: New Feature
> Components: SQL
> Affects Versions: 0.10.0
> Reporter: Gang Tim Liu
> Assignee: Gang Tim Liu
> Attachments: HIVE-3073.patch.12, HIVE-3073.patch.13, HIVE-3073.patch.15, HIVE-3073.patch.18, HIVE-3073.patch.19, HIVE-3073.patch.21, HIVE-3073.patch.22
>
>
> If a hive table column has skewed keys, query performance on non-skewed key is always impacted. Hive List Bucketing feature will address it:
> https://cwiki.apache.org/Hive/listbucketing.html
> This jira issue will track DML change for the feature:
> 1. single skewed column
> 2. manual load data
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-3073) Hive List Bucketing - DML support
Posted by "Gang Tim Liu (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/HIVE-3073?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Gang Tim Liu updated HIVE-3073:
-------------------------------
Status: Patch Available (was: In Progress)
patch is available on both phabricator and attachment.
> Hive List Bucketing - DML support
> ----------------------------------
>
> Key: HIVE-3073
> URL: https://issues.apache.org/jira/browse/HIVE-3073
> Project: Hive
> Issue Type: New Feature
> Components: SQL
> Affects Versions: 0.10.0
> Reporter: Gang Tim Liu
> Assignee: Gang Tim Liu
> Attachments: HIVE-3073.patch.12, HIVE-3073.patch.13, HIVE-3073.patch.15, HIVE-3073.patch.18, HIVE-3073.patch.19, HIVE-3073.patch.21, HIVE-3073.patch.22, HIVE-3073.patch.24, HIVE-3073.patch.26
>
>
> If a hive table column has skewed keys, query performance on non-skewed key is always impacted. Hive List Bucketing feature will address it:
> https://cwiki.apache.org/Hive/listbucketing.html
> This jira issue will track DML change for the feature:
> 1. single skewed column
> 2. manual load data
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-3073) Hive List Bucketing - DML support
Posted by "Namit Jain (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/HIVE-3073?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Namit Jain updated HIVE-3073:
-----------------------------
Status: Open (was: Patch Available)
comments
> Hive List Bucketing - DML support
> ----------------------------------
>
> Key: HIVE-3073
> URL: https://issues.apache.org/jira/browse/HIVE-3073
> Project: Hive
> Issue Type: New Feature
> Components: SQL
> Affects Versions: 0.10.0
> Reporter: Gang Tim Liu
> Assignee: Gang Tim Liu
> Attachments: HIVE-3073.patch.12, HIVE-3073.patch.13, HIVE-3073.patch.15
>
>
> If a hive table column has skewed keys, query performance on non-skewed key is always impacted. Hive List Bucketing feature will address it:
> https://cwiki.apache.org/Hive/listbucketing.html
> This jira issue will track DML change for the feature:
> 1. single skewed column
> 2. manual load data
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HIVE-3073) Hive List Bucketing - DML support
Posted by "Hudson (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/HIVE-3073?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13510875#comment-13510875 ]
Hudson commented on HIVE-3073:
------------------------------
Integrated in Hive-trunk-h0.21 #1834 (See [https://builds.apache.org/job/Hive-trunk-h0.21/1834/])
HIVE-3073 Hive List Bucketing - DML support
(Gang Tim Liu via namit) (Revision 1417374)
Result = ABORTED
namit : http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1417374
Files :
* /hive/trunk/common/src/java/org/apache/hadoop/hive/common/FileUtils.java
* /hive/trunk/common/src/java/org/apache/hadoop/hive/conf/HiveConf.java
* /hive/trunk/conf/hive-default.xml.template
* /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/Driver.java
* /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/ErrorMsg.java
* /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/DDLTask.java
* /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/FileSinkOperator.java
* /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/MoveTask.java
* /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/StatsTask.java
* /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/io/rcfile/merge/BlockMergeTask.java
* /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/io/rcfile/merge/MergeWork.java
* /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/io/rcfile/merge/RCFileMergeMapper.java
* /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/metadata/Hive.java
* /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/metadata/Partition.java
* /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/metadata/formatting/MetaDataFormatUtils.java
* /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/optimizer/GenMRFileSink1.java
* /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/optimizer/PrunerOperatorFactory.java
* /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/optimizer/listbucketingpruner/ListBucketingPruner.java
* /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/optimizer/listbucketingpruner/ListBucketingPrunerUtils.java
* /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/AlterTablePartMergeFilesDesc.java
* /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/BaseSemanticAnalyzer.java
* /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/DDLSemanticAnalyzer.java
* /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/SemanticAnalyzer.java
* /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/plan/ConditionalResolverMergeFiles.java
* /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/plan/FileSinkDesc.java
* /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/plan/ListBucketingCtx.java
* /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/plan/LoadTableDesc.java
* /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/plan/MapredWork.java
* /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/plan/PlanUtils.java
* /hive/trunk/ql/src/test/queries/clientnegative/column_change_skewedcol_type1.q
* /hive/trunk/ql/src/test/queries/clientnegative/column_rename5.q
* /hive/trunk/ql/src/test/queries/clientnegative/create_skewed_table_col_name_value_no_mismatch.q
* /hive/trunk/ql/src/test/queries/clientnegative/create_skewed_table_dup_col_name.q
* /hive/trunk/ql/src/test/queries/clientnegative/create_skewed_table_failure_invalid_col_name.q
* /hive/trunk/ql/src/test/queries/clientnegative/invalid_config1.q
* /hive/trunk/ql/src/test/queries/clientnegative/load_stored_as_dirs.q
* /hive/trunk/ql/src/test/queries/clientpositive/alter_skewed_table.q
* /hive/trunk/ql/src/test/queries/clientpositive/create_alter_list_bucketing_table1.q
* /hive/trunk/ql/src/test/queries/clientpositive/create_skewed_table1.q
* /hive/trunk/ql/src/test/queries/clientpositive/list_bucket_dml_1.q
* /hive/trunk/ql/src/test/queries/clientpositive/list_bucket_dml_2.q
* /hive/trunk/ql/src/test/queries/clientpositive/list_bucket_dml_3.q
* /hive/trunk/ql/src/test/queries/clientpositive/list_bucket_dml_4.q
* /hive/trunk/ql/src/test/queries/clientpositive/list_bucket_dml_5.q
* /hive/trunk/ql/src/test/queries/clientpositive/list_bucket_dml_6.q
* /hive/trunk/ql/src/test/queries/clientpositive/list_bucket_dml_7.q
* /hive/trunk/ql/src/test/queries/clientpositive/list_bucket_dml_8.q
* /hive/trunk/ql/src/test/queries/clientpositive/list_bucket_dml_9.q
* /hive/trunk/ql/src/test/queries/clientpositive/list_bucket_query_multiskew_1.q
* /hive/trunk/ql/src/test/queries/clientpositive/list_bucket_query_multiskew_2.q
* /hive/trunk/ql/src/test/queries/clientpositive/list_bucket_query_multiskew_3.q
* /hive/trunk/ql/src/test/queries/clientpositive/list_bucket_query_oneskew_1.q
* /hive/trunk/ql/src/test/queries/clientpositive/list_bucket_query_oneskew_2.q
* /hive/trunk/ql/src/test/queries/clientpositive/list_bucket_query_oneskew_3.q
* /hive/trunk/ql/src/test/results/clientnegative/invalid_config1.q.out
* /hive/trunk/ql/src/test/results/clientpositive/list_bucket_dml_1.q.out
* /hive/trunk/ql/src/test/results/clientpositive/list_bucket_dml_2.q.out
* /hive/trunk/ql/src/test/results/clientpositive/list_bucket_dml_3.q.out
* /hive/trunk/ql/src/test/results/clientpositive/list_bucket_dml_4.q.out
* /hive/trunk/ql/src/test/results/clientpositive/list_bucket_dml_5.q.out
* /hive/trunk/ql/src/test/results/clientpositive/list_bucket_dml_6.q.out
* /hive/trunk/ql/src/test/results/clientpositive/list_bucket_dml_7.q.out
* /hive/trunk/ql/src/test/results/clientpositive/list_bucket_dml_8.q.out
* /hive/trunk/ql/src/test/results/clientpositive/list_bucket_dml_9.q.out
* /hive/trunk/ql/src/test/results/clientpositive/list_bucket_query_multiskew_1.q.out
* /hive/trunk/ql/src/test/results/clientpositive/list_bucket_query_multiskew_2.q.out
* /hive/trunk/ql/src/test/results/clientpositive/list_bucket_query_multiskew_3.q.out
* /hive/trunk/ql/src/test/results/compiler/plan/case_sensitivity.q.xml
* /hive/trunk/ql/src/test/results/compiler/plan/groupby1.q.xml
* /hive/trunk/ql/src/test/results/compiler/plan/input1.q.xml
* /hive/trunk/ql/src/test/results/compiler/plan/input2.q.xml
* /hive/trunk/ql/src/test/results/compiler/plan/input3.q.xml
* /hive/trunk/ql/src/test/results/compiler/plan/input4.q.xml
* /hive/trunk/ql/src/test/results/compiler/plan/input5.q.xml
* /hive/trunk/ql/src/test/results/compiler/plan/input6.q.xml
* /hive/trunk/ql/src/test/results/compiler/plan/input7.q.xml
* /hive/trunk/ql/src/test/results/compiler/plan/input9.q.xml
* /hive/trunk/ql/src/test/results/compiler/plan/input_testsequencefile.q.xml
* /hive/trunk/ql/src/test/results/compiler/plan/join1.q.xml
* /hive/trunk/ql/src/test/results/compiler/plan/join2.q.xml
* /hive/trunk/ql/src/test/results/compiler/plan/join3.q.xml
* /hive/trunk/ql/src/test/results/compiler/plan/sample2.q.xml
* /hive/trunk/ql/src/test/results/compiler/plan/sample3.q.xml
* /hive/trunk/ql/src/test/results/compiler/plan/sample4.q.xml
* /hive/trunk/ql/src/test/results/compiler/plan/sample5.q.xml
* /hive/trunk/ql/src/test/results/compiler/plan/sample6.q.xml
* /hive/trunk/ql/src/test/results/compiler/plan/sample7.q.xml
* /hive/trunk/serde/src/java/org/apache/hadoop/hive/serde2/objectinspector/ObjectInspectorUtils.java
> Hive List Bucketing - DML support
> ----------------------------------
>
> Key: HIVE-3073
> URL: https://issues.apache.org/jira/browse/HIVE-3073
> Project: Hive
> Issue Type: New Feature
> Components: SQL
> Affects Versions: 0.10.0
> Reporter: Gang Tim Liu
> Assignee: Gang Tim Liu
> Attachments: HIVE-3073.patch.12, HIVE-3073.patch.13, HIVE-3073.patch.15, HIVE-3073.patch.18, HIVE-3073.patch.19, HIVE-3073.patch.21, HIVE-3073.patch.22, HIVE-3073.patch.24, HIVE-3073.patch.26, HIVE-3073.patch.27
>
>
> If a hive table column has skewed keys, query performance on non-skewed key is always impacted. Hive List Bucketing feature will address it:
> https://cwiki.apache.org/Hive/listbucketing.html
> This jira issue will track DML change for the feature:
> 1. single skewed column
> 2. manual load data
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-3073) Hive List Bucketing - DML support
Posted by "Gang Tim Liu (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/HIVE-3073?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Gang Tim Liu updated HIVE-3073:
-------------------------------
Status: Patch Available (was: Open)
patch is available.
> Hive List Bucketing - DML support
> ----------------------------------
>
> Key: HIVE-3073
> URL: https://issues.apache.org/jira/browse/HIVE-3073
> Project: Hive
> Issue Type: New Feature
> Components: SQL
> Affects Versions: 0.10.0
> Reporter: Gang Tim Liu
> Assignee: Gang Tim Liu
> Attachments: HIVE-3073.patch.12, HIVE-3073.patch.13, HIVE-3073.patch.15, HIVE-3073.patch.18
>
>
> If a hive table column has skewed keys, query performance on non-skewed key is always impacted. Hive List Bucketing feature will address it:
> https://cwiki.apache.org/Hive/listbucketing.html
> This jira issue will track DML change for the feature:
> 1. single skewed column
> 2. manual load data
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-3073) Hive List Bucketing - DML support
Posted by "Gang Tim Liu (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/HIVE-3073?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Gang Tim Liu updated HIVE-3073:
-------------------------------
Status: Patch Available (was: In Progress)
patch is available.
> Hive List Bucketing - DML support
> ----------------------------------
>
> Key: HIVE-3073
> URL: https://issues.apache.org/jira/browse/HIVE-3073
> Project: Hive
> Issue Type: New Feature
> Components: SQL
> Affects Versions: 0.10.0
> Reporter: Gang Tim Liu
> Assignee: Gang Tim Liu
> Attachments: HIVE-3073.patch.12, HIVE-3073.patch.13, HIVE-3073.patch.15, HIVE-3073.patch.18, HIVE-3073.patch.19, HIVE-3073.patch.21
>
>
> If a hive table column has skewed keys, query performance on non-skewed key is always impacted. Hive List Bucketing feature will address it:
> https://cwiki.apache.org/Hive/listbucketing.html
> This jira issue will track DML change for the feature:
> 1. single skewed column
> 2. manual load data
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-3073) Hive List Bucketing - DML support
Posted by "Namit Jain (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/HIVE-3073?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Namit Jain updated HIVE-3073:
-----------------------------
Status: Open (was: Patch Available)
comments on phabricator
> Hive List Bucketing - DML support
> ----------------------------------
>
> Key: HIVE-3073
> URL: https://issues.apache.org/jira/browse/HIVE-3073
> Project: Hive
> Issue Type: New Feature
> Components: SQL
> Affects Versions: 0.10.0
> Reporter: Gang Tim Liu
> Assignee: Gang Tim Liu
> Attachments: HIVE-3073.patch.12, HIVE-3073.patch.13, HIVE-3073.patch.15, HIVE-3073.patch.18, HIVE-3073.patch.19
>
>
> If a hive table column has skewed keys, query performance on non-skewed key is always impacted. Hive List Bucketing feature will address it:
> https://cwiki.apache.org/Hive/listbucketing.html
> This jira issue will track DML change for the feature:
> 1. single skewed column
> 2. manual load data
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-3073) Hive List Bucketing - DML support
Posted by "Gang Tim Liu (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/HIVE-3073?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Gang Tim Liu updated HIVE-3073:
-------------------------------
Attachment: HIVE-3073.patch.19
> Hive List Bucketing - DML support
> ----------------------------------
>
> Key: HIVE-3073
> URL: https://issues.apache.org/jira/browse/HIVE-3073
> Project: Hive
> Issue Type: New Feature
> Components: SQL
> Affects Versions: 0.10.0
> Reporter: Gang Tim Liu
> Assignee: Gang Tim Liu
> Attachments: HIVE-3073.patch.12, HIVE-3073.patch.13, HIVE-3073.patch.15, HIVE-3073.patch.18, HIVE-3073.patch.19
>
>
> If a hive table column has skewed keys, query performance on non-skewed key is always impacted. Hive List Bucketing feature will address it:
> https://cwiki.apache.org/Hive/listbucketing.html
> This jira issue will track DML change for the feature:
> 1. single skewed column
> 2. manual load data
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Work started] (HIVE-3073) Hive List Bucketing - DML support
Posted by "Gang Tim Liu (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/HIVE-3073?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Work on HIVE-3073 started by Gang Tim Liu.
> Hive List Bucketing - DML support
> ----------------------------------
>
> Key: HIVE-3073
> URL: https://issues.apache.org/jira/browse/HIVE-3073
> Project: Hive
> Issue Type: New Feature
> Components: SQL
> Affects Versions: 0.10.0
> Reporter: Gang Tim Liu
> Assignee: Gang Tim Liu
> Attachments: HIVE-3073.patch.12, HIVE-3073.patch.13, HIVE-3073.patch.15, HIVE-3073.patch.18, HIVE-3073.patch.19, HIVE-3073.patch.21, HIVE-3073.patch.22, HIVE-3073.patch.24
>
>
> If a hive table column has skewed keys, query performance on non-skewed key is always impacted. Hive List Bucketing feature will address it:
> https://cwiki.apache.org/Hive/listbucketing.html
> This jira issue will track DML change for the feature:
> 1. single skewed column
> 2. manual load data
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-3073) Hive List Bucketing - DML support
Posted by "Namit Jain (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/HIVE-3073?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Namit Jain updated HIVE-3073:
-----------------------------
Status: Open (was: Patch Available)
comments
> Hive List Bucketing - DML support
> ----------------------------------
>
> Key: HIVE-3073
> URL: https://issues.apache.org/jira/browse/HIVE-3073
> Project: Hive
> Issue Type: New Feature
> Components: SQL
> Affects Versions: 0.10.0
> Reporter: Gang Tim Liu
> Assignee: Gang Tim Liu
> Attachments: HIVE-3073.patch.12, HIVE-3073.patch.13
>
>
> If a hive table column has skewed keys, query performance on non-skewed key is always impacted. Hive List Bucketing feature will address it:
> https://cwiki.apache.org/Hive/listbucketing.html
> This jira issue will track DML change for the feature:
> 1. single skewed column
> 2. manual load data
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HIVE-3073) Hive List Bucketing - DML support
(single column/manual load)
Posted by "Carl Steinbach (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/HIVE-3073?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13287077#comment-13287077 ]
Carl Steinbach commented on HIVE-3073:
--------------------------------------
Please send an email to dev@hive letting people know that this proposal is ready for review. Thanks!
> Hive List Bucketing - DML support (single column/manual load)
> -------------------------------------------------------------
>
> Key: HIVE-3073
> URL: https://issues.apache.org/jira/browse/HIVE-3073
> Project: Hive
> Issue Type: New Feature
> Components: SQL
> Reporter: Gang Tim Liu
> Assignee: Gang Tim Liu
>
> If a hive table column has skewed keys, query performance on non-skewed key is always impacted. Hive List Bucketing feature will address it:
> https://cwiki.apache.org/Hive/listbucketing.html
> This jira issue will track DML change for the feature:
> 1. single skewed column
> 2. manual load data
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HIVE-3073) Hive List Bucketing - DML support
Posted by "Namit Jain (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/HIVE-3073?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13510366#comment-13510366 ]
Namit Jain commented on HIVE-3073:
----------------------------------
+1
looks good, running tests
> Hive List Bucketing - DML support
> ----------------------------------
>
> Key: HIVE-3073
> URL: https://issues.apache.org/jira/browse/HIVE-3073
> Project: Hive
> Issue Type: New Feature
> Components: SQL
> Affects Versions: 0.10.0
> Reporter: Gang Tim Liu
> Assignee: Gang Tim Liu
> Attachments: HIVE-3073.patch.12, HIVE-3073.patch.13, HIVE-3073.patch.15, HIVE-3073.patch.18, HIVE-3073.patch.19, HIVE-3073.patch.21, HIVE-3073.patch.22, HIVE-3073.patch.24, HIVE-3073.patch.26, HIVE-3073.patch.27
>
>
> If a hive table column has skewed keys, query performance on non-skewed key is always impacted. Hive List Bucketing feature will address it:
> https://cwiki.apache.org/Hive/listbucketing.html
> This jira issue will track DML change for the feature:
> 1. single skewed column
> 2. manual load data
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-3073) Hive List Bucketing - DML support
Posted by "Gang Tim Liu (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/HIVE-3073?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Gang Tim Liu updated HIVE-3073:
-------------------------------
Affects Version/s: 0.10.0
Status: Patch Available (was: Open)
https://reviews.facebook.net/D6021
> Hive List Bucketing - DML support
> ----------------------------------
>
> Key: HIVE-3073
> URL: https://issues.apache.org/jira/browse/HIVE-3073
> Project: Hive
> Issue Type: New Feature
> Components: SQL
> Affects Versions: 0.10.0
> Reporter: Gang Tim Liu
> Assignee: Gang Tim Liu
> Attachments: HIVE-3073.patch.12
>
>
> If a hive table column has skewed keys, query performance on non-skewed key is always impacted. Hive List Bucketing feature will address it:
> https://cwiki.apache.org/Hive/listbucketing.html
> This jira issue will track DML change for the feature:
> 1. single skewed column
> 2. manual load data
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-3073) Hive List Bucketing - DML support
Posted by "Gang Tim Liu (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/HIVE-3073?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Gang Tim Liu updated HIVE-3073:
-------------------------------
Status: Patch Available (was: In Progress)
patch is available. thanks
> Hive List Bucketing - DML support
> ----------------------------------
>
> Key: HIVE-3073
> URL: https://issues.apache.org/jira/browse/HIVE-3073
> Project: Hive
> Issue Type: New Feature
> Components: SQL
> Affects Versions: 0.10.0
> Reporter: Gang Tim Liu
> Assignee: Gang Tim Liu
> Attachments: HIVE-3073.patch.12, HIVE-3073.patch.13, HIVE-3073.patch.15, HIVE-3073.patch.18, HIVE-3073.patch.19, HIVE-3073.patch.21, HIVE-3073.patch.22, HIVE-3073.patch.24
>
>
> If a hive table column has skewed keys, query performance on non-skewed key is always impacted. Hive List Bucketing feature will address it:
> https://cwiki.apache.org/Hive/listbucketing.html
> This jira issue will track DML change for the feature:
> 1. single skewed column
> 2. manual load data
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-3073) Hive List Bucketing - DML support
Posted by "Gang Tim Liu (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/HIVE-3073?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Gang Tim Liu updated HIVE-3073:
-------------------------------
Summary: Hive List Bucketing - DML support (was: Hive List Bucketing - DML support (single column/manual load))
> Hive List Bucketing - DML support
> ----------------------------------
>
> Key: HIVE-3073
> URL: https://issues.apache.org/jira/browse/HIVE-3073
> Project: Hive
> Issue Type: New Feature
> Components: SQL
> Reporter: Gang Tim Liu
> Assignee: Gang Tim Liu
>
> If a hive table column has skewed keys, query performance on non-skewed key is always impacted. Hive List Bucketing feature will address it:
> https://cwiki.apache.org/Hive/listbucketing.html
> This jira issue will track DML change for the feature:
> 1. single skewed column
> 2. manual load data
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-3073) Hive List Bucketing - DML support
Posted by "Namit Jain (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/HIVE-3073?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Namit Jain updated HIVE-3073:
-----------------------------
Status: Open (was: Patch Available)
comments on phabricator
> Hive List Bucketing - DML support
> ----------------------------------
>
> Key: HIVE-3073
> URL: https://issues.apache.org/jira/browse/HIVE-3073
> Project: Hive
> Issue Type: New Feature
> Components: SQL
> Affects Versions: 0.10.0
> Reporter: Gang Tim Liu
> Assignee: Gang Tim Liu
> Attachments: HIVE-3073.patch.12, HIVE-3073.patch.13, HIVE-3073.patch.15, HIVE-3073.patch.18, HIVE-3073.patch.19, HIVE-3073.patch.21, HIVE-3073.patch.22
>
>
> If a hive table column has skewed keys, query performance on non-skewed key is always impacted. Hive List Bucketing feature will address it:
> https://cwiki.apache.org/Hive/listbucketing.html
> This jira issue will track DML change for the feature:
> 1. single skewed column
> 2. manual load data
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira