You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@hive.apache.org by "Gang Tim Liu (JIRA)" <ji...@apache.org> on 2012/05/31 20:57:23 UTC

[jira] [Created] (HIVE-3073) Hive List Bucketing - DML support (single column/manual load)

Gang Tim Liu created HIVE-3073:
----------------------------------

             Summary: Hive List Bucketing - DML support (single column/manual load)
                 Key: HIVE-3073
                 URL: https://issues.apache.org/jira/browse/HIVE-3073
             Project: Hive
          Issue Type: New Feature
          Components: SQL
            Reporter: Gang Tim Liu
            Assignee: Gang Tim Liu


If a hive table column has skewed keys, query performance on non-skewed key is always impacted. Hive List Bucketing feature will address it:

https://cwiki.apache.org/Hive/listbucketing.html

This jira issue will track DML change for the feature:
1. single skewed column
2. manual load data


--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Updated] (HIVE-3073) Hive List Bucketing - DML support

Posted by "Gang Tim Liu (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/HIVE-3073?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Gang Tim Liu updated HIVE-3073:
-------------------------------

    Attachment: HIVE-3073.patch.12
    
> Hive List Bucketing - DML support 
> ----------------------------------
>
>                 Key: HIVE-3073
>                 URL: https://issues.apache.org/jira/browse/HIVE-3073
>             Project: Hive
>          Issue Type: New Feature
>          Components: SQL
>            Reporter: Gang Tim Liu
>            Assignee: Gang Tim Liu
>         Attachments: HIVE-3073.patch.12
>
>
> If a hive table column has skewed keys, query performance on non-skewed key is always impacted. Hive List Bucketing feature will address it:
> https://cwiki.apache.org/Hive/listbucketing.html
> This jira issue will track DML change for the feature:
> 1. single skewed column
> 2. manual load data

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Updated] (HIVE-3073) Hive List Bucketing - DML support

Posted by "Gang Tim Liu (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/HIVE-3073?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Gang Tim Liu updated HIVE-3073:
-------------------------------

    Attachment: HIVE-3073.patch.24
    
> Hive List Bucketing - DML support 
> ----------------------------------
>
>                 Key: HIVE-3073
>                 URL: https://issues.apache.org/jira/browse/HIVE-3073
>             Project: Hive
>          Issue Type: New Feature
>          Components: SQL
>    Affects Versions: 0.10.0
>            Reporter: Gang Tim Liu
>            Assignee: Gang Tim Liu
>         Attachments: HIVE-3073.patch.12, HIVE-3073.patch.13, HIVE-3073.patch.15, HIVE-3073.patch.18, HIVE-3073.patch.19, HIVE-3073.patch.21, HIVE-3073.patch.22, HIVE-3073.patch.24
>
>
> If a hive table column has skewed keys, query performance on non-skewed key is always impacted. Hive List Bucketing feature will address it:
> https://cwiki.apache.org/Hive/listbucketing.html
> This jira issue will track DML change for the feature:
> 1. single skewed column
> 2. manual load data

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Updated] (HIVE-3073) Hive List Bucketing - DML support

Posted by "Gang Tim Liu (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/HIVE-3073?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Gang Tim Liu updated HIVE-3073:
-------------------------------

    Status: Patch Available  (was: Open)

Another patch. thanks
                
> Hive List Bucketing - DML support 
> ----------------------------------
>
>                 Key: HIVE-3073
>                 URL: https://issues.apache.org/jira/browse/HIVE-3073
>             Project: Hive
>          Issue Type: New Feature
>          Components: SQL
>    Affects Versions: 0.10.0
>            Reporter: Gang Tim Liu
>            Assignee: Gang Tim Liu
>         Attachments: HIVE-3073.patch.12, HIVE-3073.patch.13, HIVE-3073.patch.15
>
>
> If a hive table column has skewed keys, query performance on non-skewed key is always impacted. Hive List Bucketing feature will address it:
> https://cwiki.apache.org/Hive/listbucketing.html
> This jira issue will track DML change for the feature:
> 1. single skewed column
> 2. manual load data

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Updated] (HIVE-3073) Hive List Bucketing - DML support

Posted by "Gang Tim Liu (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/HIVE-3073?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Gang Tim Liu updated HIVE-3073:
-------------------------------

    Attachment: HIVE-3073.patch.18
    
> Hive List Bucketing - DML support 
> ----------------------------------
>
>                 Key: HIVE-3073
>                 URL: https://issues.apache.org/jira/browse/HIVE-3073
>             Project: Hive
>          Issue Type: New Feature
>          Components: SQL
>    Affects Versions: 0.10.0
>            Reporter: Gang Tim Liu
>            Assignee: Gang Tim Liu
>         Attachments: HIVE-3073.patch.12, HIVE-3073.patch.13, HIVE-3073.patch.15, HIVE-3073.patch.18
>
>
> If a hive table column has skewed keys, query performance on non-skewed key is always impacted. Hive List Bucketing feature will address it:
> https://cwiki.apache.org/Hive/listbucketing.html
> This jira issue will track DML change for the feature:
> 1. single skewed column
> 2. manual load data

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Updated] (HIVE-3073) Hive List Bucketing - DML support

Posted by "Gang Tim Liu (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/HIVE-3073?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Gang Tim Liu updated HIVE-3073:
-------------------------------

    Attachment: HIVE-3073.patch.15
    
> Hive List Bucketing - DML support 
> ----------------------------------
>
>                 Key: HIVE-3073
>                 URL: https://issues.apache.org/jira/browse/HIVE-3073
>             Project: Hive
>          Issue Type: New Feature
>          Components: SQL
>    Affects Versions: 0.10.0
>            Reporter: Gang Tim Liu
>            Assignee: Gang Tim Liu
>         Attachments: HIVE-3073.patch.12, HIVE-3073.patch.13, HIVE-3073.patch.15
>
>
> If a hive table column has skewed keys, query performance on non-skewed key is always impacted. Hive List Bucketing feature will address it:
> https://cwiki.apache.org/Hive/listbucketing.html
> This jira issue will track DML change for the feature:
> 1. single skewed column
> 2. manual load data

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Updated] (HIVE-3073) Hive List Bucketing - DML support

Posted by "Gang Tim Liu (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/HIVE-3073?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Gang Tim Liu updated HIVE-3073:
-------------------------------

    Attachment: HIVE-3073.patch.27
    
> Hive List Bucketing - DML support 
> ----------------------------------
>
>                 Key: HIVE-3073
>                 URL: https://issues.apache.org/jira/browse/HIVE-3073
>             Project: Hive
>          Issue Type: New Feature
>          Components: SQL
>    Affects Versions: 0.10.0
>            Reporter: Gang Tim Liu
>            Assignee: Gang Tim Liu
>         Attachments: HIVE-3073.patch.12, HIVE-3073.patch.13, HIVE-3073.patch.15, HIVE-3073.patch.18, HIVE-3073.patch.19, HIVE-3073.patch.21, HIVE-3073.patch.22, HIVE-3073.patch.24, HIVE-3073.patch.26, HIVE-3073.patch.27
>
>
> If a hive table column has skewed keys, query performance on non-skewed key is always impacted. Hive List Bucketing feature will address it:
> https://cwiki.apache.org/Hive/listbucketing.html
> This jira issue will track DML change for the feature:
> 1. single skewed column
> 2. manual load data

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Work started] (HIVE-3073) Hive List Bucketing - DML support

Posted by "Gang Tim Liu (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/HIVE-3073?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Work on HIVE-3073 started by Gang Tim Liu.

> Hive List Bucketing - DML support 
> ----------------------------------
>
>                 Key: HIVE-3073
>                 URL: https://issues.apache.org/jira/browse/HIVE-3073
>             Project: Hive
>          Issue Type: New Feature
>          Components: SQL
>    Affects Versions: 0.10.0
>            Reporter: Gang Tim Liu
>            Assignee: Gang Tim Liu
>         Attachments: HIVE-3073.patch.12, HIVE-3073.patch.13, HIVE-3073.patch.15, HIVE-3073.patch.18, HIVE-3073.patch.19, HIVE-3073.patch.21
>
>
> If a hive table column has skewed keys, query performance on non-skewed key is always impacted. Hive List Bucketing feature will address it:
> https://cwiki.apache.org/Hive/listbucketing.html
> This jira issue will track DML change for the feature:
> 1. single skewed column
> 2. manual load data

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Work started] (HIVE-3073) Hive List Bucketing - DML support

Posted by "Gang Tim Liu (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/HIVE-3073?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Work on HIVE-3073 started by Gang Tim Liu.

> Hive List Bucketing - DML support 
> ----------------------------------
>
>                 Key: HIVE-3073
>                 URL: https://issues.apache.org/jira/browse/HIVE-3073
>             Project: Hive
>          Issue Type: New Feature
>          Components: SQL
>    Affects Versions: 0.10.0
>            Reporter: Gang Tim Liu
>            Assignee: Gang Tim Liu
>         Attachments: HIVE-3073.patch.12, HIVE-3073.patch.13, HIVE-3073.patch.15, HIVE-3073.patch.18, HIVE-3073.patch.19, HIVE-3073.patch.21, HIVE-3073.patch.22, HIVE-3073.patch.24, HIVE-3073.patch.26
>
>
> If a hive table column has skewed keys, query performance on non-skewed key is always impacted. Hive List Bucketing feature will address it:
> https://cwiki.apache.org/Hive/listbucketing.html
> This jira issue will track DML change for the feature:
> 1. single skewed column
> 2. manual load data

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Updated] (HIVE-3073) Hive List Bucketing - DML support

Posted by "Gang Tim Liu (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/HIVE-3073?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Gang Tim Liu updated HIVE-3073:
-------------------------------

    Attachment: HIVE-3073.patch.26
    
> Hive List Bucketing - DML support 
> ----------------------------------
>
>                 Key: HIVE-3073
>                 URL: https://issues.apache.org/jira/browse/HIVE-3073
>             Project: Hive
>          Issue Type: New Feature
>          Components: SQL
>    Affects Versions: 0.10.0
>            Reporter: Gang Tim Liu
>            Assignee: Gang Tim Liu
>         Attachments: HIVE-3073.patch.12, HIVE-3073.patch.13, HIVE-3073.patch.15, HIVE-3073.patch.18, HIVE-3073.patch.19, HIVE-3073.patch.21, HIVE-3073.patch.22, HIVE-3073.patch.24, HIVE-3073.patch.26
>
>
> If a hive table column has skewed keys, query performance on non-skewed key is always impacted. Hive List Bucketing feature will address it:
> https://cwiki.apache.org/Hive/listbucketing.html
> This jira issue will track DML change for the feature:
> 1. single skewed column
> 2. manual load data

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Commented] (HIVE-3073) Hive List Bucketing - DML support

Posted by "Gang Tim Liu (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/HIVE-3073?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13510536#comment-13510536 ] 

Gang Tim Liu commented on HIVE-3073:
------------------------------------

Namit thank you so much Tim



                
> Hive List Bucketing - DML support 
> ----------------------------------
>
>                 Key: HIVE-3073
>                 URL: https://issues.apache.org/jira/browse/HIVE-3073
>             Project: Hive
>          Issue Type: New Feature
>          Components: SQL
>    Affects Versions: 0.10.0
>            Reporter: Gang Tim Liu
>            Assignee: Gang Tim Liu
>         Attachments: HIVE-3073.patch.12, HIVE-3073.patch.13, HIVE-3073.patch.15, HIVE-3073.patch.18, HIVE-3073.patch.19, HIVE-3073.patch.21, HIVE-3073.patch.22, HIVE-3073.patch.24, HIVE-3073.patch.26, HIVE-3073.patch.27
>
>
> If a hive table column has skewed keys, query performance on non-skewed key is always impacted. Hive List Bucketing feature will address it:
> https://cwiki.apache.org/Hive/listbucketing.html
> This jira issue will track DML change for the feature:
> 1. single skewed column
> 2. manual load data

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Updated] (HIVE-3073) Hive List Bucketing - DML support

Posted by "Gang Tim Liu (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/HIVE-3073?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Gang Tim Liu updated HIVE-3073:
-------------------------------

    Attachment: HIVE-3073.patch.21
    
> Hive List Bucketing - DML support 
> ----------------------------------
>
>                 Key: HIVE-3073
>                 URL: https://issues.apache.org/jira/browse/HIVE-3073
>             Project: Hive
>          Issue Type: New Feature
>          Components: SQL
>    Affects Versions: 0.10.0
>            Reporter: Gang Tim Liu
>            Assignee: Gang Tim Liu
>         Attachments: HIVE-3073.patch.12, HIVE-3073.patch.13, HIVE-3073.patch.15, HIVE-3073.patch.18, HIVE-3073.patch.19, HIVE-3073.patch.21
>
>
> If a hive table column has skewed keys, query performance on non-skewed key is always impacted. Hive List Bucketing feature will address it:
> https://cwiki.apache.org/Hive/listbucketing.html
> This jira issue will track DML change for the feature:
> 1. single skewed column
> 2. manual load data

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Commented] (HIVE-3073) Hive List Bucketing - DML support (single column/manual load)

Posted by "Gang Tim Liu (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/HIVE-3073?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13287757#comment-13287757 ] 

Gang Tim Liu commented on HIVE-3073:
------------------------------------

Hi Carl, yes, i have sent out email to dev@. thank you very much for your reminder. thanks Tim 
                
> Hive List Bucketing - DML support (single column/manual load)
> -------------------------------------------------------------
>
>                 Key: HIVE-3073
>                 URL: https://issues.apache.org/jira/browse/HIVE-3073
>             Project: Hive
>          Issue Type: New Feature
>          Components: SQL
>            Reporter: Gang Tim Liu
>            Assignee: Gang Tim Liu
>
> If a hive table column has skewed keys, query performance on non-skewed key is always impacted. Hive List Bucketing feature will address it:
> https://cwiki.apache.org/Hive/listbucketing.html
> This jira issue will track DML change for the feature:
> 1. single skewed column
> 2. manual load data

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Updated] (HIVE-3073) Hive List Bucketing - DML support

Posted by "Namit Jain (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/HIVE-3073?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Namit Jain updated HIVE-3073:
-----------------------------

      Resolution: Fixed
    Hadoop Flags: Reviewed
          Status: Resolved  (was: Patch Available)

Committed. Thanks Tim
                
> Hive List Bucketing - DML support 
> ----------------------------------
>
>                 Key: HIVE-3073
>                 URL: https://issues.apache.org/jira/browse/HIVE-3073
>             Project: Hive
>          Issue Type: New Feature
>          Components: SQL
>    Affects Versions: 0.10.0
>            Reporter: Gang Tim Liu
>            Assignee: Gang Tim Liu
>         Attachments: HIVE-3073.patch.12, HIVE-3073.patch.13, HIVE-3073.patch.15, HIVE-3073.patch.18, HIVE-3073.patch.19, HIVE-3073.patch.21, HIVE-3073.patch.22, HIVE-3073.patch.24, HIVE-3073.patch.26, HIVE-3073.patch.27
>
>
> If a hive table column has skewed keys, query performance on non-skewed key is always impacted. Hive List Bucketing feature will address it:
> https://cwiki.apache.org/Hive/listbucketing.html
> This jira issue will track DML change for the feature:
> 1. single skewed column
> 2. manual load data

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Commented] (HIVE-3073) Hive List Bucketing - DML support

Posted by "Hudson (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/HIVE-3073?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13511203#comment-13511203 ] 

Hudson commented on HIVE-3073:
------------------------------

Integrated in Hive-trunk-h0.21 #1835 (See [https://builds.apache.org/job/Hive-trunk-h0.21/1835/])
    HIVE-3073 Hive List Bucketing - DML support
(Gang Tim Liu via namit) (Revision 1417374)

     Result = FAILURE
namit : http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1417374
Files : 
* /hive/trunk/common/src/java/org/apache/hadoop/hive/common/FileUtils.java
* /hive/trunk/common/src/java/org/apache/hadoop/hive/conf/HiveConf.java
* /hive/trunk/conf/hive-default.xml.template
* /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/Driver.java
* /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/ErrorMsg.java
* /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/DDLTask.java
* /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/FileSinkOperator.java
* /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/MoveTask.java
* /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/StatsTask.java
* /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/io/rcfile/merge/BlockMergeTask.java
* /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/io/rcfile/merge/MergeWork.java
* /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/io/rcfile/merge/RCFileMergeMapper.java
* /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/metadata/Hive.java
* /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/metadata/Partition.java
* /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/metadata/formatting/MetaDataFormatUtils.java
* /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/optimizer/GenMRFileSink1.java
* /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/optimizer/PrunerOperatorFactory.java
* /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/optimizer/listbucketingpruner/ListBucketingPruner.java
* /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/optimizer/listbucketingpruner/ListBucketingPrunerUtils.java
* /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/AlterTablePartMergeFilesDesc.java
* /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/BaseSemanticAnalyzer.java
* /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/DDLSemanticAnalyzer.java
* /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/SemanticAnalyzer.java
* /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/plan/ConditionalResolverMergeFiles.java
* /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/plan/FileSinkDesc.java
* /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/plan/ListBucketingCtx.java
* /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/plan/LoadTableDesc.java
* /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/plan/MapredWork.java
* /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/plan/PlanUtils.java
* /hive/trunk/ql/src/test/queries/clientnegative/column_change_skewedcol_type1.q
* /hive/trunk/ql/src/test/queries/clientnegative/column_rename5.q
* /hive/trunk/ql/src/test/queries/clientnegative/create_skewed_table_col_name_value_no_mismatch.q
* /hive/trunk/ql/src/test/queries/clientnegative/create_skewed_table_dup_col_name.q
* /hive/trunk/ql/src/test/queries/clientnegative/create_skewed_table_failure_invalid_col_name.q
* /hive/trunk/ql/src/test/queries/clientnegative/invalid_config1.q
* /hive/trunk/ql/src/test/queries/clientnegative/load_stored_as_dirs.q
* /hive/trunk/ql/src/test/queries/clientpositive/alter_skewed_table.q
* /hive/trunk/ql/src/test/queries/clientpositive/create_alter_list_bucketing_table1.q
* /hive/trunk/ql/src/test/queries/clientpositive/create_skewed_table1.q
* /hive/trunk/ql/src/test/queries/clientpositive/list_bucket_dml_1.q
* /hive/trunk/ql/src/test/queries/clientpositive/list_bucket_dml_2.q
* /hive/trunk/ql/src/test/queries/clientpositive/list_bucket_dml_3.q
* /hive/trunk/ql/src/test/queries/clientpositive/list_bucket_dml_4.q
* /hive/trunk/ql/src/test/queries/clientpositive/list_bucket_dml_5.q
* /hive/trunk/ql/src/test/queries/clientpositive/list_bucket_dml_6.q
* /hive/trunk/ql/src/test/queries/clientpositive/list_bucket_dml_7.q
* /hive/trunk/ql/src/test/queries/clientpositive/list_bucket_dml_8.q
* /hive/trunk/ql/src/test/queries/clientpositive/list_bucket_dml_9.q
* /hive/trunk/ql/src/test/queries/clientpositive/list_bucket_query_multiskew_1.q
* /hive/trunk/ql/src/test/queries/clientpositive/list_bucket_query_multiskew_2.q
* /hive/trunk/ql/src/test/queries/clientpositive/list_bucket_query_multiskew_3.q
* /hive/trunk/ql/src/test/queries/clientpositive/list_bucket_query_oneskew_1.q
* /hive/trunk/ql/src/test/queries/clientpositive/list_bucket_query_oneskew_2.q
* /hive/trunk/ql/src/test/queries/clientpositive/list_bucket_query_oneskew_3.q
* /hive/trunk/ql/src/test/results/clientnegative/invalid_config1.q.out
* /hive/trunk/ql/src/test/results/clientpositive/list_bucket_dml_1.q.out
* /hive/trunk/ql/src/test/results/clientpositive/list_bucket_dml_2.q.out
* /hive/trunk/ql/src/test/results/clientpositive/list_bucket_dml_3.q.out
* /hive/trunk/ql/src/test/results/clientpositive/list_bucket_dml_4.q.out
* /hive/trunk/ql/src/test/results/clientpositive/list_bucket_dml_5.q.out
* /hive/trunk/ql/src/test/results/clientpositive/list_bucket_dml_6.q.out
* /hive/trunk/ql/src/test/results/clientpositive/list_bucket_dml_7.q.out
* /hive/trunk/ql/src/test/results/clientpositive/list_bucket_dml_8.q.out
* /hive/trunk/ql/src/test/results/clientpositive/list_bucket_dml_9.q.out
* /hive/trunk/ql/src/test/results/clientpositive/list_bucket_query_multiskew_1.q.out
* /hive/trunk/ql/src/test/results/clientpositive/list_bucket_query_multiskew_2.q.out
* /hive/trunk/ql/src/test/results/clientpositive/list_bucket_query_multiskew_3.q.out
* /hive/trunk/ql/src/test/results/compiler/plan/case_sensitivity.q.xml
* /hive/trunk/ql/src/test/results/compiler/plan/groupby1.q.xml
* /hive/trunk/ql/src/test/results/compiler/plan/input1.q.xml
* /hive/trunk/ql/src/test/results/compiler/plan/input2.q.xml
* /hive/trunk/ql/src/test/results/compiler/plan/input3.q.xml
* /hive/trunk/ql/src/test/results/compiler/plan/input4.q.xml
* /hive/trunk/ql/src/test/results/compiler/plan/input5.q.xml
* /hive/trunk/ql/src/test/results/compiler/plan/input6.q.xml
* /hive/trunk/ql/src/test/results/compiler/plan/input7.q.xml
* /hive/trunk/ql/src/test/results/compiler/plan/input9.q.xml
* /hive/trunk/ql/src/test/results/compiler/plan/input_testsequencefile.q.xml
* /hive/trunk/ql/src/test/results/compiler/plan/join1.q.xml
* /hive/trunk/ql/src/test/results/compiler/plan/join2.q.xml
* /hive/trunk/ql/src/test/results/compiler/plan/join3.q.xml
* /hive/trunk/ql/src/test/results/compiler/plan/sample2.q.xml
* /hive/trunk/ql/src/test/results/compiler/plan/sample3.q.xml
* /hive/trunk/ql/src/test/results/compiler/plan/sample4.q.xml
* /hive/trunk/ql/src/test/results/compiler/plan/sample5.q.xml
* /hive/trunk/ql/src/test/results/compiler/plan/sample6.q.xml
* /hive/trunk/ql/src/test/results/compiler/plan/sample7.q.xml
* /hive/trunk/serde/src/java/org/apache/hadoop/hive/serde2/objectinspector/ObjectInspectorUtils.java

                
> Hive List Bucketing - DML support 
> ----------------------------------
>
>                 Key: HIVE-3073
>                 URL: https://issues.apache.org/jira/browse/HIVE-3073
>             Project: Hive
>          Issue Type: New Feature
>          Components: SQL
>    Affects Versions: 0.10.0
>            Reporter: Gang Tim Liu
>            Assignee: Gang Tim Liu
>         Attachments: HIVE-3073.patch.12, HIVE-3073.patch.13, HIVE-3073.patch.15, HIVE-3073.patch.18, HIVE-3073.patch.19, HIVE-3073.patch.21, HIVE-3073.patch.22, HIVE-3073.patch.24, HIVE-3073.patch.26, HIVE-3073.patch.27
>
>
> If a hive table column has skewed keys, query performance on non-skewed key is always impacted. Hive List Bucketing feature will address it:
> https://cwiki.apache.org/Hive/listbucketing.html
> This jira issue will track DML change for the feature:
> 1. single skewed column
> 2. manual load data

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Updated] (HIVE-3073) Hive List Bucketing - DML support

Posted by "Namit Jain (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/HIVE-3073?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Namit Jain updated HIVE-3073:
-----------------------------

    Status: Open  (was: Patch Available)

minor comments
                
> Hive List Bucketing - DML support 
> ----------------------------------
>
>                 Key: HIVE-3073
>                 URL: https://issues.apache.org/jira/browse/HIVE-3073
>             Project: Hive
>          Issue Type: New Feature
>          Components: SQL
>    Affects Versions: 0.10.0
>            Reporter: Gang Tim Liu
>            Assignee: Gang Tim Liu
>         Attachments: HIVE-3073.patch.12, HIVE-3073.patch.13, HIVE-3073.patch.15, HIVE-3073.patch.18, HIVE-3073.patch.19, HIVE-3073.patch.21, HIVE-3073.patch.22, HIVE-3073.patch.24
>
>
> If a hive table column has skewed keys, query performance on non-skewed key is always impacted. Hive List Bucketing feature will address it:
> https://cwiki.apache.org/Hive/listbucketing.html
> This jira issue will track DML change for the feature:
> 1. single skewed column
> 2. manual load data

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Updated] (HIVE-3073) Hive List Bucketing - DML support

Posted by "Gang Tim Liu (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/HIVE-3073?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Gang Tim Liu updated HIVE-3073:
-------------------------------

    Attachment: HIVE-3073.patch.13
    
> Hive List Bucketing - DML support 
> ----------------------------------
>
>                 Key: HIVE-3073
>                 URL: https://issues.apache.org/jira/browse/HIVE-3073
>             Project: Hive
>          Issue Type: New Feature
>          Components: SQL
>    Affects Versions: 0.10.0
>            Reporter: Gang Tim Liu
>            Assignee: Gang Tim Liu
>         Attachments: HIVE-3073.patch.12, HIVE-3073.patch.13
>
>
> If a hive table column has skewed keys, query performance on non-skewed key is always impacted. Hive List Bucketing feature will address it:
> https://cwiki.apache.org/Hive/listbucketing.html
> This jira issue will track DML change for the feature:
> 1. single skewed column
> 2. manual load data

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Updated] (HIVE-3073) Hive List Bucketing - DML support

Posted by "Gang Tim Liu (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/HIVE-3073?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Gang Tim Liu updated HIVE-3073:
-------------------------------

    Attachment: HIVE-3073.patch.22
    
> Hive List Bucketing - DML support 
> ----------------------------------
>
>                 Key: HIVE-3073
>                 URL: https://issues.apache.org/jira/browse/HIVE-3073
>             Project: Hive
>          Issue Type: New Feature
>          Components: SQL
>    Affects Versions: 0.10.0
>            Reporter: Gang Tim Liu
>            Assignee: Gang Tim Liu
>         Attachments: HIVE-3073.patch.12, HIVE-3073.patch.13, HIVE-3073.patch.15, HIVE-3073.patch.18, HIVE-3073.patch.19, HIVE-3073.patch.21, HIVE-3073.patch.22
>
>
> If a hive table column has skewed keys, query performance on non-skewed key is always impacted. Hive List Bucketing feature will address it:
> https://cwiki.apache.org/Hive/listbucketing.html
> This jira issue will track DML change for the feature:
> 1. single skewed column
> 2. manual load data

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Updated] (HIVE-3073) Hive List Bucketing - DML support

Posted by "Gang Tim Liu (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/HIVE-3073?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Gang Tim Liu updated HIVE-3073:
-------------------------------

    Status: Patch Available  (was: In Progress)

patch is available on both phabricator and attachment.
                
> Hive List Bucketing - DML support 
> ----------------------------------
>
>                 Key: HIVE-3073
>                 URL: https://issues.apache.org/jira/browse/HIVE-3073
>             Project: Hive
>          Issue Type: New Feature
>          Components: SQL
>    Affects Versions: 0.10.0
>            Reporter: Gang Tim Liu
>            Assignee: Gang Tim Liu
>         Attachments: HIVE-3073.patch.12, HIVE-3073.patch.13, HIVE-3073.patch.15, HIVE-3073.patch.18, HIVE-3073.patch.19, HIVE-3073.patch.21, HIVE-3073.patch.22, HIVE-3073.patch.24, HIVE-3073.patch.26
>
>
> If a hive table column has skewed keys, query performance on non-skewed key is always impacted. Hive List Bucketing feature will address it:
> https://cwiki.apache.org/Hive/listbucketing.html
> This jira issue will track DML change for the feature:
> 1. single skewed column
> 2. manual load data

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Updated] (HIVE-3073) Hive List Bucketing - DML support

Posted by "Namit Jain (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/HIVE-3073?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Namit Jain updated HIVE-3073:
-----------------------------

    Status: Open  (was: Patch Available)

comments
                
> Hive List Bucketing - DML support 
> ----------------------------------
>
>                 Key: HIVE-3073
>                 URL: https://issues.apache.org/jira/browse/HIVE-3073
>             Project: Hive
>          Issue Type: New Feature
>          Components: SQL
>    Affects Versions: 0.10.0
>            Reporter: Gang Tim Liu
>            Assignee: Gang Tim Liu
>         Attachments: HIVE-3073.patch.12, HIVE-3073.patch.13, HIVE-3073.patch.15
>
>
> If a hive table column has skewed keys, query performance on non-skewed key is always impacted. Hive List Bucketing feature will address it:
> https://cwiki.apache.org/Hive/listbucketing.html
> This jira issue will track DML change for the feature:
> 1. single skewed column
> 2. manual load data

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Commented] (HIVE-3073) Hive List Bucketing - DML support

Posted by "Hudson (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/HIVE-3073?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13510875#comment-13510875 ] 

Hudson commented on HIVE-3073:
------------------------------

Integrated in Hive-trunk-h0.21 #1834 (See [https://builds.apache.org/job/Hive-trunk-h0.21/1834/])
    HIVE-3073 Hive List Bucketing - DML support
(Gang Tim Liu via namit) (Revision 1417374)

     Result = ABORTED
namit : http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1417374
Files : 
* /hive/trunk/common/src/java/org/apache/hadoop/hive/common/FileUtils.java
* /hive/trunk/common/src/java/org/apache/hadoop/hive/conf/HiveConf.java
* /hive/trunk/conf/hive-default.xml.template
* /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/Driver.java
* /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/ErrorMsg.java
* /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/DDLTask.java
* /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/FileSinkOperator.java
* /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/MoveTask.java
* /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/StatsTask.java
* /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/io/rcfile/merge/BlockMergeTask.java
* /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/io/rcfile/merge/MergeWork.java
* /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/io/rcfile/merge/RCFileMergeMapper.java
* /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/metadata/Hive.java
* /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/metadata/Partition.java
* /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/metadata/formatting/MetaDataFormatUtils.java
* /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/optimizer/GenMRFileSink1.java
* /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/optimizer/PrunerOperatorFactory.java
* /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/optimizer/listbucketingpruner/ListBucketingPruner.java
* /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/optimizer/listbucketingpruner/ListBucketingPrunerUtils.java
* /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/AlterTablePartMergeFilesDesc.java
* /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/BaseSemanticAnalyzer.java
* /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/DDLSemanticAnalyzer.java
* /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/SemanticAnalyzer.java
* /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/plan/ConditionalResolverMergeFiles.java
* /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/plan/FileSinkDesc.java
* /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/plan/ListBucketingCtx.java
* /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/plan/LoadTableDesc.java
* /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/plan/MapredWork.java
* /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/plan/PlanUtils.java
* /hive/trunk/ql/src/test/queries/clientnegative/column_change_skewedcol_type1.q
* /hive/trunk/ql/src/test/queries/clientnegative/column_rename5.q
* /hive/trunk/ql/src/test/queries/clientnegative/create_skewed_table_col_name_value_no_mismatch.q
* /hive/trunk/ql/src/test/queries/clientnegative/create_skewed_table_dup_col_name.q
* /hive/trunk/ql/src/test/queries/clientnegative/create_skewed_table_failure_invalid_col_name.q
* /hive/trunk/ql/src/test/queries/clientnegative/invalid_config1.q
* /hive/trunk/ql/src/test/queries/clientnegative/load_stored_as_dirs.q
* /hive/trunk/ql/src/test/queries/clientpositive/alter_skewed_table.q
* /hive/trunk/ql/src/test/queries/clientpositive/create_alter_list_bucketing_table1.q
* /hive/trunk/ql/src/test/queries/clientpositive/create_skewed_table1.q
* /hive/trunk/ql/src/test/queries/clientpositive/list_bucket_dml_1.q
* /hive/trunk/ql/src/test/queries/clientpositive/list_bucket_dml_2.q
* /hive/trunk/ql/src/test/queries/clientpositive/list_bucket_dml_3.q
* /hive/trunk/ql/src/test/queries/clientpositive/list_bucket_dml_4.q
* /hive/trunk/ql/src/test/queries/clientpositive/list_bucket_dml_5.q
* /hive/trunk/ql/src/test/queries/clientpositive/list_bucket_dml_6.q
* /hive/trunk/ql/src/test/queries/clientpositive/list_bucket_dml_7.q
* /hive/trunk/ql/src/test/queries/clientpositive/list_bucket_dml_8.q
* /hive/trunk/ql/src/test/queries/clientpositive/list_bucket_dml_9.q
* /hive/trunk/ql/src/test/queries/clientpositive/list_bucket_query_multiskew_1.q
* /hive/trunk/ql/src/test/queries/clientpositive/list_bucket_query_multiskew_2.q
* /hive/trunk/ql/src/test/queries/clientpositive/list_bucket_query_multiskew_3.q
* /hive/trunk/ql/src/test/queries/clientpositive/list_bucket_query_oneskew_1.q
* /hive/trunk/ql/src/test/queries/clientpositive/list_bucket_query_oneskew_2.q
* /hive/trunk/ql/src/test/queries/clientpositive/list_bucket_query_oneskew_3.q
* /hive/trunk/ql/src/test/results/clientnegative/invalid_config1.q.out
* /hive/trunk/ql/src/test/results/clientpositive/list_bucket_dml_1.q.out
* /hive/trunk/ql/src/test/results/clientpositive/list_bucket_dml_2.q.out
* /hive/trunk/ql/src/test/results/clientpositive/list_bucket_dml_3.q.out
* /hive/trunk/ql/src/test/results/clientpositive/list_bucket_dml_4.q.out
* /hive/trunk/ql/src/test/results/clientpositive/list_bucket_dml_5.q.out
* /hive/trunk/ql/src/test/results/clientpositive/list_bucket_dml_6.q.out
* /hive/trunk/ql/src/test/results/clientpositive/list_bucket_dml_7.q.out
* /hive/trunk/ql/src/test/results/clientpositive/list_bucket_dml_8.q.out
* /hive/trunk/ql/src/test/results/clientpositive/list_bucket_dml_9.q.out
* /hive/trunk/ql/src/test/results/clientpositive/list_bucket_query_multiskew_1.q.out
* /hive/trunk/ql/src/test/results/clientpositive/list_bucket_query_multiskew_2.q.out
* /hive/trunk/ql/src/test/results/clientpositive/list_bucket_query_multiskew_3.q.out
* /hive/trunk/ql/src/test/results/compiler/plan/case_sensitivity.q.xml
* /hive/trunk/ql/src/test/results/compiler/plan/groupby1.q.xml
* /hive/trunk/ql/src/test/results/compiler/plan/input1.q.xml
* /hive/trunk/ql/src/test/results/compiler/plan/input2.q.xml
* /hive/trunk/ql/src/test/results/compiler/plan/input3.q.xml
* /hive/trunk/ql/src/test/results/compiler/plan/input4.q.xml
* /hive/trunk/ql/src/test/results/compiler/plan/input5.q.xml
* /hive/trunk/ql/src/test/results/compiler/plan/input6.q.xml
* /hive/trunk/ql/src/test/results/compiler/plan/input7.q.xml
* /hive/trunk/ql/src/test/results/compiler/plan/input9.q.xml
* /hive/trunk/ql/src/test/results/compiler/plan/input_testsequencefile.q.xml
* /hive/trunk/ql/src/test/results/compiler/plan/join1.q.xml
* /hive/trunk/ql/src/test/results/compiler/plan/join2.q.xml
* /hive/trunk/ql/src/test/results/compiler/plan/join3.q.xml
* /hive/trunk/ql/src/test/results/compiler/plan/sample2.q.xml
* /hive/trunk/ql/src/test/results/compiler/plan/sample3.q.xml
* /hive/trunk/ql/src/test/results/compiler/plan/sample4.q.xml
* /hive/trunk/ql/src/test/results/compiler/plan/sample5.q.xml
* /hive/trunk/ql/src/test/results/compiler/plan/sample6.q.xml
* /hive/trunk/ql/src/test/results/compiler/plan/sample7.q.xml
* /hive/trunk/serde/src/java/org/apache/hadoop/hive/serde2/objectinspector/ObjectInspectorUtils.java

                
> Hive List Bucketing - DML support 
> ----------------------------------
>
>                 Key: HIVE-3073
>                 URL: https://issues.apache.org/jira/browse/HIVE-3073
>             Project: Hive
>          Issue Type: New Feature
>          Components: SQL
>    Affects Versions: 0.10.0
>            Reporter: Gang Tim Liu
>            Assignee: Gang Tim Liu
>         Attachments: HIVE-3073.patch.12, HIVE-3073.patch.13, HIVE-3073.patch.15, HIVE-3073.patch.18, HIVE-3073.patch.19, HIVE-3073.patch.21, HIVE-3073.patch.22, HIVE-3073.patch.24, HIVE-3073.patch.26, HIVE-3073.patch.27
>
>
> If a hive table column has skewed keys, query performance on non-skewed key is always impacted. Hive List Bucketing feature will address it:
> https://cwiki.apache.org/Hive/listbucketing.html
> This jira issue will track DML change for the feature:
> 1. single skewed column
> 2. manual load data

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Updated] (HIVE-3073) Hive List Bucketing - DML support

Posted by "Gang Tim Liu (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/HIVE-3073?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Gang Tim Liu updated HIVE-3073:
-------------------------------

    Status: Patch Available  (was: Open)

patch is available.
                
> Hive List Bucketing - DML support 
> ----------------------------------
>
>                 Key: HIVE-3073
>                 URL: https://issues.apache.org/jira/browse/HIVE-3073
>             Project: Hive
>          Issue Type: New Feature
>          Components: SQL
>    Affects Versions: 0.10.0
>            Reporter: Gang Tim Liu
>            Assignee: Gang Tim Liu
>         Attachments: HIVE-3073.patch.12, HIVE-3073.patch.13, HIVE-3073.patch.15, HIVE-3073.patch.18
>
>
> If a hive table column has skewed keys, query performance on non-skewed key is always impacted. Hive List Bucketing feature will address it:
> https://cwiki.apache.org/Hive/listbucketing.html
> This jira issue will track DML change for the feature:
> 1. single skewed column
> 2. manual load data

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Updated] (HIVE-3073) Hive List Bucketing - DML support

Posted by "Gang Tim Liu (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/HIVE-3073?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Gang Tim Liu updated HIVE-3073:
-------------------------------

    Status: Patch Available  (was: In Progress)

patch is available.
                
> Hive List Bucketing - DML support 
> ----------------------------------
>
>                 Key: HIVE-3073
>                 URL: https://issues.apache.org/jira/browse/HIVE-3073
>             Project: Hive
>          Issue Type: New Feature
>          Components: SQL
>    Affects Versions: 0.10.0
>            Reporter: Gang Tim Liu
>            Assignee: Gang Tim Liu
>         Attachments: HIVE-3073.patch.12, HIVE-3073.patch.13, HIVE-3073.patch.15, HIVE-3073.patch.18, HIVE-3073.patch.19, HIVE-3073.patch.21
>
>
> If a hive table column has skewed keys, query performance on non-skewed key is always impacted. Hive List Bucketing feature will address it:
> https://cwiki.apache.org/Hive/listbucketing.html
> This jira issue will track DML change for the feature:
> 1. single skewed column
> 2. manual load data

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Updated] (HIVE-3073) Hive List Bucketing - DML support

Posted by "Namit Jain (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/HIVE-3073?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Namit Jain updated HIVE-3073:
-----------------------------

    Status: Open  (was: Patch Available)

comments on phabricator
                
> Hive List Bucketing - DML support 
> ----------------------------------
>
>                 Key: HIVE-3073
>                 URL: https://issues.apache.org/jira/browse/HIVE-3073
>             Project: Hive
>          Issue Type: New Feature
>          Components: SQL
>    Affects Versions: 0.10.0
>            Reporter: Gang Tim Liu
>            Assignee: Gang Tim Liu
>         Attachments: HIVE-3073.patch.12, HIVE-3073.patch.13, HIVE-3073.patch.15, HIVE-3073.patch.18, HIVE-3073.patch.19
>
>
> If a hive table column has skewed keys, query performance on non-skewed key is always impacted. Hive List Bucketing feature will address it:
> https://cwiki.apache.org/Hive/listbucketing.html
> This jira issue will track DML change for the feature:
> 1. single skewed column
> 2. manual load data

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Updated] (HIVE-3073) Hive List Bucketing - DML support

Posted by "Gang Tim Liu (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/HIVE-3073?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Gang Tim Liu updated HIVE-3073:
-------------------------------

    Attachment: HIVE-3073.patch.19
    
> Hive List Bucketing - DML support 
> ----------------------------------
>
>                 Key: HIVE-3073
>                 URL: https://issues.apache.org/jira/browse/HIVE-3073
>             Project: Hive
>          Issue Type: New Feature
>          Components: SQL
>    Affects Versions: 0.10.0
>            Reporter: Gang Tim Liu
>            Assignee: Gang Tim Liu
>         Attachments: HIVE-3073.patch.12, HIVE-3073.patch.13, HIVE-3073.patch.15, HIVE-3073.patch.18, HIVE-3073.patch.19
>
>
> If a hive table column has skewed keys, query performance on non-skewed key is always impacted. Hive List Bucketing feature will address it:
> https://cwiki.apache.org/Hive/listbucketing.html
> This jira issue will track DML change for the feature:
> 1. single skewed column
> 2. manual load data

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Work started] (HIVE-3073) Hive List Bucketing - DML support

Posted by "Gang Tim Liu (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/HIVE-3073?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Work on HIVE-3073 started by Gang Tim Liu.

> Hive List Bucketing - DML support 
> ----------------------------------
>
>                 Key: HIVE-3073
>                 URL: https://issues.apache.org/jira/browse/HIVE-3073
>             Project: Hive
>          Issue Type: New Feature
>          Components: SQL
>    Affects Versions: 0.10.0
>            Reporter: Gang Tim Liu
>            Assignee: Gang Tim Liu
>         Attachments: HIVE-3073.patch.12, HIVE-3073.patch.13, HIVE-3073.patch.15, HIVE-3073.patch.18, HIVE-3073.patch.19, HIVE-3073.patch.21, HIVE-3073.patch.22, HIVE-3073.patch.24
>
>
> If a hive table column has skewed keys, query performance on non-skewed key is always impacted. Hive List Bucketing feature will address it:
> https://cwiki.apache.org/Hive/listbucketing.html
> This jira issue will track DML change for the feature:
> 1. single skewed column
> 2. manual load data

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Updated] (HIVE-3073) Hive List Bucketing - DML support

Posted by "Namit Jain (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/HIVE-3073?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Namit Jain updated HIVE-3073:
-----------------------------

    Status: Open  (was: Patch Available)

comments
                
> Hive List Bucketing - DML support 
> ----------------------------------
>
>                 Key: HIVE-3073
>                 URL: https://issues.apache.org/jira/browse/HIVE-3073
>             Project: Hive
>          Issue Type: New Feature
>          Components: SQL
>    Affects Versions: 0.10.0
>            Reporter: Gang Tim Liu
>            Assignee: Gang Tim Liu
>         Attachments: HIVE-3073.patch.12, HIVE-3073.patch.13
>
>
> If a hive table column has skewed keys, query performance on non-skewed key is always impacted. Hive List Bucketing feature will address it:
> https://cwiki.apache.org/Hive/listbucketing.html
> This jira issue will track DML change for the feature:
> 1. single skewed column
> 2. manual load data

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Commented] (HIVE-3073) Hive List Bucketing - DML support (single column/manual load)

Posted by "Carl Steinbach (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/HIVE-3073?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13287077#comment-13287077 ] 

Carl Steinbach commented on HIVE-3073:
--------------------------------------

Please send an email to dev@hive letting people know that this proposal is ready for review. Thanks!
                
> Hive List Bucketing - DML support (single column/manual load)
> -------------------------------------------------------------
>
>                 Key: HIVE-3073
>                 URL: https://issues.apache.org/jira/browse/HIVE-3073
>             Project: Hive
>          Issue Type: New Feature
>          Components: SQL
>            Reporter: Gang Tim Liu
>            Assignee: Gang Tim Liu
>
> If a hive table column has skewed keys, query performance on non-skewed key is always impacted. Hive List Bucketing feature will address it:
> https://cwiki.apache.org/Hive/listbucketing.html
> This jira issue will track DML change for the feature:
> 1. single skewed column
> 2. manual load data

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Commented] (HIVE-3073) Hive List Bucketing - DML support

Posted by "Namit Jain (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/HIVE-3073?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13510366#comment-13510366 ] 

Namit Jain commented on HIVE-3073:
----------------------------------

+1

looks good, running tests
                
> Hive List Bucketing - DML support 
> ----------------------------------
>
>                 Key: HIVE-3073
>                 URL: https://issues.apache.org/jira/browse/HIVE-3073
>             Project: Hive
>          Issue Type: New Feature
>          Components: SQL
>    Affects Versions: 0.10.0
>            Reporter: Gang Tim Liu
>            Assignee: Gang Tim Liu
>         Attachments: HIVE-3073.patch.12, HIVE-3073.patch.13, HIVE-3073.patch.15, HIVE-3073.patch.18, HIVE-3073.patch.19, HIVE-3073.patch.21, HIVE-3073.patch.22, HIVE-3073.patch.24, HIVE-3073.patch.26, HIVE-3073.patch.27
>
>
> If a hive table column has skewed keys, query performance on non-skewed key is always impacted. Hive List Bucketing feature will address it:
> https://cwiki.apache.org/Hive/listbucketing.html
> This jira issue will track DML change for the feature:
> 1. single skewed column
> 2. manual load data

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Updated] (HIVE-3073) Hive List Bucketing - DML support

Posted by "Gang Tim Liu (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/HIVE-3073?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Gang Tim Liu updated HIVE-3073:
-------------------------------

    Affects Version/s: 0.10.0
               Status: Patch Available  (was: Open)

https://reviews.facebook.net/D6021
                
> Hive List Bucketing - DML support 
> ----------------------------------
>
>                 Key: HIVE-3073
>                 URL: https://issues.apache.org/jira/browse/HIVE-3073
>             Project: Hive
>          Issue Type: New Feature
>          Components: SQL
>    Affects Versions: 0.10.0
>            Reporter: Gang Tim Liu
>            Assignee: Gang Tim Liu
>         Attachments: HIVE-3073.patch.12
>
>
> If a hive table column has skewed keys, query performance on non-skewed key is always impacted. Hive List Bucketing feature will address it:
> https://cwiki.apache.org/Hive/listbucketing.html
> This jira issue will track DML change for the feature:
> 1. single skewed column
> 2. manual load data

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Updated] (HIVE-3073) Hive List Bucketing - DML support

Posted by "Gang Tim Liu (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/HIVE-3073?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Gang Tim Liu updated HIVE-3073:
-------------------------------

    Status: Patch Available  (was: In Progress)

patch is available. thanks
                
> Hive List Bucketing - DML support 
> ----------------------------------
>
>                 Key: HIVE-3073
>                 URL: https://issues.apache.org/jira/browse/HIVE-3073
>             Project: Hive
>          Issue Type: New Feature
>          Components: SQL
>    Affects Versions: 0.10.0
>            Reporter: Gang Tim Liu
>            Assignee: Gang Tim Liu
>         Attachments: HIVE-3073.patch.12, HIVE-3073.patch.13, HIVE-3073.patch.15, HIVE-3073.patch.18, HIVE-3073.patch.19, HIVE-3073.patch.21, HIVE-3073.patch.22, HIVE-3073.patch.24
>
>
> If a hive table column has skewed keys, query performance on non-skewed key is always impacted. Hive List Bucketing feature will address it:
> https://cwiki.apache.org/Hive/listbucketing.html
> This jira issue will track DML change for the feature:
> 1. single skewed column
> 2. manual load data

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Updated] (HIVE-3073) Hive List Bucketing - DML support

Posted by "Gang Tim Liu (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/HIVE-3073?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Gang Tim Liu updated HIVE-3073:
-------------------------------

    Summary: Hive List Bucketing - DML support   (was: Hive List Bucketing - DML support (single column/manual load))
    
> Hive List Bucketing - DML support 
> ----------------------------------
>
>                 Key: HIVE-3073
>                 URL: https://issues.apache.org/jira/browse/HIVE-3073
>             Project: Hive
>          Issue Type: New Feature
>          Components: SQL
>            Reporter: Gang Tim Liu
>            Assignee: Gang Tim Liu
>
> If a hive table column has skewed keys, query performance on non-skewed key is always impacted. Hive List Bucketing feature will address it:
> https://cwiki.apache.org/Hive/listbucketing.html
> This jira issue will track DML change for the feature:
> 1. single skewed column
> 2. manual load data

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Updated] (HIVE-3073) Hive List Bucketing - DML support

Posted by "Namit Jain (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/HIVE-3073?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Namit Jain updated HIVE-3073:
-----------------------------

    Status: Open  (was: Patch Available)

comments on phabricator
                
> Hive List Bucketing - DML support 
> ----------------------------------
>
>                 Key: HIVE-3073
>                 URL: https://issues.apache.org/jira/browse/HIVE-3073
>             Project: Hive
>          Issue Type: New Feature
>          Components: SQL
>    Affects Versions: 0.10.0
>            Reporter: Gang Tim Liu
>            Assignee: Gang Tim Liu
>         Attachments: HIVE-3073.patch.12, HIVE-3073.patch.13, HIVE-3073.patch.15, HIVE-3073.patch.18, HIVE-3073.patch.19, HIVE-3073.patch.21, HIVE-3073.patch.22
>
>
> If a hive table column has skewed keys, query performance on non-skewed key is always impacted. Hive List Bucketing feature will address it:
> https://cwiki.apache.org/Hive/listbucketing.html
> This jira issue will track DML change for the feature:
> 1. single skewed column
> 2. manual load data

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira