Posted to issues@hive.apache.org by "Hive QA (JIRA)" <ji...@apache.org> on 2018/04/16 18:55:00 UTC

[jira] [Commented] (HIVE-12369) Native Vector GroupBy (Part 1)

    [ https://issues.apache.org/jira/browse/HIVE-12369?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16439863#comment-16439863 ] 

Hive QA commented on HIVE-12369:
--------------------------------

| (x) *{color:red}-1 overall{color}* |
\\
\\
|| Vote || Subsystem || Runtime || Comment ||
| {color:red}-1{color} | {color:red} patch {color} | {color:red}  0m 14s{color} | {color:red} /data/hiveptest/logs/PreCommit-HIVE-Build-10255/patches/PreCommit-HIVE-Build-10255.patch does not apply to master. Rebase required? Wrong Branch? See http://cwiki.apache.org/confluence/display/Hive/HowToContribute for help. {color} |
\\
\\
|| Subsystem || Report/Notes ||
| Console output | http://104.198.109.242/logs//PreCommit-HIVE-Build-10255/yetus.txt |
| Powered by | Apache Yetus    http://yetus.apache.org |


This message was automatically generated.



> Native Vector GroupBy (Part 1)
> ------------------------------
>
>                 Key: HIVE-12369
>                 URL: https://issues.apache.org/jira/browse/HIVE-12369
>             Project: Hive
>          Issue Type: Bug
>          Components: Hive
>            Reporter: Matt McCline
>            Assignee: Matt McCline
>            Priority: Critical
>         Attachments: HIVE-12369.01.patch, HIVE-12369.02.patch, HIVE-12369.05.patch, HIVE-12369.06.patch, HIVE-12369.091.patch, HIVE-12369.094.patch, HIVE-12369.095.patch
>
>
> Implement Native Vector GroupBy using fast hash table technology developed for Native Vector MapJoin, etc.
> The patch is currently limited to a single Long key with a single COUNT aggregation, or a single Long key with no aggregation (also known as duplicate reduction).
> 3 new classes are introduced that store the count in the slot table and don't allocate hash elements:
> {noformat}
>   COUNT(column)  VectorGroupByHashLongKeyCountColumnOperator      
>   COUNT(key)     VectorGroupByHashLongKeyCountKeyOperator            
>   COUNT(*)       VectorGroupByHashLongKeyCountStarOperator           
> {noformat}
> And the duplicate reduction operator for a single Long key:
> {noformat}
>   VectorGroupByHashLongKeyDuplicateReductionOperator
> {noformat}
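
For readers unfamiliar with the idea in the description above of keeping the count in the slot table rather than allocating a hash element per key, here is a minimal sketch. It is not taken from the patch: the class name LongKeyCountTable, its methods, the hash function, and the load factor are all hypothetical and chosen only for illustration. It assumes open addressing with linear probing over parallel primitive arrays.

```java
// Hypothetical illustration only; this is NOT the code from HIVE-12369.
// Keys and counts live directly in parallel "slot" arrays, so inserting a
// new key allocates no per-entry object.
final class LongKeyCountTable {
    private long[] keys;
    private long[] counts;
    private boolean[] used;   // marks occupied slots (a count of 0 is legal)
    private int size;         // number of distinct keys stored

    LongKeyCountTable(int initialCapacity) {
        int cap = 16;
        while (cap < initialCapacity) {
            cap <<= 1;        // keep capacity a power of two for cheap masking
        }
        keys = new long[cap];
        counts = new long[cap];
        used = new boolean[cap];
    }

    // Assumed mixing function; any reasonable long hash would do here.
    private static int hash(long key) {
        long h = key * 0x9E3779B97F4A7C15L;
        return (int) (h ^ (h >>> 32));
    }

    // Linear probing: returns the slot holding the key, or the first empty slot.
    private int findSlot(long key) {
        int mask = keys.length - 1;
        int slot = hash(key) & mask;
        while (used[slot] && keys[slot] != key) {
            slot = (slot + 1) & mask;
        }
        return slot;
    }

    // Adds delta to the key's count, registering the key if it is new.
    // delta = 0 only registers the key, which models duplicate reduction.
    void add(long key, long delta) {
        if ((size + 1) * 2 > keys.length) {
            resize();         // keep the table at most half full
        }
        int slot = findSlot(key);
        if (!used[slot]) {
            used[slot] = true;
            keys[slot] = key;
            size++;
        }
        counts[slot] += delta;
    }

    // Returns the stored count, or -1 if the key was never added.
    long count(long key) {
        int slot = findSlot(key);
        return used[slot] ? counts[slot] : -1;
    }

    int distinctKeys() {
        return size;
    }

    // Doubles the capacity and re-inserts every occupied slot.
    private void resize() {
        long[] oldKeys = keys;
        long[] oldCounts = counts;
        boolean[] oldUsed = used;
        int cap = oldKeys.length * 2;
        keys = new long[cap];
        counts = new long[cap];
        used = new boolean[cap];
        for (int i = 0; i < oldKeys.length; i++) {
            if (oldUsed[i]) {
                int slot = findSlot(oldKeys[i]);
                used[slot] = true;
                keys[slot] = oldKeys[i];
                counts[slot] = oldCounts[i];
            }
        }
    }
}
```

Under these assumptions, a COUNT(*) style aggregation would call add(key, 1) for every row, a COUNT(column) style aggregation would call add(key, 1) only when the column value is non-null and add(key, 0) otherwise, and duplicate reduction would call add(key, 0) and read back only the distinct keys.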



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)