Posted to common-dev@hadoop.apache.org by "Bryan Duxbury (JIRA)" <ji...@apache.org> on 2007/12/06 00:27:43 UTC

[jira] Commented: (HADOOP-2006) Aggregate Functions in select statement

    [ https://issues.apache.org/jira/browse/HADOOP-2006?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12548874 ] 

Bryan Duxbury commented on HADOOP-2006:
---------------------------------------

This seems like a bad idea. A table could hold TONS of data, and aggregating all of it in one place would take forever. If you want to produce aggregate info, you should probably fire off a MapReduce job, no?
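
To illustrate the MapReduce route, here is a rough sketch of how the quoted query (avg(year) grouped by producer) could be split into a map step and a reduce step. It is plain Python that only simulates the phases in one process; it is not the Hadoop MapReduce API, and the function names and sample rows are invented for the example. The point is that each mapper emits a partial (sum, count) pair per group, so the partials can be combined anywhere and no single node has to see every row.

```python
from collections import defaultdict

# Hypothetical sample rows standing in for movieLog_table.
ROWS = [
    {"producer": "A", "year": 1999},
    {"producer": "A", "year": 2001},
    {"producer": "B", "year": 2005},
]

def map_phase(rows):
    """Emit (group key, (partial sum, partial count)) for each row."""
    for row in rows:
        yield row["producer"], (row["year"], 1)

def shuffle(pairs):
    """Group emitted values by key, as a framework would between phases."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped

def reduce_phase(grouped):
    """Combine the partial (sum, count) pairs per key and derive the average."""
    result = {}
    for key, partials in grouped.items():
        total = sum(s for s, _ in partials)
        count = sum(c for _, c in partials)
        result[key] = total / count
    return result

def avg_year_by_producer(rows):
    """Run the simulated map, shuffle, and reduce phases end to end."""
    return reduce_phase(shuffle(map_phase(rows)))

print(avg_year_by_producer(ROWS))
```

Carrying (sum, count) instead of a running average is what makes the partial results safe to merge in any order; min, max, sum, and count merge the same way.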

> Aggregate Functions in select statement
> ---------------------------------------
>
>                 Key: HADOOP-2006
>                 URL: https://issues.apache.org/jira/browse/HADOOP-2006
>             Project: Hadoop
>          Issue Type: Sub-task
>          Components: contrib/hbase
>    Affects Versions: 0.14.1
>            Reporter: Edward Yoon
>            Assignee: Edward Yoon
>            Priority: Minor
>             Fix For: 0.16.0
>
>
> Aggregation functions on collections of data values: average, minimum, maximum, sum, count.
> Group rows by the value of a column family and apply the aggregate function independently to each group of rows.
>  * <Grouping columnfamilies>  ƒ ~function_list~ (Relation)
> {code}
> select producer, avg(year) from movieLog_table group by producer
> {code}
