You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@hive.apache.org by "Raghotham Murthy (JIRA)" <ji...@apache.org> on 2009/03/24 19:00:52 UTC

[jira] Commented: (HIVE-362) avg() returns null if any item in the list is null; create function that doesn't

    [ https://issues.apache.org/jira/browse/HIVE-362?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12688769#action_12688769 ] 

Raghotham Murthy commented on HIVE-362:
---------------------------------------

Aggregates are supposed to ignore nulls according to the standard. The code for SUM and AVG seems to indicate that they do ignore NULLs. Are you getting NULLs?

> avg() returns null if any item in the list is null; create function that doesn't
> --------------------------------------------------------------------------------
>
>                 Key: HIVE-362
>                 URL: https://issues.apache.org/jira/browse/HIVE-362
>             Project: Hadoop Hive
>          Issue Type: Wish
>          Components: Clients, Query Processor
>            Reporter: Adam Kramer
>
> Some of the current aggregates (sum, avg) have a fairly standard behavior: If any item in the list is NULL, the sum, average, etc., cannot be computed. And so, NULL is returned.
> 1) If this is the case, the query should return much faster--see a null, return NULL, exit(0).
> 2) It would be nice to have versions or ways to use these functions with NULL data--specifically, to treat the NULL as zero or to ignore the NULL and return the results for non-NULL data.
> This also would apply to the variance functions referenced in https://issues.apache.org/jira/browse/HIVE-165

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.