You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "Holman Lan (JIRA)" <ji...@apache.org> on 2015/06/13 02:26:00 UTC

[jira] [Commented] (SPARK-5680) Sum function on all null values, should return zero

    [ https://issues.apache.org/jira/browse/SPARK-5680?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14584324#comment-14584324 ] 

Holman Lan commented on SPARK-5680:
-----------------------------------

Hi Guys,

Just curious about this change to return 0 for the SUM function when all values are NULL as many other data sources (Hive, Impala, SQL Server ...) return NULL in this case. Could you kindly share the motivation behind the change?

Many thanks,
Holman

> Sum function on all null values, should return zero
> ---------------------------------------------------
>
>                 Key: SPARK-5680
>                 URL: https://issues.apache.org/jira/browse/SPARK-5680
>             Project: Spark
>          Issue Type: Bug
>          Components: SQL
>            Reporter: Venkata Ramana G
>            Assignee: Venkata Ramana G
>            Priority: Minor
>             Fix For: 1.3.1, 1.4.0
>
>
> SELECT  sum('a'),  avg('a'),  variance('a'),  std('a') FROM src;
> Current output:
> NULL	NULL	NULL	NULL
> Expected output:
> 0.0	NULL	NULL	NULL
> This fixes hive udaf_number_format.q 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org