You are viewing a plain text version of this content. The canonical link for it is here.

Posted to issues@spark.apache.org by "J.P Feng (JIRA)" <ji...@apache.org> on 2016/12/30 13:53:58 UTC

[jira] [Issue Comment Deleted] (SPARK-19037) Run count(distinct name) from sub query found some errors

     [ https://issues.apache.org/jira/browse/SPARK-19037?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

J.P Feng updated SPARK-19037:
-----------------------------
    Comment: was deleted

(was: errors logs:)

> Run count(distinct name) from sub query found some errors
> ---------------------------------------------------------
>
>                 Key: SPARK-19037
>                 URL: https://issues.apache.org/jira/browse/SPARK-19037
>             Project: Spark
>          Issue Type: Bug
>          Components: Spark Shell, SQL
>    Affects Versions: 2.1.0
>         Environment: spark 2.1.0, scala 2.11 
>            Reporter: J.P Feng
>              Labels: distinct, sparkSQL, sub-query
>
> when i use spark-shell or spark-sql to execute count(distinct name) from subquery, some errors occur:
> select count(distinct name) from (select * from mytest limit 10) as a
> if i do this in hive-server2, i can get the correct result.
> if i just execute select count(name) from (select * from mytest limit 10) as a, i can also get the right result.
> besides, i found the same errors when i use max(), distinct(),groupby() with subquery.
> I think there maybe some bugs when doing key-reduce jobs with subquery.
> I will add the errors in new comment.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org