You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@flink.apache.org by Fanbin Bu <fa...@coinbase.com> on 2019/09/19 22:03:27 UTC

count distinct not supported in batch?

Hi,

Just found that count distinct is supported in streaming but not in batch
(version 1.8), is there any plan to add this to batch?

SELECT
  user_id
  , hop_end(created_at, interval '30' second, interval '30' second) as bucket_ts
  , count(distinct name)
FROM $table
GROUP BY
  user_id
  , hop(created_at, interval '30' second, interval '30' second)


Thanks,
Fanbin

Re: count distinct not supported in batch?

Posted by JingsongLee <lz...@aliyun.com>.
Hi fanbin:
It is "distinct aggregates for group window" in batch sql mode.
Now,
legacy planner: not support.
blink planner: not support.
There is no clear plan yet.
But if the demand is strong, we can consider supporting it.

Best,
Jingsong Lee


------------------------------------------------------------------
From:Fanbin Bu <fa...@coinbase.com>
Send Time:2019年9月20日(星期五) 06:03
To:user <us...@flink.apache.org>
Subject:count distinct not supported in batch?

Hi,

Just found that count distinct is supported in streaming but not in batch (version 1.8), is there any plan to add this to batch?

SELECT
  user_id
  , hop_end(created_at, interval '30' second, interval '30' second) as bucket_ts
  , count(distinct name)
FROM $table
GROUP BY
  user_id
  , hop(created_at, interval '30' second, interval '30' second)

Thanks,
Fanbin