You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@kylin.apache.org by lk_hadoop <lk...@163.com> on 2019/11/11 09:27:08 UTC
Fw: Kylin Hybrids Cont distinct got wrong value
2019-11-11
lk_hadoop
发件人:"lk_hadoop"<lk...@163.com>
发送时间:2019-11-08 10:34
主题:Kylin Hybrids Cont distinct got wrong value
收件人:"user"<us...@kylin.apache.org>
抄送:
hi , all :
I have a Measure with Cont distinct algorithm , when I use a Hybrid ,The Hybrid include two cube, each cube have one years data , I can't get the meaningful result when across two year.
-- get each year Cont distinct Measure value
SELECT ENT_NAME, ORD_YEAR ,SUM(PAID_IN_AMT) AS sum_PAID_IN_AMT, COUNT(DISTINCT TGOODS_ID) AS count_d_TGOODS_ID
FROM gjst.SH_FETCH_SALE_BASE_FACT_ALL_NEW2
WHERE ENT_NAME IN ('仁泰')
AND ORD_YEAR IN ('2018','2019')
GROUP BY ENT_NAME, ORD_YEAR
ENT_NAME | ORD_YEAR | SUM_PAID_IN_AMT |COUNT_D_TGOODS_ID
仁泰 2019 186483922.94 7087
仁泰 2018 236461205.81 9418
-- get two year aggregation value
SELECT ENT_NAME ,SUM(PAID_IN_AMT) AS sum_PAID_IN_AMT, COUNT(DISTINCT TGOODS_ID) AS count_d_TGOODS_ID
FROM gjst.SH_FETCH_SALE_BASE_FACT_ALL_NEW2
WHERE ENT_NAME IN ('仁泰')
AND ORD_YEAR IN ('2018','2019')
GROUP BY ENT_NAME
ENT_NAME | SUM_PAID_IN_AMT |COUNT_D_TGOODS_ID
仁泰 422945128.75 9418
Is there any way to resolve it ?
2019-11-08
lk_hadoop
Re:Fw: Kylin Hybrids Cont distinct got wrong value
Posted by nichunen <ni...@apache.org>.
Hello,
Count distinct is not supported in Hybrid.
*Please note, Hybrid model is not suitable for “bitmap” count distinct measures’s merge across cubes, please have the partition date as a group by field in the SQL query. *
http://kylin.apache.org/docs/tutorial/hybrid.html
Best regards,
Ni Chunen / George
On 11/11/2019 17:27,lk_hadoop<lk...@163.com> wrote:
2019-11-11
lk_hadoop
发件人:"lk_hadoop"<lk...@163.com>
发送时间:2019-11-08 10:34
主题:Kylin Hybrids Cont distinct got wrong value
收件人:"user"<us...@kylin.apache.org>
抄送:
hi , all :
I have a Measure with Cont distinct algorithm , when I use a Hybrid ,The Hybrid include two cube, each cube have one years data , I can't get the meaningful result when across two year.
-- get each year Cont distinct Measure value
SELECT ENT_NAME, ORD_YEAR ,SUM(PAID_IN_AMT) AS sum_PAID_IN_AMT, COUNT(DISTINCT TGOODS_ID) AS count_d_TGOODS_ID
FROM gjst.SH_FETCH_SALE_BASE_FACT_ALL_NEW2
WHERE ENT_NAME IN ('仁泰')
AND ORD_YEAR IN ('2018','2019')
GROUP BY ENT_NAME, ORD_YEAR
ENT_NAME | ORD_YEAR | SUM_PAID_IN_AMT |COUNT_D_TGOODS_ID
仁泰 2019 186483922.94 7087
仁泰 2018 236461205.81 9418
-- get two year aggregation value
SELECT ENT_NAME ,SUM(PAID_IN_AMT) AS sum_PAID_IN_AMT, COUNT(DISTINCT TGOODS_ID) AS count_d_TGOODS_ID
FROM gjst.SH_FETCH_SALE_BASE_FACT_ALL_NEW2
WHERE ENT_NAME IN ('仁泰')
AND ORD_YEAR IN ('2018','2019')
GROUP BY ENT_NAME
ENT_NAME | SUM_PAID_IN_AMT |COUNT_D_TGOODS_ID
仁泰 422945128.75 9418
Is there any way to resolve it ?
2019-11-08
lk_hadoop