You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@kylin.apache.org by ro...@stratebi.com on 2018/04/02 16:00:20 UTC
Dimension update issue
Hi,
I have some questions about how Kylin refresh dimensions tables. In case
you delete some instances of the dimension on hive table that are not used
by any fact in fact table, These values will be deleted from dimension table
on next cube segment built? We have to refresh old segments?
In our case we have added by mistake a lot of empty future days to a Date
dimension (80 years). Then we have built some days of the cube. However, as
this big date dimensions leads to performance issues, we have deleted most
of future years from Date dimension. However, after built a new segment of
the cube, the deleted rows on Hive table they have not been deleted in Kylin
cube table. How can we refresh the dimension table on Kylin properly?
Thanks in advance!
Roberto Tardío Olmos
Senior Big Data & Business Intelligence Consultant
Avenida de Brasil, 17, Planta 16.28020 Madrid
Fijo: 91.788.34.10
http://bigdata.stratebi.com/
http://www.stratebi.com
RE: Dimension update issue
Posted by ro...@stratebi.com.
Many thanks another time ShaoFeng. We have refreshed segments that were built with modified dimension table and now we see the changes on Date dimension table.
Kind regards
From: ShaoFeng Shi [mailto:shaofengshi@apache.org]
Sent: martes, 3 de abril de 2018 4:22
To: user <us...@kylin.apache.org>
Subject: Re: Dimension update issue
Hi Roberto,
Kylin will take the snapshot for dimension tables, and persist the snapshots together with the cube segments.
If you have modified the dimension table in Hive, and expects to see the changes for history segments, the proper way is to refresh the history segment(s), which will refresh the snapshot.
Hope this clarifies your questions.
2018-04-03 0:00 GMT+08:00 <roberto.tardio@stratebi.com <ma...@stratebi.com> >:
Hi,
I have some questions about how Kylin refresh dimensions tables. In case you delete some instances of the dimension on hive table that are not used by any fact in fact table, These values will be deleted from dimension table on next cube segment built? We have to refresh old segments?
In our case we have added by mistake a lot of empty future days to a Date dimension (80 years). Then we have built some days of the cube. However, as this big date dimensions leads to performance issues, we have deleted most of future years from Date dimension. However, after built a new segment of the cube, the deleted rows on Hive table they have not been deleted in Kylin cube table. How can we refresh the dimension table on Kylin properly?
Thanks in advance!
Roberto Tardío Olmos
Senior Big Data & Business Intelligence Consultant
Avenida de Brasil, 17 <https://maps.google.com/?q=Avenida+de+Brasil,+17&entry=gmail&source=g> , Planta 16.28020 Madrid
Fijo: 91.788.34.10
http://bigdata.stratebi.com/
http://www.stratebi.com
--
Best regards,
Shaofeng Shi 史少锋
Re: Dimension update issue
Posted by ShaoFeng Shi <sh...@apache.org>.
Hi Roberto,
Kylin will take the snapshot for dimension tables, and persist the
snapshots together with the cube segments.
If you have modified the dimension table in Hive, and expects to see the
changes for history segments, the proper way is to refresh the history
segment(s), which will refresh the snapshot.
Hope this clarifies your questions.
2018-04-03 0:00 GMT+08:00 <ro...@stratebi.com>:
> Hi,
>
>
>
> I have some questions about how Kylin refresh dimensions tables. In case
> you delete some instances of the dimension on hive table that are not used
> by any fact in fact table, These values will be deleted from dimension
> table on next cube segment built? We have to refresh old segments?
>
>
>
> In our case we have added by mistake a lot of empty future days to a Date
> dimension (80 years). Then we have built some days of the cube. However, as
> this big date dimensions leads to performance issues, we have deleted most
> of future years from Date dimension. However, after built a new segment of
> the cube, the deleted rows on Hive table they have not been deleted in
> Kylin cube table. How can we refresh the dimension table on Kylin properly?
>
>
>
> Thanks in advance!
>
>
>
> *Roberto Tardío Olmos*
>
> *Senior Big Data & Business Intelligence Consultant*
>
> Avenida de Brasil, 17
> <https://maps.google.com/?q=Avenida+de+Brasil,+17&entry=gmail&source=g>,
> Planta 16.28020 Madrid
>
> Fijo: 91.788.34.10
>
>
> [image:
> http://www.stratebi.com/image/layout_set_logo?img_id=21615&t=1486381163544]
>
>
>
> http://bigdata.stratebi.com/
>
>
>
> http://www.stratebi.com
>
>
>
--
Best regards,
Shaofeng Shi 史少锋