You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@kylin.apache.org by Mars J <xu...@gmail.com> on 2016/05/12 11:50:41 UTC

Re: Incremental Build

Hi shoafeng,
      If I use the same partition column in hive, the partition column is a
date type.every day there will produce a sub-directory under the hive
table, thus it'll more and more sub-directory under hdfs.  besides, the
first step Create Intermediate Flat Hive Table is below 10 mins, and this
step is not the most consumed step when cubing, maybe this because my data
volumn is not enough huge.
     Can u tell me how the same partition column in hive and kylin to
improve the performance in generating the flat table step, for example, how
many minutes or hours can it save to cubing?

2016-04-09 15:43 GMT+08:00 ShaoFeng Shi <sh...@apache.org>:

> It is recommended to use the same partition column in hive and kylin, that
> would gain better performance in generating the flat table step, but this
> is not required.
>
> 2016-04-09 9:36 GMT+08:00 Mars J <xu...@gmail.com>:
>
>> Hi ,
>>
>>        Are hive fact tables and dimensiontal tables should be date-column
>> partition table when incremental building by date ?
>>
>
>
>
> --
> Best regards,
>
> Shaofeng Shi
>
>