You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@kylin.apache.org by Chao Long <wa...@qq.com> on 2018/12/10 10:19:00 UTC

回复:Precisely Count Distinct cause spark data skew

Hi,
  You can try to set this parameter "kylin.engine.mr.uhc-reducer-count" a larger value in your kylin.properties or cube level properties(cube design page->Configuration Overwrites).


------------------
Best Regards,
Chao Long


 




------------------ 原始邮件 ------------------
发件人: "|明の"<15...@qq.com>;
发送时间: 2018年12月10日(星期一) 下午5:47
收件人: "user"<us...@kylin.apache.org>;

主题: Precisely Count Distinct cause spark data skew



Hi all~
      It's my first time to ask questions,nice to meet you !


        I have builded my cube with Precisely Count Distinct ,then the spark data skew happened .In addition ,the spark input data was vary large !
        
        Why is that ,how can I fix it?


                                                                                                           Thanks all~