You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@kylin.apache.org by 猪兜兜 <12...@qq.com> on 2020/03/05 11:41:25 UTC

回复: [DISCUSS] Upgrade Kylin's dependency to Hadoop 3 / HBase 2

恻然





------------------&nbsp;原始邮件&nbsp;------------------
发件人:&nbsp;"Luke Han"<luke.hq@gmail.com&gt;;
发送时间:&nbsp;2020年3月5日(星期四) 下午3:49
收件人:&nbsp;"user"<user@kylin.apache.org&gt;;
抄送:&nbsp;"dev"<dev@kylin.apache.org&gt;;
主题:&nbsp;Re: [DISCUSS] Upgrade Kylin's dependency to Hadoop 3 / HBase 2



+1 the new Kylin version should target Hadoop 3

Best Regards!
---------------------

Luke Han


On Mon, Mar 2, 2020 at 4:41 PM Billy Liu <billyliu@apache.org&gt; wrote:

&gt; +1. Let's move to Hadoop3
&gt;
&gt; With Warm regards
&gt;
&gt; Billy Liu
&gt;
&gt;
&gt; ShaoFeng Shi <shaofengshi@apache.org&gt; 于2020年2月27日周四 下午10:07写道:
&gt;
&gt;&gt; Hi Yang,
&gt;&gt;
&gt;&gt; The main difference between 2.6 and 3.0 is the new real-time OLAP
&gt;&gt; feature. Hadoop 2 users can select either of them, depends on whether they
&gt;&gt; need the real-time feature.
&gt;&gt;
&gt;&gt; After 3.0, the next major features would be the Flink cube engine
&gt;&gt; (planned in v3.1) and the Parquet storage (early stage, maybe in v4.0).
&gt;&gt;
&gt;&gt; When the parquet storage is released, as the dependency on HBase can be
&gt;&gt; dropped, then we assume the API issue will easier than today. We can
&gt;&gt; re-evaluate the possibility to support Hadoop 2.
&gt;&gt;
&gt;&gt; So I think the impact on today's Hadoop 2 users is acceptable. Not
&gt;&gt; mention that they still can manually compile that.
&gt;&gt;
&gt;&gt;
&gt;&gt; Best regards,
&gt;&gt;
&gt;&gt; Shaofeng Shi 史少锋
&gt;&gt; Apache Kylin PMC
&gt;&gt; Email: shaofengshi@apache.org
&gt;&gt;
&gt;&gt; Apache Kylin FAQ: https://kylin.apache.org/docs/gettingstarted/faq.html
&gt;&gt; Join Kylin user mail group: user-subscribe@kylin.apache.org
&gt;&gt; Join Kylin dev mail group: dev-subscribe@kylin.apache.org
&gt;&gt;
&gt;&gt;
&gt;&gt;
&gt;&gt;
&gt;&gt; Li Yang <liyang@apache.org&gt; 于2020年2月27日周四 上午7:37写道:
&gt;&gt;
&gt;&gt;&gt; The proposal means Kylin 3.0 will be the last major version that supports
&gt;&gt;&gt; Hadoop 2.
&gt;&gt;&gt;
&gt;&gt;&gt; What will be recommended version for Hadoop 2 users after this? I feel
&gt;&gt;&gt; the
&gt;&gt;&gt; latest stable version of 2.6 is better than 3.0.
&gt;&gt;&gt;
&gt;&gt;&gt; Anyway, I'm fine with moving focus to Hadoop 3. That is the direction.
&gt;&gt;&gt; However we shall also think about what it means for Hadoop 2 users.
&gt;&gt;&gt; Questions like below shall also be answered.
&gt;&gt;&gt;
&gt;&gt;&gt; - What is the recommended version/branch for Hadoop 2? (Btw, 3.0 does not
&gt;&gt;&gt; sound right here.)
&gt;&gt;&gt; - How that version/branch will be maintained?
&gt;&gt;&gt;
&gt;&gt;&gt; +1 in general
&gt;&gt;&gt;
&gt;&gt;&gt; Regards
&gt;&gt;&gt; -Yang
&gt;&gt;&gt;
&gt;&gt;&gt;
&gt;&gt;&gt; On Wed, Feb 26, 2020 at 5:36 PM Zhou Kang <zhoukangcn@outlook.com&gt;
&gt;&gt;&gt; wrote:
&gt;&gt;&gt;
&gt;&gt;&gt; &gt; +1
&gt;&gt;&gt; &gt;
&gt;&gt;&gt; &gt;
&gt;&gt;&gt; &gt; &gt; 2020年2月26日 下午3:48,ShaoFeng Shi <shaofengshi@apache.org&gt; 写道:
&gt;&gt;&gt; &gt; &gt;
&gt;&gt;&gt; &gt; &gt; Hello, Kylin users and developers,
&gt;&gt;&gt; &gt; &gt;
&gt;&gt;&gt; &gt; &gt; As we know Hadoop 3 and HBase 2 have released for some time. Kylin
&gt;&gt;&gt; &gt; starts to support Hadoop 3&nbsp; since v2.5.0 in Sep 2018.&nbsp; As the APIs of
&gt;&gt;&gt; HBase
&gt;&gt;&gt; &gt; 1 and 2 are incompatible, we need to keep different branches for them.
&gt;&gt;&gt; And
&gt;&gt;&gt; &gt; in each release, we need to build separate packages and do a round of
&gt;&gt;&gt; &gt; testing for them separately. Furthermore, Cloudera's API difference
&gt;&gt;&gt; with
&gt;&gt;&gt; &gt; the Apache release makes the situation worse; We need to build 4 binary
&gt;&gt;&gt; &gt; packages for reach release. That has spent much of our manual effort
&gt;&gt;&gt; and
&gt;&gt;&gt; &gt; computing resources.
&gt;&gt;&gt; &gt; &gt;
&gt;&gt;&gt; &gt; &gt; Today, Hadoop 3 + HBase 2 becomes enough mature and stable for
&gt;&gt;&gt; &gt; production use; And we see more and more users are starting to use the
&gt;&gt;&gt; new
&gt;&gt;&gt; &gt; versions. We think it is time for Kylin to totally upgrade to the new
&gt;&gt;&gt; &gt; version. So that we can focus more on Kylin itself, instead of
&gt;&gt;&gt; environments.
&gt;&gt;&gt; &gt; &gt;
&gt;&gt;&gt; &gt; &gt;&nbsp; Here is my proposal:
&gt;&gt;&gt; &gt; &gt; 1) From Kylin 3.1,&nbsp; Hadoop/HBase version upgrades to 3.1/2.1 (or a
&gt;&gt;&gt; close
&gt;&gt;&gt; &gt; version);
&gt;&gt;&gt; &gt; &gt; 2) Hadoop 2 and HBase 1 users can use Kylin 3.0 and previous
&gt;&gt;&gt; releases;
&gt;&gt;&gt; &gt; &gt; 3) We will re-evaluate the need for building binary packages for
&gt;&gt;&gt; &gt; Cloudera release. (we may raise another discuss)
&gt;&gt;&gt; &gt; &gt;
&gt;&gt;&gt; &gt; &gt; Please let us know your comments. And please also understand with the
&gt;&gt;&gt; &gt; limited resource we couldn't support multiple Hadoop versions...
&gt;&gt;&gt; &gt; &gt;
&gt;&gt;&gt; &gt; &gt; Thanks!
&gt;&gt;&gt; &gt; &gt;
&gt;&gt;&gt; &gt; &gt; Best regards,
&gt;&gt;&gt; &gt; &gt;
&gt;&gt;&gt; &gt; &gt; Shaofeng Shi 史少锋
&gt;&gt;&gt; &gt; &gt; Apache Kylin PMC
&gt;&gt;&gt; &gt; &gt; Email: shaofengshi@apache.org
&gt;&gt;&gt; &gt; &gt;
&gt;&gt;&gt; &gt; &gt; Apache Kylin FAQ:
&gt;&gt;&gt; https://kylin.apache.org/docs/gettingstarted/faq.html
&gt;&gt;&gt; &gt; &gt; Join Kylin user mail group: user-subscribe@kylin.apache.org
&gt;&gt;&gt; &gt; &gt; Join Kylin dev mail group: dev-subscribe@kylin.apache.org
&gt;&gt;&gt; &gt; &gt;
&gt;&gt;&gt; &gt; &gt;
&gt;&gt;&gt; &gt;
&gt;&gt;&gt; &gt;
&gt;&gt;&gt;
&gt;&gt;