Posted to dev@kylin.apache.org by "ワンレイ (ケビンKevin)" <wa...@simplex-cn.co.jp> on 2015/03/20 07:39:05 UTC

Kylin vs Druid

Hi,

A quick question: how do you see Kylin as different from Druid, another
tool built to serve large quantities of analytic data with low
latency? Do the two tools solve a similar problem with different
approaches?

Thanks!
Kevin

Re: Kylin vs Druid

Posted by Luke Han <lu...@gmail.com>.
9. Earlier versions of Druid were released under the GPL v2 license; the
latest version of Druid has changed to the Apache License 2.0. Apache Kylin
is under the Apache License 2.0.

Both Druid and Kylin aim to solve some big problems in the Big Data area,
but they take different approaches and serve different purposes. By
definition, Kylin provides a SQL interface and OLAP capability. Since the
industry and the technology change so frequently, both projects are evolving
rapidly.
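
To make "SQL interface and OLAP capability" a bit more concrete, here is a
minimal sketch of the kind of star-schema aggregate query such an interface
is meant to answer. It uses Python's built-in sqlite3 module purely as a
stand-in SQL engine, not Kylin itself, and the table and column names
(sales_fact, date_dim, amount, year) are invented for the example.

```python
import sqlite3


def run_star_schema_query():
    """Build a tiny star schema in memory and run one aggregate SQL query."""
    conn = sqlite3.connect(":memory:")
    cur = conn.cursor()
    # Dimension table: one row per calendar day (hypothetical schema).
    cur.execute("CREATE TABLE date_dim (date_id INTEGER PRIMARY KEY, year INTEGER)")
    # Fact table: one row per sale, pointing at the dimension by key.
    cur.execute("CREATE TABLE sales_fact (date_id INTEGER, amount INTEGER)")
    cur.executemany(
        "INSERT INTO date_dim VALUES (?, ?)",
        [(1, 2014), (2, 2014), (3, 2015)],
    )
    cur.executemany(
        "INSERT INTO sales_fact VALUES (?, ?)",
        [(1, 10), (2, 20), (3, 5), (3, 7)],
    )
    # Plain SQL: join the fact table to the dimension table,
    # then aggregate by a dimension attribute.
    cur.execute(
        """
        SELECT d.year, SUM(f.amount) AS total_amount
        FROM sales_fact AS f
        JOIN date_dim AS d ON f.date_id = d.date_id
        GROUP BY d.year
        ORDER BY d.year
        """
    )
    rows = cur.fetchall()
    conn.close()
    return rows


if __name__ == "__main__":
    print(run_star_schema_query())
```

The point of the sketch is only the shape of the query (one fact table
joined to a dimension table, grouped by a dimension attribute); how an
engine executes it is a separate matter.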

Thanks.





Best Regards!
---------------------

Luke Han

2015-03-21 19:24 GMT-07:00 Li Yang <li...@apache.org>:

> 8. Kylin relies on Hadoop for cube builds and HBase for storage. Druid has
> its own computation and storage cluster. For those already running
> Hadoop & HBase, Kylin is a small addition to the deployment. Druid always
> requires its own cluster to be deployed.
>
> Thanks Xu!  Great answer!   (FAQ also updated.)

Re: Kylin vs Druid

Posted by Li Yang <li...@apache.org>.
8. Kylin relies on Hadoop for cube builds and HBase for storage. Druid has
its own computation and storage cluster. For those already running
Hadoop & HBase, Kylin is a small addition to the deployment. Druid always
requires its own cluster to be deployed.

Thanks Xu!  Great answer!   (FAQ also updated.)

On Sun, Mar 22, 2015 at 1:33 AM, hongbin ma <ma...@apache.org> wrote:

> Thanks, Jiang Xu.
>
> I have included your answer as a FAQ:
>
> https://github.com/KylinOLAP/Kylin/blob/staging/docs/MISC/FAQ%20on%20Kylin%20Installation%20and%20Usage.md#what-is-the-difference-between-kylin-and-druid

Re: Kylin vs Druid

Posted by hongbin ma <ma...@apache.org>.
Thanks, Jiang Xu.

I have included your answer as a FAQ:
https://github.com/KylinOLAP/Kylin/blob/staging/docs/MISC/FAQ%20on%20Kylin%20Installation%20and%20Usage.md#what-is-the-difference-between-kylin-and-druid

On Sat, Mar 21, 2015 at 8:30 AM, 蒋旭 <ji...@qq.com> wrote:

> Let me give some simple explanations below. Since I am more familiar
> with Kylin, these explanations may not be 100% accurate. :-)
>
> 1. Druid is better suited to realtime analysis. Kylin focuses more on
> the OLAP case.
>
> 2. Druid integrates well with Kafka for realtime streaming. Kylin's
> realtime capability is still under development.
>
> 3. Druid uses a bitmap index as its internal data structure. Kylin uses a
> bitmap index for realtime data and a MOLAP cube for historical data.
>
> 4. Kylin provides ANSI SQL. Druid provides its own query language.
>
> 5. Druid has limitations on table joins. Kylin supports star schemas.
>
> 6. Kylin integrates well with BI tools such as Tableau or Excel.
> Druid can't integrate with existing BI tools.
>
> 7. Since Kylin supports MOLAP cubes, it performs very well for
> complex queries on billion-scale data sets. Since Druid needs to scan the
> full index, its performance may suffer if the data set and query range are
> too big.
>
>
> Thanks
> Jiang Xu



-- 
Regards,

*Bin Mahone | 马洪宾*
Apache Kylin: http://kylin.io
Github: https://github.com/binmahone

Re: Kylin vs Druid

Posted by 蒋旭 <ji...@qq.com>.
Let me give some simple explanations below. Since I am more familiar with Kylin, these explanations may not be 100% accurate. :-)


1. Druid is better suited to realtime analysis. Kylin focuses more on the OLAP case.


2. Druid integrates well with Kafka for realtime streaming. Kylin's realtime capability is still under development.


3. Druid uses a bitmap index as its internal data structure. Kylin uses a bitmap index for realtime data and a MOLAP cube for historical data.


4. Kylin provides ANSI SQL. Druid provides its own query language.


5. Druid has limitations on table joins. Kylin supports star schemas.


6. Kylin integrates well with BI tools such as Tableau or Excel. Druid can't integrate with existing BI tools.


7. Since Kylin supports MOLAP cubes, it performs very well for complex queries on billion-scale data sets. Since Druid needs to scan the full index, its performance may suffer if the data set and query range are too big.
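
Points 3 and 7 can be illustrated with a toy sketch. This is not how Druid or Kylin actually store or query data; it only contrasts the two general ideas: filtering rows through a bitmap index and aggregating at query time, versus looking up an aggregate that was precomputed for every combination of dimensions (a MOLAP-style cube). The rows, the dimension names, and all function names are assumptions made up for the example.

```python
from collections import defaultdict
from itertools import combinations

# Hypothetical fact rows: (country, device, clicks).
ROWS = [
    ("US", "mobile", 3),
    ("US", "desktop", 5),
    ("JP", "mobile", 2),
    ("JP", "mobile", 4),
]
DIMENSIONS = ("country", "device")


def build_bitmap_index(rows):
    """Map each (dimension, value) pair to an int whose set bits mark matching rows."""
    index = defaultdict(int)
    for row_id, row in enumerate(rows):
        for dim, value in zip(DIMENSIONS, row):
            index[(dim, value)] |= 1 << row_id
    return index


def query_with_bitmaps(rows, index, filters):
    """AND the bitmaps of all filters, then scan the surviving rows to sum clicks."""
    bitmap = (1 << len(rows)) - 1  # start with every row selected
    for dim, value in filters.items():
        bitmap &= index.get((dim, value), 0)
    return sum(row[2] for row_id, row in enumerate(rows) if (bitmap >> row_id) & 1)


def build_cube(rows):
    """Precompute SUM(clicks) for every combination of dimensions."""
    cube = defaultdict(int)
    for row in rows:
        for size in range(len(DIMENSIONS) + 1):
            for dims in combinations(range(len(DIMENSIONS)), size):
                key = tuple((DIMENSIONS[i], row[i]) for i in dims)
                cube[key] += row[2]
    return cube


def query_with_cube(cube, filters):
    """Answer the same query with a single lookup into the precomputed aggregates."""
    key = tuple((dim, filters[dim]) for dim in DIMENSIONS if dim in filters)
    return cube.get(key, 0)


if __name__ == "__main__":
    filters = {"country": "JP", "device": "mobile"}
    print(query_with_bitmaps(ROWS, build_bitmap_index(ROWS), filters))
    print(query_with_cube(build_cube(ROWS), filters))
```

Both functions return 6 for the filter country = JP, device = mobile. The bitmap version still touches every matching row at query time, while the cube version pays that cost once at build time and stores one entry per dimension combination, which is the trade-off point 7 describes.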


Thanks
Jiang Xu


------------------ Original Message ------------------
From: ワンレイ(ケビンKevin) <wa...@simplex-cn.co.jp>
Sent: 2015-03-20 14:44
To: dev <de...@kylin.incubator.apache.org>
Cc: shigira <sh...@simplex-cn.co.jp>
Subject: Kylin vs Druid



Hi,

A quick question: how do you see Kylin as different from Druid, another
tool built to serve large quantities of analytic data with low
latency? Do the two tools solve a similar problem with different
approaches?

Thanks!
Kevin