You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@kylin.apache.org by li...@apache.org on 2019/04/19 05:58:35 UTC
svn commit: r1857785 [3/3] - in /kylin/site: ./ blog/ blog/2019/04/19/
blog/2019/04/19/release-v3.0.0-alpha/ cn/blog/2019/04/ cn/blog/2019/04/19/
cn/blog/2019/04/19/release-v3.0.0-alpha/ docs30/tutorial/
Modified: kylin/site/feed.xml
URL: http://svn.apache.org/viewvc/kylin/site/feed.xml?rev=1857785&r1=1857784&r2=1857785&view=diff
==============================================================================
--- kylin/site/feed.xml (original)
+++ kylin/site/feed.xml Fri Apr 19 05:58:35 2019
@@ -19,11 +19,123 @@
<description>Apache Kylin Home</description>
<link>http://kylin.apache.org/</link>
<atom:link href="http://kylin.apache.org/feed.xml" rel="self" type="application/rss+xml"/>
- <pubDate>Thu, 18 Apr 2019 06:59:32 -0700</pubDate>
- <lastBuildDate>Thu, 18 Apr 2019 06:59:32 -0700</lastBuildDate>
+ <pubDate>Thu, 18 Apr 2019 22:44:29 -0700</pubDate>
+ <lastBuildDate>Thu, 18 Apr 2019 22:44:29 -0700</lastBuildDate>
<generator>Jekyll v2.5.3</generator>
<item>
+ <title>Apache Kylin v3.0.0-alpha Release Announcement</title>
+ <description><p>The Apache Kylin community is pleased to announce the release of Apache Kylin v3.0.0-alpha.</p>
+
+<p>Apache Kylin is an open source Distributed Analytics Engine designed to provide SQL interface and multi-dimensional analysis (OLAP) on Big Data supporting extremely large datasets.</p>
+
+<p>This is the first release of the new generation v3.x, the main feature introduced is the Real-time OLAP. All of the changes can be found in the <a href="/docs/release_notes.html">release notes</a>. Here we just highlight the main features.</p>
+
+<h1 id="important-features">Important features</h1>
+
+<h3 id="kylin-3654---real-time-olap">KYLIN-3654 - Real-time OLAP</h3>
+<p>With the newly introduced Kylin real-time receiver and coordinator components, Kylin can implement a millisecond-level data preparation delay for streaming data from sources like Apache Kafka. This means since v3.0 on, Kylin can support sub-second level OLAP over historical batch data, near real-time streaming as well as real-time streaming. The user can use one OLAP platform to serve different scenarios. This solution has been deployed and verified in early adopters like eBay since 2018. For how to enable it, please refer to <a href="/docs30/tutorial/realtime_olap.html">this tutorial</a>.</p>
+
+<h3 id="kylin-3795---submit-spark-jobs-via-apache-livy">KYLIN-3795 - Submit Spark jobs via Apache Livy</h3>
+<p>This feature allows the administrator to configure Kylin to integrate with Apache Livy (incubating) for Spark job submissions. The Spark job is submitted to the Livy Server through Livyâs REST API, instead of starting the Spark Driver process in local, which facilitates the management and monitoring of the Spark resources, and also releases the pressure of the nodes where the Kylin job server is running.</p>
+
+<h3 id="kylin-3820---a-curator-based-job-scheduler">KYLIN-3820 - A curator-based job scheduler</h3>
+<p>A new job scheduler is added to automatically discover the Kylin nodes and do an automatic leader selection among them (only the leader will submit jobs). With this feature, you can easily deploy and scale out Kylin nodes without manually update the node address in <code class="highlighter-rouge">kylin.properties</code> and restart Kylin to take effective.</p>
+
+<h1 id="other-enhancements">Other enhancements</h1>
+
+<h3 id="kylin-3716---fastthreadlocal-replaces-threadlocal">KYLIN-3716 - FastThreadLocal replaces ThreadLocal</h3>
+<p>Using FastThreadLocal instead of ThreadLocal can improve Kylinâs overall performance to some extent.</p>
+
+<h3 id="kylin-3867---enable-jdbc-to-use-key-store--trust-store-for-https-connection">KYLIN-3867 - Enable JDBC to use key store &amp; trust store for https connection</h3>
+<p>By using HTTPS, the authentication information used by JDBC is protected, making Kylin more secure.</p>
+
+<h3 id="kylin-3905---enable-shrunken-dictionary-default">KYLIN-3905 - Enable shrunken dictionary default</h3>
+<p>By default, the shrunken dictionary is enabled, and the precise counting scene for high cardinal dimensions can significantly reduce the build time.</p>
+
+<h3 id="kylin-3839---storage-clean-up-after-the-refreshing-and-deleting-a-segment">KYLIN-3839 - Storage clean up after the refreshing and deleting a segment</h3>
+<p>Clear unnecessary data files in a timely manner</p>
+
+<p><strong>Download</strong></p>
+
+<p>To download Apache Kylin v3.0.0-alpha source code or binary package, visit the <a href="http://kylin.apache.org/download">download</a> page.</p>
+
+<p><strong>Upgrade</strong></p>
+
+<p>Follow the <a href="/docs/howto/howto_upgrade.html">upgrade guide</a>.</p>
+
+<p><strong>Feedback</strong></p>
+
+<p>If you face issue or question, please send mail to Apache Kylin dev or user mailing list:Â dev@kylin.apache.org , user@kylin.apache.org; Before sending, please make sure you have subscribed the mailing list by dropping an email to dev-subscribe@kylin.apache.org or user-subscribe@kylin.apache.org.</p>
+
+<p><em>Great thanks to everyone who contributed!</em></p>
+</description>
+ <pubDate>Fri, 19 Apr 2019 13:00:00 -0700</pubDate>
+ <link>http://kylin.apache.org/blog/2019/04/19/release-v3.0.0-alpha/</link>
+ <guid isPermaLink="true">http://kylin.apache.org/blog/2019/04/19/release-v3.0.0-alpha/</guid>
+
+
+ <category>blog</category>
+
+ </item>
+
+ <item>
+ <title>Apache Kylin v3.0.0-alpha åå¸</title>
+ <description><p>è¿æ¥ Apache Kylin 社åºå¾é«å
´å°å®£å¸ï¼Apache Kylin v3.0.0-alpha æ£å¼åå¸ã</p>
+
+<p>Apache Kylin æ¯ä¸ä¸ªå¼æºçåå¸å¼åæå¼æï¼æ¨å¨ä¸ºæ大æ°æ®éæä¾ SQL æ¥å£åå¤ç»´åæï¼OLAPï¼çè½åã</p>
+
+<p>è¿æ¯ Kylin ä¸ä¸ä»£ v3.x ç第ä¸ä¸ªåå¸çæ¬ï¼ç¨äºæ©æé¢è§ï¼ä¸»è¦çåè½æ¯å®æ¶ ï¼Real-timeï¼ OLAPãå®æ´çæ¹å¨å表请åè§<a href="/docs/release_notes.html">release notes</a>ï¼è¿éæä¸äºä¸»è¦æ¹è¿å说æã</p>
+
+<h1 id="section">éè¦æ°åè½</h1>
+
+<h3 id="kylin-3654----olap">KYLIN-3654 - å®æ¶ OLAP</h3>
+<p>éçå¼å
¥æ°ç real-time receiver å coordinator ç»ä»¶ï¼Kylin è½å¤å®ç°æ¯«ç§çº§å«çæ°æ®åå¤å»¶è¿ï¼æ°æ®æºæ¥èªæµå¼æ°æ®å¦ Apache Kafkaãè¿æå³çï¼ä» v3.0 å¼å§ï¼Kylin æ¢è½å¤æ¯æåå²æ¹éæ°æ®ç OLAPï¼ä¹æ¯æ对æµå¼æ°æ®çåå®æ¶ï¼Near real-timeï¼ä»¥åå®å
¨å®æ¶(real-time)åæãç¨æ·å¯ä»¥ä½¿ç¨ä¸ä¸ª OLAP å¹³å°æ¥æå¡ä¸åç使ç¨åºæ¯ãæ¤æ¹æ¡å·²ç»å¨æ©æç¨æ·å¦ eBay å¾å°é¨ç½²åéªè¯ãå
³äºå¦ä½ä½¿ç¨æ¤åè½ï¼è¯·åè<a href="/docs30/tutorial/realtime_olap.html">æ¤æ
ç¨</a>ã</p>
+
+<h3 id="kylin-3795----apache-livy--spark-">KYLIN-3795 - éè¿ Apache Livy é交 Spark ä»»å¡</h3>
+<p>è¿ä¸ªåè½å
许管çå为 Kylin é
ç½®ä½¿ç¨ Apache Livy (incubating) æ¥å®æä»»å¡çé交ãSpark ä½ä¸çæ交éè¿ Livy ç REST API æ¥æ交ï¼èæ éå¨æ¬å°å¯å¨ Spark Driver è¿ç¨ï¼ä»èæ¹ä¾¿å¯¹ Spark èµæºç管ççæ§ï¼åæ¶ä¹éä½å¯¹ Kylin ä»»å¡è¿ç¨æå¨èç¹çååã</p>
+
+<h3 id="kylin-3820----curator-">KYLIN-3820 - åºäº Curator çä»»å¡èç¹åé
åæå¡åç°</h3>
+<p>æ°å¢ä¸ç§åºäºApache Zookeeper å Curatorä½ä¸è°åº¦å¨ï¼å¯ä»¥èªå¨åç° Kylin èç¹ï¼å¹¶èªå¨åé
ä¸ä¸ªèç¹æ¥è¿è¡ä»»å¡ç管ç以åæ
éæ¢å¤ãæäºè¿ä¸ªåè½åï¼ç®¡çåå¯ä»¥æ´å 容æå°é¨ç½²åæ©å± Kylin èç¹ï¼èä¸åéè¦å¨ <code class="highlighter-rouge">kylin.properties</code> ä¸é
ç½®æ¯ä¸ª Kylin èç¹çå°å并éå¯ Kylin 以使ä¹çæã</p>
+
+<h1 id="section-1">å
¶å®æ¹è¿</h1>
+
+<h3 id="kylin-3716---fastthreadlocal--threadlocal">KYLIN-3716 - FastThreadLocal æ¿æ¢ ThreadLocal</h3>
+<p>ä½¿ç¨ Netty ä¸ç FastThreadLocal æ¿ä»£ JDK åçç ThreadLocalï¼å¯ä»¥ä¸å®ç¨åº¦ä¸æå Kylin å¨é«å¹¶åä¸çæ§è½ã</p>
+
+<h3 id="kylin-3867---enable-jdbc-to-use-key-store--trust-store-for-https-connection">KYLIN-3867 - Enable JDBC to use key store &amp; trust store for https connection</h3>
+<p>éè¿ä½¿ç¨HTTPSï¼ä¿æ¤äºJDBC使ç¨ç身份éªè¯ä¿¡æ¯ï¼ä½¿å¾Kylinæ´å å®å
¨</p>
+
+<h3 id="kylin-3905---enable-shrunken-dictionary-default">KYLIN-3905 - Enable shrunken dictionary default</h3>
+<p>é»è®¤å¼å¯ shrunken dictionaryï¼é对é«åºç»´è¿è¡ç²¾ç¡®å»éçåºæ¯ï¼å¯ä»¥æ¾èåå°æ建ç¨æ¶ã</p>
+
+<h3 id="kylin-3839---storage-clean-up-after-the-refreshing-and-deleting-a-segment">KYLIN-3839 - Storage clean up after the refreshing and deleting a segment</h3>
+<p>æ´å åæ¶å°æ¸
é¤ä¸å¿
è¦çæ°æ®æ件</p>
+
+<p><strong>ä¸è½½</strong></p>
+
+<p>è¦ä¸è½½Apache Kylin æºä»£ç æäºè¿å¶å
ï¼è¯·è®¿é®<a href="/download">ä¸è½½é¡µé¢</a> page.</p>
+
+<p><strong>å级</strong></p>
+
+<p>åè<a href="/docs/howto/howto_upgrade.html">å级æå</a>.</p>
+
+<p><strong>åé¦</strong></p>
+
+<p>å¦ææ¨éå°é®é¢æçé®ï¼è¯·åéé®ä»¶è³ Apache Kylin dev æ user é®ä»¶å表ï¼dev@kylin.apache.orgï¼user@kylin.apache.org; å¨åéä¹åï¼è¯·ç¡®ä¿æ¨å·²éè¿åéçµåé®ä»¶è³ dev-subscribe@kylin.apache.org æ user-subscribe@kylin.apache.org 订é
äºé®ä»¶å表ã</p>
+
+<p><em>é常æè°¢ææè´¡ç®Apache Kylinçæå!</em></p>
+</description>
+ <pubDate>Fri, 19 Apr 2019 13:00:00 -0700</pubDate>
+ <link>http://kylin.apache.org/cn/blog/2019/04/19/release-v3.0.0-alpha/</link>
+ <guid isPermaLink="true">http://kylin.apache.org/cn/blog/2019/04/19/release-v3.0.0-alpha/</guid>
+
+
+ <category>blog</category>
+
+ </item>
+
+ <item>
<title>Real-time Streaming Design in Apache Kylin</title>
<description><h2 id="why-build-real-time-streaming-in-kylin">Why Build Real-time Streaming in Kylin</h2>
<p>The real-time streaming feature is contributed by eBay big data team in Kylin 3.0, the purpose we build real-time streaming is:</p>
@@ -854,70 +966,6 @@ Graphic 10 Process of Querying Cube</
</item>
<item>
- <title>Apache Kylin v2.5.0 æ£å¼åå¸</title>
- <description><p>è¿æ¥Apache Kylin 社åºå¾é«å
´å°å®£å¸ï¼Apache Kylin 2.5.0 æ£å¼åå¸ã</p>
-
-<p>Apache Kylin æ¯ä¸ä¸ªå¼æºçåå¸å¼åæå¼æï¼æ¨å¨ä¸ºæ大æ°æ®éæä¾ SQL æ¥å£åå¤ç»´åæï¼OLAPï¼çè½åã</p>
-
-<p>è¿æ¯ç»§2.4.0 åçä¸ä¸ªæ°åè½çæ¬ã该çæ¬å¼å
¥äºå¾å¤æä»·å¼çæ¹è¿ï¼å®æ´çæ¹å¨å表请åè§<a href="https://kylin.apache.org/docs/release_notes.html">release notes</a>ï¼è¿éæä¸äºä¸»è¦æ¹è¿å说æï¼</p>
-
-<h3 id="all-in-spark--cubing-">All-in-Spark ç Cubing å¼æ</h3>
-<p>Kylin ç Spark å¼æå°ä½¿ç¨ Spark è¿è¡ cube 计ç®ä¸çææåå¸å¼ä½ä¸ï¼å
æ¬è·åå个维度çä¸åå¼ï¼å° cuboid æ件转æ¢ä¸º HBase HFileï¼å并 segmentï¼å并è¯å
¸çãé»è®¤ç Spark é
ç½®ä¹ç»è¿ä¼åï¼ä½¿å¾ç¨æ·å¯ä»¥è·å¾å¼ç®±å³ç¨çä½éªãç¸å
³å¼åä»»å¡æ¯ KYLIN-3427, KYLIN-3441, KYLIN-3442.</p>
-
-<p>Spark ä»»å¡ç®¡çä¹æææ¹è¿ï¼ä¸æ¦ Spark ä»»å¡å¼å§è¿è¡ï¼æ¨å°±å¯ä»¥å¨Webæ§å¶å°ä¸è·å¾ä½ä¸é¾æ¥ï¼å¦ææ¨ä¸¢å¼è¯¥ä½ä¸ï¼Kylin å°ç«å»ç»æ¢ Spark ä½ä¸ä»¥åæ¶éæ¾èµæºï¼å¦æéæ°å¯å¨ Kylinï¼å®å¯ä»¥ä»ä¸ä¸ä¸ªä½ä¸æ¢å¤ï¼èä¸æ¯éæ°æ交æ°ä½ä¸.</p>
-
-<h3 id="mysql--kylin-">MySQL å Kylin å
æ°æ®çåå¨</h3>
-<p>å¨è¿å»ï¼HBase æ¯ Kylin å
æ°æ®åå¨çå¯ä¸éæ©ã å¨æäºæ
åµä¸ HBaseä¸éç¨ï¼ä¾å¦ä½¿ç¨å¤ä¸ª HBase é群æ¥ä¸º Kylin æä¾è·¨åºåçé«å¯ç¨ï¼è¿éå¤å¶ç HBase é群æ¯åªè¯»çï¼æ以ä¸è½åå
æ°æ®åå¨ãç°å¨æ们å¼å
¥äº MySQL Metastore 以满足è¿ç§éæ±ãæ¤åè½ç°å¨å¤äºæµè¯é¶æ®µãæ´å¤å
容åè§ KYLIN-3488ã</p>
-
-<h3 id="hybrid-model-">Hybrid model å¾å½¢çé¢</h3>
-<p>Hybrid æ¯ä¸ç§ç¨äºç»è£
å¤ä¸ª cube çé«çº§æ¨¡åã å®å¯ç¨äºæ»¡è¶³ cube ç schema è¦åçæ¹åçæ
åµãè¿ä¸ªåè½è¿å»æ²¡æå¾å½¢çé¢ï¼å æ¤åªæä¸å°é¨åç¨æ·ç¥éå®ãç°å¨æä»¬å¨ Web çé¢ä¸å¼å¯äºå®ï¼ä»¥ä¾¿æ´å¤ç¨æ·å¯ä»¥å°è¯ã</p>
-
-<h3 id="cube-planner">é»è®¤å¼å¯ Cube planner</h3>
-<p>Cube planner å¯ä»¥æ大å°ä¼å cube ç»æï¼åå°æ建ç cuboid æ°éï¼ä»èèç计ç®/åå¨èµæºå¹¶æé«æ¥è¯¢æ§è½ãå®æ¯å¨v2.3ä¸å¼å
¥çï¼ä½é»è®¤æ
åµä¸æ²¡æå¼å¯ã为äºè®©æ´å¤ç¨æ·çå°å¹¶å°è¯å®ï¼æ们é»è®¤å¨v2.5ä¸å¯ç¨å®ã ç®æ³å°å¨ç¬¬ä¸æ¬¡æ建 segment çæ¶åï¼æ ¹æ®æ°æ®ç»è®¡èªå¨ä¼å cuboid éå.</p>
-
-<h3 id="segment-">æ¹è¿ç Segment åªæ</h3>
-<p>Segmentï¼ååºï¼ä¿®åªå¯ä»¥ææå°åå°ç£çåç½ç»I / Oï¼å æ¤å¤§å¤§æé«äºæ¥è¯¢æ§è½ã è¿å»ï¼Kylin åªæååºå (partition date column) çå¼è¿è¡ segment çä¿®åªã å¦ææ¥è¯¢ä¸æ²¡æå°ååºåä½ä¸ºè¿æ»¤æ¡ä»¶ï¼é£ä¹ä¿®åªå°ä¸èµ·ä½ç¨ï¼ä¼æ«æææsegmentã.<br />
-ç°å¨ä»v2.5å¼å§ï¼Kylin å°å¨ segment 级å«è®°å½æ¯ä¸ªç»´åº¦çæå°/æ大å¼ã å¨æ«æ segment ä¹åï¼ä¼å°æ¥è¯¢çæ¡ä»¶ä¸æå°/æ大索å¼è¿è¡æ¯è¾ã å¦æä¸å¹é
ï¼å°è·³è¿è¯¥ segmentã æ£æ¥KYLIN-3370äºè§£æ´å¤ä¿¡æ¯ã</p>
-
-<h3 id="yarn-">å¨ YARN ä¸å并åå
¸</h3>
-<p>å½ segment å并æ¶ï¼å®ä»¬çè¯å
¸ä¹éè¦å并ãå¨è¿å»ï¼åå
¸å并åçå¨ Kylin ç JVM ä¸ï¼è¿éè¦ä½¿ç¨å¤§éçæ¬å°å
åå CPU èµæºã å¨æ端æ
åµä¸ï¼å¦ææå 个并åä½ä¸ï¼ï¼å¯è½ä¼å¯¼è´ Kylin è¿ç¨å´©æºã å æ¤ï¼ä¸äºç¨æ·ä¸å¾ä¸ä¸º Kylin ä»»å¡èç¹åé
æ´å¤å
åï¼æè¿è¡å¤ä¸ªä»»å¡èç¹ä»¥å¹³è¡¡å·¥ä½è´è½½ã<br />
-ç°å¨ä»v2.5å¼å§ï¼Kylin å°æè¿é¡¹ä»»å¡æäº¤ç» Hadoop MapReduce å Sparkï¼è¿æ ·å°±å¯ä»¥è§£å³è¿ä¸ªç¶é¢é®é¢ã æ¥çKYLIN-3471äºè§£æ´å¤ä¿¡æ¯.</p>
-
-<h3 id="cube-">æ¹è¿ä½¿ç¨å
¨å±åå
¸ç cube æ建æ§è½</h3>
-<p>å
¨å±åå
¸ (Global Dictionary) æ¯ bitmap 精确å»é计æ°çå¿
è¦æ¡ä»¶ãå¦æå»éåå
·æé常é«çåºæ°ï¼å GD å¯è½é常大ãå¨ cube æ建é¶æ®µï¼Kylin éè¦éè¿ GD å°éæ´æ°å¼è½¬æ¢ä¸ºæ´æ°ã尽管 GD 已被åæå¤ä¸ªåçï¼å¯ä»¥åå¼å è½½å°å
åï¼ä½æ¯ç±äºå»éåçå¼æ¯ä¹±åºçãKylin éè¦åå¤è½½å
¥åè½½åº(swap in/out)åçï¼è¿ä¼å¯¼è´æ建任å¡é常ç¼æ
¢ã<br />
-该å¢å¼ºåè½å¼å
¥äºä¸ä¸ªæ°æ¥éª¤ï¼ä¸ºæ¯ä¸ªæ°æ®åä»å
¨å±åå
¸ä¸æ建ä¸ä¸ªç¼©å°çåå
¸ã éåæ¯ä¸ªä»»å¡åªéè¦å 载缩å°çåå
¸ï¼ä»èé¿å
é¢ç¹çè½½å
¥åè½½åºãæ§è½å¯ä»¥æ¯ä»¥åå¿«3åãæ¥ç KYLIN-3491 äºè§£æ´å¤ä¿¡æ¯.</p>
-
-<h3 id="topn-count-distinct--cube-">æ¹è¿å« TOPN, COUNT DISTINCT ç cube 大å°ç估计</h3>
-<p>Cube ç大å°å¨æ建æ¶æ¯é¢å
估计çï¼å¹¶è¢«åç»å 个æ¥éª¤ä½¿ç¨ï¼ä¾å¦å³å® MR / Spark ä½ä¸çååºæ°ï¼è®¡ç® HBase region åå²çãå®çåç¡®ä¸å¦ä¼å¯¹æ建æ§è½äº§çå¾å¤§å½±åã å½åå¨ COUNT DISTINCTï¼TOPN ç度éæ¶åï¼å 为å®ä»¬ç大å°æ¯çµæ´»çï¼å æ¤ä¼°è®¡å¼å¯è½è·çå®å¼æå¾å¤§åå·®ã å¨è¿å»ï¼ç¨æ·éè¦è°æ´è¥å¹²ä¸ªåæ°ä»¥ä½¿å°ºå¯¸ä¼°è®¡æ´æ¥è¿å®é
尺寸ï¼è¿å¯¹æ®éç¨æ·æç¹å°é¾ã<br />
-ç°å¨ï¼Kylin å°æ ¹æ®æ¶éçç»è®¡ä¿¡æ¯èªå¨è°æ´å¤§å°ä¼°è®¡ãè¿å¯ä»¥ä½¿ä¼°è®¡å¼ä¸å®é
大å°æ´æ¥è¿ãæ¥ç KYLIN-3453 äºè§£æ´å¤ä¿¡æ¯ã</p>
-
-<h3 id="hadoop-30hbase-20">æ¯æHadoop 3.0/HBase 2.0</h3>
-<p>Hadoop 3å HBase 2å¼å§è¢«è®¸å¤ç¨æ·éç¨ãç°å¨ Kylin æä¾ä½¿ç¨æ°ç Hadoop å HBase API ç¼è¯çæ°äºè¿å¶å
ãæ们已ç»å¨ Hortonworks HDP 3.0 å Cloudera CDH 6.0 ä¸è¿è¡äºæµè¯</p>
-
-<p><strong>ä¸è½½</strong></p>
-
-<p>è¦ä¸è½½Apache Kylin v2.5.0æºä»£ç æäºè¿å¶å
ï¼è¯·è®¿é®<a href="http://kylin.apache.org/download">ä¸è½½é¡µé¢</a> .</p>
-
-<p><strong>å级</strong></p>
-
-<p>åè<a href="/docs/howto/howto_upgrade.html">å级æå</a>.</p>
-
-<p><strong>åé¦</strong></p>
-
-<p>å¦ææ¨éå°é®é¢æçé®ï¼è¯·åéé®ä»¶è³ Apache Kylin dev æ user é®ä»¶å表ï¼dev@kylin.apache.orgï¼user@kylin.apache.org; å¨åéä¹åï¼è¯·ç¡®ä¿æ¨å·²éè¿åéçµåé®ä»¶è³ dev-subscribe@kylin.apache.org æ user-subscribe@kylin.apache.org订é
äºé®ä»¶å表ã</p>
-
-<p><em>é常æè°¢ææè´¡ç®Apache Kylinçæå!</em></p>
-</description>
- <pubDate>Thu, 20 Sep 2018 13:00:00 -0700</pubDate>
- <link>http://kylin.apache.org/cn/blog/2018/09/20/release-v2.5.0/</link>
- <guid isPermaLink="true">http://kylin.apache.org/cn/blog/2018/09/20/release-v2.5.0/</guid>
-
-
- <category>blog</category>
-
- </item>
-
- <item>
<title>Apache Kylin v2.5.0 Release Announcement</title>
<description><p>The Apache Kylin community is pleased to announce the release of Apache Kylin v2.5.0.</p>
@@ -990,287 +1038,63 @@ Graphic 10 Process of Querying Cube</
</item>
<item>
- <title>Use Star Schema Benchmark for Apache Kylin</title>
- <description><h2 id="background">Background</h2>
-
-<p>For many Apache Kylin users, when deploying Kylin in the production environment, how to measure Kylinâs performance before delivering to the business is a problem. A performance benchmark can help to find the potential performance issues, so you can tune the configuration to improve the overall performance. The tunning may include Kylinâs own Job and Query, concurrent building of Cubes, HBase write and read, MapReduce or Spark parameters and more.</p>
-
-<h2 id="ssb-introduction">SSB Introduction</h2>
-<p>Kyligence Inc provides an SSB (Star Schema Benchmark) project called <a href="https://github.com/Kyligence/ssb-kylin">ssb-kylin</a> on github, which is modified from the TPC-H benchmark, and specifically targeted to test tools in the star model OLAP scenario.</p>
-
-<p>The test process generates 5 tables, and the data volume can be adjusted by parameters. The table structure of SSB is shown below:</p>
-
-<p><img src="/images/blog/1. The table structure of SSB.png" alt="" /></p>
-
-<p>The table âlineorderâ is the fact table, the other four are dimension tables. Each dimension table is associated with the fact table by the primary key, which is a standard star schema.</p>
-
-<p>The environment for this test is CDH 5.13.3, which enables authentication and authorization of Kerberos and OpenLDAP, and uses Sentry to provide fine-grained, role-based authorization and multi-tenant management. However, the official âssb-kylinâ does not involve the processing of permissions and authentication, so I have slightly modified it. For details, see my fork <a href="https://github.com/jiangshouzhuang/ssb-kylin">jiangshouzhuang/ssb-kylin</a>.</p>
-
-<h2 id="prerequisites">Prerequisites</h2>
-
-<p>** Here is a description of the Kylin deployment:**<br />
-ãã1. Kylin deploys integrated OpenLDAP user unified authentication management<br />
-ãã2. Add Kylin deployment user kylin_manager_user in OpenLDAP (user group is kylin_manager_group)<br />
-ãã3. The Kylin version is apache-kylin-2.4.0<br />
-ãã4. Kylin Cluster configuration (VM):<br />
-ããKylin Job 1 node: 16GB, 8Cores<br />
-ããKylin Query 2 nodes: 32GB, 8Cores<br />
-<strong>A few points before SSB pressure measurement:</strong><br />
-1 Create a database named ssb in the Hive database.</p>
-<pre name="code" class="java">
-# Log in to the hive database as a super administrator.
-Create database SSB;
-CREATE ROLE ssb_write_role;
-GRANT ALL ON DATABASE ssb TO ROLE ssb_write_role;
-GRANT ROLE ssb_write_role TO GROUP ssb_write_group;
-# Then add kylin_manager_user to kylin_manager_group in OpenLDAP, so kylin_manager_user has access to the ssb database.
-</pre>
-<p>2 Assign HDFS directory /user/kylin_manager_user read and write permissions to kylin_manager_user user.<br />
-3 Configure the HADOOP_STREAMING_JAR environment variable under the kylin_manager_user user home directory.<br />
-<code class="highlighter-rouge">
-Export HADOOP_STREAMING_JAR=/opt/cloudera/parcels/CDH/lib/hadoop-mapreduce/hadoop-streaming.jar
-</code></p>
-
-<h2 id="download-the-ssb-tool-and-compile">Download the SSB tool and compile</h2>
-
-<p>You can quickly download and compile the ssb test tool by entering the following command in the linux terminal.</p>
-
-<div class="highlighter-rouge"><pre class="highlight"><code>git clone https://github.com/jiangshouzhuang/ssb-kylin.git
-cd ssb-kylin
-cd ssb-benchmark
-make clean
-make
-</code></pre>
-</div>
-
-<h2 id="adjust-the-ssb-parameters">Adjust the SSB parameters</h2>
-
-<p>In the ssb-kylin project, there is a ssb.conf file below the bin directory, which defines the base data volume of the fact table and the dimension table. When we generate the amount of test data, we can specify the size of the scale so that the actual data is base * scale.</p>
-
-<p>Part of the ssb.conf file is:</p>
-
-<div class="highlighter-rouge"><pre class="highlight"><code> # customer base, default value is 30,000
- customer_base = 30000
- # part base, default value is 200,000
- part_base = 200000
- # supply base, default value is 2,000
- supply_base = 2000
- # date base (days), default value is 2,556
- date_base = 2556
- # lineorder base (purchase record), default value is 6,000,000
- lineorder_base = 6000000
-</code></pre>
-</div>
-
-<p>Of course, the above base parameters can be adjusted according to their actual needs, I use the default parameters.<br />
-In the ssb.conf file, there are some parameters as follows.</p>
-
-<div class="highlighter-rouge"><pre class="highlight"><code># manufacturer max. The value range is (1 .. manu_max)
-manu_max = 5
-# category max. The value range is (1 .. cat_max)
-cat_max = 5
-# brand max. The value range is (1 .. brand_max)
-brand_max = 40
-</code></pre>
-</div>
-
-<p><strong>The explanation is as follows:</strong> <br />
-manu_max, cat_max and brand_max are used to define hierarchical scale. For example, manu_max=10, cat_max=10, and brand_max=10 refer to a total of 10 manufactures, and each manufactures has a maximum of 10 category parts, and each category has up to 10 brands. Therefore, the cardinality of manufacture is 10, the cardinality of category is 100, and the cardinality of brand is 1000.</p>
-
-<div class="highlighter-rouge"><pre class="highlight"><code># customer: num of cities per country, default value is 100
-cust_city_max = 9
-# supplier: num of cities per country, default value is 100
-supp_city_max = 9
-</code></pre>
-</div>
-
-<p><strong>The explanation is as follows:</strong> <br />
-cust_city_max and supp_city_max are used to define the number of city for each country in customer and supplier tables. If the total number of country is 30, and cust_city_max=100, supp_city_max=10, then the customer table will have 3000 different city, and the supplier table will have 300 different city.</p>
-
-<p><strong>Prompt:</strong><br />
-In this pressure test, the resources allocated by Yarn are used to generate test data. If the memory problems are encountered in the process of generating the data, increase the memory size of the Yarn allocation of container.</p>
-
-<h2 id="generate-test-data">Generate test data</h2>
-
-<p>Before running the <code class="highlighter-rouge">ssb-kylin/bin/run.sh</code> script, explain several points to run.sh:<br />
-1 configuring HDFS_BASE_DIR as the path to table data, because I give kylin_manager_user the right to read and write to /user/kylin_manager_user directory, so configure here:</p>
-<pre name="code" class="java">
-HDFS_BASE_DIR=/user/kylin_manager_user/ssb
-</pre>
-<p>The temporary and actual data will be generated under this directory when you run run.sh.<br />
-2 configure the LDAP user and password for deploying Kylin, and operate KeyTab files such as HDFS.</p>
-<pre name="code" class="java">
-KYLIN_INSTALL_USER=kylin_manager_user
-KYLIN_INSTALL_USER_PASSWD=xxxxxxxx
-KYLIN_INSTALL_USER_KEYTAB=/home/${KYLIN_INSTALL_USER}/keytab/${KYLIN_INSTALL_USER}.keytab
-</pre>
-<p>3 configure the way that beeline accesses the hive database.</p>
-<pre name="code" class="java">
-BEELINE_URL=jdbc:hive2://hiveserve2_ip:10000
-HIVE_BEELINE_COMMAND="beeline -u ${BEELINE_URL} -n ${KYLIN_INSTALL_USER} -p
-${KYLIN_INSTALL_USER_PASSWD} -d org.apache.hive.jdbc.HiveDriver"
-</pre>
-<p>If your CDH or other big data platform is not using beeline, but hive cli, please modify it yourself.<br />
-Once everything is ready, we start running the program and generate test data:</p>
-
-<div class="highlighter-rouge"><pre class="highlight"><code>cd ssb-kylin
-bin/run.sh --scale 20
-</code></pre>
-</div>
-
-<p>We set the scale to 20, the program will run for a while, the maximum lineorder table data has more than 100 million. After the program is executed, we look at the tables in the hive database and the amount of data:</p>
-
-<div class="highlighter-rouge"><pre class="highlight"><code>use ssb;
-show tables;
-select count(1) from lineorder;
-select count(1) from p_lineorder;
-</code></pre>
-</div>
-
-<p><img src="/images/blog/2.1 generated tables.png" alt="" /></p>
-
-<p><img src="/images/blog/2.2 the volume of data.png" alt="" /></p>
-
-<p>As you can see, a total of five tables and one view were created.</p>
-
-<h2 id="load-the-cubes-metadata-and-build-the-cube">Load the cubeâs metadata and build the cube</h2>
-
-<p>The ssb-kylin project has helped us build the project, model, and cube in advance. Just import the Kylin directly like the learn_kylin example. Cube Metadataâs directory is cubemeta, because our kylin integrates OpenLDAP, there is no ADMIN user, so the owner parameter in cubemeta/cube/ssb.json is set to null.<br />
-Execute the following command to import cubemeta:</p>
-
-<div class="highlighter-rouge"><pre class="highlight"><code>cd ssb-kylin
-$KYLIN_HOME/bin/metastore.sh restore cubemeta
-</code></pre>
-</div>
-
-<p>Then log in to Kylin and execute Reload Metadata operation. This creates new project, model and cube in Kylin. Before building cube, first Disable, then Purge, delete old temporary files.</p>
-
-<p>The results of building with MapReduce are as follows:</p>
-
-<p><img src="/images/blog/3 build with mapReduce.png" alt="" /></p>
-
-<p>Here I test the performance of Spark to build Cube again, disable the previously created Cube, and then Purge. Since the Cube is used by Purge, the useless HBase tables and HDFS files need to be deleted. Here, manually clean up the junk files. First execute the following command:</p>
-
-<div class="highlighter-rouge"><pre class="highlight"><code>${KYLIN_HOME}/bin/kylin.sh org.apache.kylin.tool.StorageCleanupJob --delete false
-</code></pre>
-</div>
-
-<p>Then check whether the listed HBase table and the HDFS file are useless. After confirming the error, perform the delete operation:</p>
-
-<div class="highlighter-rouge"><pre class="highlight"><code>${KYLIN_HOME}/bin/kylin.sh org.apache.kylin.tool.StorageCleanupJob --delete true
-</code></pre>
-</div>
-
-<p>When using Spark to build a cube, it consumes a lot of memory. After all, using memory resources improves the speed of cube building. Here I will list some of the parameters of Spark in the kylin.properties configuration file:</p>
-
-<div class="highlighter-rouge"><pre class="highlight"><code>kylin.engine.spark-conf.spark.master=yarn
-kylin.engine.spark-conf.spark.submit.deployMode=cluster
-kylin.engine.spark-conf.spark.yarn.queue=root.kylin_manager_group
-# config Dynamic resource allocation
-kylin.engine.spark-conf.spark.dynamicAllocation.enabled=true
-kylin.engine.spark-conf.spark.dynamicAllocation.minExecutors=10
-kylin.engine.spark-conf.spark.dynamicAllocation.maxExecutors=1024
-kylin.engine.spark-conf.spark.dynamicAllocation.executorIdleTimeout=300
-
-kylin.engine.spark-conf.spark.shuffle.service.enabled=true
-kylin.engine.spark-conf.spark.shuffle.service.port=7337
-
-kylin.engine.spark-conf.spark.driver.memory=4G
-kylin.engine.spark-conf.spark.executor.memory=4G
-kylin.engine.spark-conf.spark.executor.cores=1
-kylin.engine.spark-conf.spark.network.timeout=600
-</code></pre>
-</div>
-
-<p>The above parameters can meet most of the requirements, so users basically do not need to configure when designing the Cube. Of course, if the situation is special, you can still set Spark-related tuning parameters at the Cube level.</p>
-
-<p>Before executing Spark to build a Cube, you need to set the Cube Engine value to Spark in Advanced Setting and then execute Build. After the construction is completed, the results are as follows:</p>
-
-<p><img src="/images/blog/4 build completely.png" alt="" /></p>
-
-<p>In contrast, the time for MapReduce and Spark to build Cube is as follows: (Scale=20):</p>
-
-<p><img src="/images/blog/5 the results of comparing Spark and MapReduce.png" alt="" /></p>
-
-<p>You can see that the speed of building is almost 1x faster. In fact, Spark has many other aspects of tuning (performance can be improved by 1-4 times and above), which is not involved here.</p>
-
-<h2 id="query">Query</h2>
-
-<p>Ssb-kylin provides 13 SSB query SQL lists. The query conditions may vary with the scale factor. You can modify the results according to the actual situation. The following examples show the test results in the case of scale 10 and 20:<br />
-The query result of Scale=10 is as follows:</p>
-
-<p><img src="/images/blog/6.1 scale 10.png" alt="" /></p>
-
-<p>The query result of Scale=20 is as follows:</p>
-
-<p><img src="/images/blog/6.2 scale 20.png" alt="" /></p>
-
-<p>As can be seen from the results, all the queries are completed within 1 s, which proves Apache Kylinâs subsecond query capability strongly. In addition, the average performance of the query did not decrease significantly as the amount of data doubled, which is also determined by the theory of Cube precomputation.</p>
-
-<p>Note: For details on each query statement, see the README.md description in the ssb-kylin project.</p>
+ <title>Apache Kylin v2.5.0 æ£å¼åå¸</title>
+ <description><p>è¿æ¥Apache Kylin 社åºå¾é«å
´å°å®£å¸ï¼Apache Kylin 2.5.0 æ£å¼åå¸ã</p>
-<p>At this point, the Kylinâs SSB pressure test is completed, but for you who are reading the article, everything is just beginning.</p>
+<p>Apache Kylin æ¯ä¸ä¸ªå¼æºçåå¸å¼åæå¼æï¼æ¨å¨ä¸ºæ大æ°æ®éæä¾ SQL æ¥å£åå¤ç»´åæï¼OLAPï¼çè½åã</p>
-<h2 id="references">References</h2>
+<p>è¿æ¯ç»§2.4.0 åçä¸ä¸ªæ°åè½çæ¬ã该çæ¬å¼å
¥äºå¾å¤æä»·å¼çæ¹è¿ï¼å®æ´çæ¹å¨å表请åè§<a href="https://kylin.apache.org/docs/release_notes.html">release notes</a>ï¼è¿éæä¸äºä¸»è¦æ¹è¿å说æï¼</p>
-<ol>
- <li>èå®å£®.<a href="https://juejin.im/post/5b46d0606fb9a04fd6593d31">å¦ä½ä½¿ç¨ Star Schema Benchmark åæµ Apache Kylin</a></li>
-</ol>
+<h3 id="all-in-spark--cubing-">All-in-Spark ç Cubing å¼æ</h3>
+<p>Kylin ç Spark å¼æå°ä½¿ç¨ Spark è¿è¡ cube 计ç®ä¸çææåå¸å¼ä½ä¸ï¼å
æ¬è·åå个维度çä¸åå¼ï¼å° cuboid æ件转æ¢ä¸º HBase HFileï¼å并 segmentï¼å并è¯å
¸çãé»è®¤ç Spark é
ç½®ä¹ç»è¿ä¼åï¼ä½¿å¾ç¨æ·å¯ä»¥è·å¾å¼ç®±å³ç¨çä½éªãç¸å
³å¼åä»»å¡æ¯ KYLIN-3427, KYLIN-3441, KYLIN-3442.</p>
-</description>
- <pubDate>Mon, 16 Jul 2018 05:28:00 -0700</pubDate>
- <link>http://kylin.apache.org/blog/2018/07/16/Star-Schema-Benchmark-on-Apache-Kylin/</link>
- <guid isPermaLink="true">http://kylin.apache.org/blog/2018/07/16/Star-Schema-Benchmark-on-Apache-Kylin/</guid>
-
-
- <category>blog</category>
-
- </item>
-
- <item>
- <title>Redash-Kylin plugin from Strikingly</title>
- <description><p>At strikingly, we are using Apache Kylin as our OLAP engine. Kylin is very powerful and it supports our big data business well. Weâve chosen Apache Kylin because it fits our demand: it handles a huge amount of data, undertakes multiple concurrent queries and has sub-second response time.</p>
+<p>Spark ä»»å¡ç®¡çä¹æææ¹è¿ï¼ä¸æ¦ Spark ä»»å¡å¼å§è¿è¡ï¼æ¨å°±å¯ä»¥å¨Webæ§å¶å°ä¸è·å¾ä½ä¸é¾æ¥ï¼å¦ææ¨ä¸¢å¼è¯¥ä½ä¸ï¼Kylin å°ç«å»ç»æ¢ Spark ä½ä¸ä»¥åæ¶éæ¾èµæºï¼å¦æéæ°å¯å¨ Kylinï¼å®å¯ä»¥ä»ä¸ä¸ä¸ªä½ä¸æ¢å¤ï¼èä¸æ¯éæ°æ交æ°ä½ä¸.</p>
-<p>Although we are mainly using Kylin to provide service to our customers, weâve decided to reuse the built result for internal purposes too. Kylin supports Business Intelligence tools like Apache Zeppelin and Tableau. With these BI tools we can provide insight and visualization about our data which will help making business decisions.</p>
+<h3 id="mysql--kylin-">MySQL å Kylin å
æ°æ®çåå¨</h3>
+<p>å¨è¿å»ï¼HBase æ¯ Kylin å
æ°æ®åå¨çå¯ä¸éæ©ã å¨æäºæ
åµä¸ HBaseä¸éç¨ï¼ä¾å¦ä½¿ç¨å¤ä¸ª HBase é群æ¥ä¸º Kylin æä¾è·¨åºåçé«å¯ç¨ï¼è¿éå¤å¶ç HBase é群æ¯åªè¯»çï¼æ以ä¸è½åå
æ°æ®åå¨ãç°å¨æ们å¼å
¥äº MySQL Metastore 以满足è¿ç§éæ±ãæ¤åè½ç°å¨å¤äºæµè¯é¶æ®µãæ´å¤å
容åè§ KYLIN-3488ã</p>
-<p>Other than those BI tools mentioned above, weâre using another similar application named Redash because:</p>
+<h3 id="hybrid-model-">Hybrid model å¾å½¢çé¢</h3>
+<p>Hybrid æ¯ä¸ç§ç¨äºç»è£
å¤ä¸ª cube çé«çº§æ¨¡åã å®å¯ç¨äºæ»¡è¶³ cube ç schema è¦åçæ¹åçæ
åµãè¿ä¸ªåè½è¿å»æ²¡æå¾å½¢çé¢ï¼å æ¤åªæä¸å°é¨åç¨æ·ç¥éå®ãç°å¨æä»¬å¨ Web çé¢ä¸å¼å¯äºå®ï¼ä»¥ä¾¿æ´å¤ç¨æ·å¯ä»¥å°è¯ã</p>
-<ol>
- <li>
- <p>Weâve already had a deployment of redash for data analyzing upon traditional databases like PostgreSQL, etc</p>
- </li>
- <li>
- <p>Redash is open source and easy to deploy, rich in visualization functions and has good integrations with other productivity tools we are using (like Slack).</p>
- </li>
-</ol>
+<h3 id="cube-planner">é»è®¤å¼å¯ Cube planner</h3>
+<p>Cube planner å¯ä»¥æ大å°ä¼å cube ç»æï¼åå°æ建ç cuboid æ°éï¼ä»èèç计ç®/åå¨èµæºå¹¶æé«æ¥è¯¢æ§è½ãå®æ¯å¨v2.3ä¸å¼å
¥çï¼ä½é»è®¤æ
åµä¸æ²¡æå¼å¯ã为äºè®©æ´å¤ç¨æ·çå°å¹¶å°è¯å®ï¼æ们é»è®¤å¨v2.5ä¸å¯ç¨å®ã ç®æ³å°å¨ç¬¬ä¸æ¬¡æ建 segment çæ¶åï¼æ ¹æ®æ°æ®ç»è®¡èªå¨ä¼å cuboid éå.</p>
-<p>Unfortunately, redash doesnât officially support Kylin as a data source for now. Thus we wrote a simple one to include it. The plugin has already been open sourced under BSD-2 license as a <a href="https://github.com/strikingly/redash-kylin">GitHub repository</a>.</p>
+<h3 id="segment-">æ¹è¿ç Segment åªæ</h3>
+<p>Segmentï¼ååºï¼ä¿®åªå¯ä»¥ææå°åå°ç£çåç½ç»I / Oï¼å æ¤å¤§å¤§æé«äºæ¥è¯¢æ§è½ã è¿å»ï¼Kylin åªæååºå (partition date column) çå¼è¿è¡ segment çä¿®åªã å¦ææ¥è¯¢ä¸æ²¡æå°ååºåä½ä¸ºè¿æ»¤æ¡ä»¶ï¼é£ä¹ä¿®åªå°ä¸èµ·ä½ç¨ï¼ä¼æ«æææsegmentã.<br />
+ç°å¨ä»v2.5å¼å§ï¼Kylin å°å¨ segment 级å«è®°å½æ¯ä¸ªç»´åº¦çæå°/æ大å¼ã å¨æ«æ segment ä¹åï¼ä¼å°æ¥è¯¢çæ¡ä»¶ä¸æå°/æ大索å¼è¿è¡æ¯è¾ã å¦æä¸å¹é
ï¼å°è·³è¿è¯¥ segmentã æ£æ¥KYLIN-3370äºè§£æ´å¤ä¿¡æ¯ã</p>
-<p>The redash-kylin plugin is just a single piece of python file which implements redashâs data source protocol. To install, retrieve the <code class="highlighter-rouge">kylin.py</code> file inside <code class="highlighter-rouge">redash/query_runner</code> folder of the pluginâs repository and place it under corresponding folder of redash.</p>
+<h3 id="yarn-">å¨ YARN ä¸å并åå
¸</h3>
+<p>å½ segment å并æ¶ï¼å®ä»¬çè¯å
¸ä¹éè¦å并ãå¨è¿å»ï¼åå
¸å并åçå¨ Kylin ç JVM ä¸ï¼è¿éè¦ä½¿ç¨å¤§éçæ¬å°å
åå CPU èµæºã å¨æ端æ
åµä¸ï¼å¦ææå 个并åä½ä¸ï¼ï¼å¯è½ä¼å¯¼è´ Kylin è¿ç¨å´©æºã å æ¤ï¼ä¸äºç¨æ·ä¸å¾ä¸ä¸º Kylin ä»»å¡èç¹åé
æ´å¤å
åï¼æè¿è¡å¤ä¸ªä»»å¡èç¹ä»¥å¹³è¡¡å·¥ä½è´è½½ã<br />
+ç°å¨ä»v2.5å¼å§ï¼Kylin å°æè¿é¡¹ä»»å¡æäº¤ç» Hadoop MapReduce å Sparkï¼è¿æ ·å°±å¯ä»¥è§£å³è¿ä¸ªç¶é¢é®é¢ã æ¥çKYLIN-3471äºè§£æ´å¤ä¿¡æ¯.</p>
-<p><img src="/images/blog/redash/redash_1.jpeg" alt="" /></p>
+<h3 id="cube-">æ¹è¿ä½¿ç¨å
¨å±åå
¸ç cube æ建æ§è½</h3>
+<p>å
¨å±åå
¸ (Global Dictionary) æ¯ bitmap 精确å»é计æ°çå¿
è¦æ¡ä»¶ãå¦æå»éåå
·æé常é«çåºæ°ï¼å GD å¯è½é常大ãå¨ cube æ建é¶æ®µï¼Kylin éè¦éè¿ GD å°éæ´æ°å¼è½¬æ¢ä¸ºæ´æ°ã尽管 GD 已被åæå¤ä¸ªåçï¼å¯ä»¥åå¼å è½½å°å
åï¼ä½æ¯ç±äºå»éåçå¼æ¯ä¹±åºçãKylin éè¦åå¤è½½å
¥åè½½åº(swap in/out)åçï¼è¿ä¼å¯¼è´æ建任å¡é常ç¼æ
¢ã<br />
+该å¢å¼ºåè½å¼å
¥äºä¸ä¸ªæ°æ¥éª¤ï¼ä¸ºæ¯ä¸ªæ°æ®åä»å
¨å±åå
¸ä¸æ建ä¸ä¸ªç¼©å°çåå
¸ã éåæ¯ä¸ªä»»å¡åªéè¦å 载缩å°çåå
¸ï¼ä»èé¿å
é¢ç¹çè½½å
¥åè½½åºãæ§è½å¯ä»¥æ¯ä»¥åå¿«3åãæ¥ç KYLIN-3491 äºè§£æ´å¤ä¿¡æ¯.</p>
-<p>Before you can use the plugin, you need to enable it first. Please modify the default enabled plugin list defined in <code class="highlighter-rouge">redash/settings.py</code>:</p>
+<h3 id="topn-count-distinct--cube-">æ¹è¿å« TOPN, COUNT DISTINCT ç cube 大å°ç估计</h3>
+<p>Cube ç大å°å¨æ建æ¶æ¯é¢å
估计çï¼å¹¶è¢«åç»å 个æ¥éª¤ä½¿ç¨ï¼ä¾å¦å³å® MR / Spark ä½ä¸çååºæ°ï¼è®¡ç® HBase region åå²çãå®çåç¡®ä¸å¦ä¼å¯¹æ建æ§è½äº§çå¾å¤§å½±åã å½åå¨ COUNT DISTINCTï¼TOPN ç度éæ¶åï¼å 为å®ä»¬ç大å°æ¯çµæ´»çï¼å æ¤ä¼°è®¡å¼å¯è½è·çå®å¼æå¾å¤§åå·®ã å¨è¿å»ï¼ç¨æ·éè¦è°æ´è¥å¹²ä¸ªåæ°ä»¥ä½¿å°ºå¯¸ä¼°è®¡æ´æ¥è¿å®é
尺寸ï¼è¿å¯¹æ®éç¨æ·æç¹å°é¾ã<br />
+ç°å¨ï¼Kylin å°æ ¹æ®æ¶éçç»è®¡ä¿¡æ¯èªå¨è°æ´å¤§å°ä¼°è®¡ãè¿å¯ä»¥ä½¿ä¼°è®¡å¼ä¸å®é
大å°æ´æ¥è¿ãæ¥ç KYLIN-3453 äºè§£æ´å¤ä¿¡æ¯ã</p>
-<p><img src="/images/blog/redash/redash_2.jpeg" alt="" /></p>
+<h3 id="hadoop-30hbase-20">æ¯æHadoop 3.0/HBase 2.0</h3>
+<p>Hadoop 3å HBase 2å¼å§è¢«è®¸å¤ç¨æ·éç¨ãç°å¨ Kylin æä¾ä½¿ç¨æ°ç Hadoop å HBase API ç¼è¯çæ°äºè¿å¶å
ãæ们已ç»å¨ Hortonworks HDP 3.0 å Cloudera CDH 6.0 ä¸è¿è¡äºæµè¯</p>
-<p>At last you have to rebuild the docker image (if you are using docker deployment) of redash and restart both server and worker of it. Currently, the redash-kylin plugin only supports the current stable version of redash (3.0.0) and 2.x version of Apache Kylin.</p>
+<p><strong>ä¸è½½</strong></p>
-<p>Once installed successfully, youâll be able to find a KylinAPI data source type at the New Data Source page. To use it, just select that source type and fill in required fields. The redash-kylin plugin works by calling Kylinâs HTTP RESTful API, thus you should make sure your redash deployment has an access to your Kylin cluster (either job mode or query mode).</p>
+<p>è¦ä¸è½½Apache Kylin v2.5.0æºä»£ç æäºè¿å¶å
ï¼è¯·è®¿é®<a href="http://kylin.apache.org/download">ä¸è½½é¡µé¢</a> .</p>
-<p><img src="/images/blog/redash/redash_3.jpeg" alt="" /></p>
+<p><strong>å级</strong></p>
-<p>After a data source is setup and the connection is tested ok. You should be able to view schemas, run queries and make visualizations from tables in Kylin. Just type the SQL query in and get the result out. For more details about redashâs usage, please refer to <a href="https://redash.io/help/">redashâs documentation</a>.</p>
+<p>åè<a href="/docs/howto/howto_upgrade.html">å级æå</a>.</p>
-<p><img src="/images/blog/redash/redash_4.jpeg" alt="" /></p>
+<p><strong>åé¦</strong></p>
-<p>You can also add multiple data sources by setting different project names or different API URLs. Itâs worth to mention that redash has an experiment function which supports making a query from former cached query results. Thus, once query results from different Kylin cluster has been imported, youâll be able to join them together for richer data processing.</p>
+<p>å¦ææ¨éå°é®é¢æçé®ï¼è¯·åéé®ä»¶è³ Apache Kylin dev æ user é®ä»¶å表ï¼dev@kylin.apache.orgï¼user@kylin.apache.org; å¨åéä¹åï¼è¯·ç¡®ä¿æ¨å·²éè¿åéçµåé®ä»¶è³ dev-subscribe@kylin.apache.org æ user-subscribe@kylin.apache.org订é
äºé®ä»¶å表ã</p>
-<p>Wish you have a good time with Redash-Kylin!</p>
+<p><em>é常æè°¢ææè´¡ç®Apache Kylinçæå!</em></p>
</description>
- <pubDate>Tue, 08 May 2018 13:00:00 -0700</pubDate>
- <link>http://kylin.apache.org/blog/2018/05/08/redash-kylin-plugin-strikingly/</link>
- <guid isPermaLink="true">http://kylin.apache.org/blog/2018/05/08/redash-kylin-plugin-strikingly/</guid>
+ <pubDate>Thu, 20 Sep 2018 13:00:00 -0700</pubDate>
+ <link>http://kylin.apache.org/cn/blog/2018/09/20/release-v2.5.0/</link>
+ <guid isPermaLink="true">http://kylin.apache.org/cn/blog/2018/09/20/release-v2.5.0/</guid>
<category>blog</category>