You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@kylin.apache.org by li...@apache.org on 2020/11/10 14:09:20 UTC

svn commit: r1883250 - in /kylin/site: cn/docs/tutorial/setup_systemcube.html docs/tutorial/setup_systemcube.html feed.xml

Author: lidong
Date: Tue Nov 10 14:09:19 2020
New Revision: 1883250

URL: http://svn.apache.org/viewvc?rev=1883250&view=rev
Log:
a little update

Modified:
    kylin/site/cn/docs/tutorial/setup_systemcube.html
    kylin/site/docs/tutorial/setup_systemcube.html
    kylin/site/feed.xml

Modified: kylin/site/cn/docs/tutorial/setup_systemcube.html
URL: http://svn.apache.org/viewvc/kylin/site/cn/docs/tutorial/setup_systemcube.html?rev=1883250&r1=1883249&r2=1883250&view=diff
==============================================================================
--- kylin/site/cn/docs/tutorial/setup_systemcube.html (original)
+++ kylin/site/cn/docs/tutorial/setup_systemcube.html Tue Nov 10 14:09:19 2020
@@ -182,16 +182,26 @@ var _hmt = _hmt || [];
   <p>自 Apache Kylin v2.3.0 起有效</p>
 </blockquote>
 
-<h2 id="cube">什么是系统 Cube</h2>
+<p>本节主要内容:</p>
+
+<ul>
+  <li><a href="#什么是系统 Cube">什么是系统 Cube</a></li>
+  <li><a href="#如何建立系统 Cube">如何建立系统 Cube</a></li>
+  <li><a href="#自动创建系统 Cube">自动创建系统 Cube</a></li>
+  <li><a href="#系统 Cube 的细节">系统 Cube 的细节</a></li>
+</ul>
+
+<h2 id="span-id-cube-cubespan"><span id="什么是系统 Cube">什么是系统 Cube</span></h2>
 
 <p>为了更好的支持自我监控,在系统 project 下创建一组系统 Cubes,叫做 “KYLIN_SYSTEM”。现在,这里有五个 Cubes。三个用于查询指标,”METRICS_QUERY”,”METRICS_QUERY_CUBE”,”METRICS_QUERY_RPC”。另外两个是 job 指标,”METRICS_JOB”,”METRICS_JOB_EXCEPTION”。</p>
 
-<h2 id="cube-1">如何建立系统 Cube</h2>
+<h2 id="span-id-cube-cubespan-1"><span id="如何建立系统 Cube">如何建立系统 Cube</span></h2>
 
-<h3 id="section">准备</h3>
-<p>在 KYLIN_HOME 目录下创建一个配置文件 SCSinkTools.json。</p>
+<p>本节我们介绍手动启用系统 Cube 的方法,如果您希望通过 shell 脚本自动创建系统 Cube,请参考<a href="#自动创建系统 Cube">自动创建系统 Cube</a>。</p>
 
-<p>例如:</p>
+<h3 id="section">1. 准备</h3>
+
+<p>在 KYLIN_HOME 目录下创建一个配置文件 SCSinkTools.json。例如:</p>
 
 <div class="highlighter-rouge"><pre class="highlight"><code>[
     {
@@ -206,8 +216,8 @@ var _hmt = _hmt || [];
 </code></pre>
 </div>
 
-<h3 id="metadata">1. 生成 Metadata</h3>
-<p>在 KYLIN_HOME 文件夹下运行一下命令生成相关的 metadata:</p>
+<h3 id="metadata">2. 生成 Metadata</h3>
+<p>在 KYLIN_HOME 文件夹下运行以下命令生成相关的 metadata:</p>
 
 <div class="highlighter-rouge"><pre class="highlight"><code>./bin/kylin.sh org.apache.kylin.tool.metrics.systemcube.SCCreator \
 -inputConfig SCSinkTools.json \
@@ -219,39 +229,31 @@ var _hmt = _hmt || [];
 
 <p><img src="/images/SystemCube/metadata.png" alt="metadata" /></p>
 
-<h3 id="section-1">2. 建立数据源</h3>
-<p>运行下列命令生成 hive 源表:</p>
+<h3 id="section-1">3. 建立数据源</h3>
+<p>运行下列命令生成 Hive 源表:</p>
 
 <div class="highlighter-rouge"><pre class="highlight"><code>hive -f &lt;output_forder&gt;/create_hive_tables_for_system_cubes.sql
 </code></pre>
 </div>
 
-<p>通过这个命令,相关的 hive 表将会被创建。</p>
+<p>通过这个命令,相关的 hive 表将会被创建。每一个系统 Cube 中的事实表对应了一张 Hive 源表,Hive 源表中记录了查询或任务相关的数据,这些数据将为系统 Cube 服务。</p>
 
 <p><img src="/images/SystemCube/hive_table.png" alt="hive_table" /></p>
 
-<h3 id="system-cubes--metadata">3. 为 System Cubes 上传 Metadata</h3>
+<h3 id="cubes--metadata">4. 为系统 Cubes 上传 Metadata</h3>
 <p>然后我们需要通过下列命令上传 metadata 到 hbase:</p>
 
 <div class="highlighter-rouge"><pre class="highlight"><code>./bin/metastore.sh restore &lt;output_forder&gt;
 </code></pre>
 </div>
 
-<h3 id="metadata-1">4. 重载 Metadata</h3>
-<p>最终,我们需要在 Kylin web UI 重载 metadata。</p>
-
-<p>然后,一组系统 Cubes 将会被创建在系统 project 下,称为 “KYLIN_SYSTEM”。</p>
-
-<h3 id="cube-build">5. 系统 Cube build</h3>
-<p>当系统 Cube 被创建,我们需要定期 build Cube。</p>
+<h3 id="metadata-1">5. 重载 Metadata</h3>
+<p>最终,我们需要在 Kylin web UI 重载 metadata。然后,一组系统 Cubes 将会被创建在系统 project 下,称为 “KYLIN_SYSTEM”。</p>
 
-<ol>
-  <li>
-    <p>创建一个 shell 脚本其通过调用 org.apache.kylin.tool.job.CubeBuildingCLI 来 build 系统 Cube</p>
+<h3 id="cube">6. 构建系统 Cube</h3>
+<p>当系统 Cube 被创建,我们需要定期构建 Cube。方法如下:</p>
 
-    <p>例如:</p>
-  </li>
-</ol>
+<p><strong>步骤一</strong>:创建一个 shell 脚本,通过调用 org.apache.kylin.tool.job.CubeBuildingCLI 来构建系统 Cube。例如:</p>
 
 <div class="highlight"><pre><code class="language-groff" data-lang="groff">#!/bin/bash
 
@@ -270,13 +272,7 @@ ID="$END"
 echo "building for ${CUBE}_${ID}" &gt;&gt; ${KYLIN_HOME}/logs/build_trace.log
 sh ${KYLIN_HOME}/bin/kylin.sh org.apache.kylin.tool.job.CubeBuildingCLI --cube ${CUBE} --endTime ${END} &gt; ${KYLIN_HOME}/logs/system_cube_${CUBE}_${END}.log 2&gt;&amp;1 &amp;</code></pre></div>
 
-<ol>
-  <li>
-    <p>然后定期运行这个 shell 脚本</p>
-
-    <p>例如,像接下来这样添加一个 cron job:</p>
-  </li>
-</ol>
+<p><strong>步骤二</strong>:定期运行这个 shell 脚本。例如,像接下来这样添加一个 cron job:</p>
 
 <div class="highlight"><pre><code class="language-groff" data-lang="groff">0 */2 * * * sh ${KYLIN_HOME}/bin/system_cube_build.sh KYLIN_HIVE_METRICS_QUERY_QA 3600000 1200000
 
@@ -288,26 +284,29 @@ sh ${KYLIN_HOME}/bin/kylin.sh org.apache
 
 50 */12 * * * sh ${KYLIN_HOME}/bin/system_cube_build.sh KYLIN_HIVE_METRICS_JOB_EXCEPTION_QA 3600000 12000</code></pre></div>
 
-<h2 id="cube-2">自动创建系统cube</h2>
+<h2 id="span-id-cube-cubespan-2"><span id="自动创建系统 Cube">自动创建系统 Cube</span></h2>
 
-<p>从kylin 2.6.0开始提供system-cube.sh脚本,用户可以通过执行此脚本来自动创建系统cube。</p>
+<p>从kylin 2.6.0 开始提供 system-cube.sh 脚本,用户可以通过执行此脚本来自动创建系统 Cube。</p>
 
 <ul>
   <li>
-    <p>创建系统cube:<code class="highlighter-rouge">sh system-cube.sh setup</code></p>
+    <p>创建系统 Cube:<code class="highlighter-rouge">sh system-cube.sh setup</code></p>
   </li>
   <li>
-    <p>构建系统cube:<code class="highlighter-rouge">sh bin/system-cube.sh build</code></p>
+    <p>构建系统 Cube:<code class="highlighter-rouge">sh bin/system-cube.sh build</code></p>
   </li>
   <li>
-    <p>为系统cube添加定时任务:<code class="highlighter-rouge">bin/system.sh cron</code></p>
+    <p>为系统 Cube 添加定时任务:<code class="highlighter-rouge">bin/system.sh cron</code></p>
   </li>
 </ul>
 
-<h2 id="cube-">系统 Cube 的细节</h2>
+<h2 id="span-id-cube--cube-span"><span id="系统 Cube 的细节">系统 Cube 的细节</span></h2>
+
+<p>Hive 中有 5 张表记录了 Kylin 系统的相关指标数据,每一个系统 Cube 的事实表对应了一张 Hive 表,共有 5 个系统 Cube。</p>
 
 <h3 id="dimension">普通 Dimension</h3>
-<p>对于这些 Cube,admins 能够用四个时间粒度查询。从高级别到低级别,如下:</p>
+
+<p>对于这些系统 Cube,admins 能够用四个时间粒度查询,这些维度在 5 个系统 Cube 中均生效。从高级别到低级别,如下:</p>
 
 <table>
   <tr>
@@ -340,12 +339,16 @@ sh ${KYLIN_HOME}/bin/kylin.sh org.apache
     <td>the host of server for query engine</td>
   </tr>
   <tr>
+    <td>KUSER</td>
+    <td>the user who executes the query</td>
+  </tr>
+  <tr>
     <td>PROJECT</td>
-    <td></td>
+    <td>the project where the query executes</td>
   </tr>
   <tr>
     <td>REALIZATION</td>
-    <td>in Kylin,there are two OLAP realizations: Cube,or Hybrid of Cubes</td>
+    <td>the cube which the query hits. In Kylin, there are two OLAP realizations: Cube, or Hybrid of Cubes</td>
   </tr>
   <tr>
     <td>REALIZATION_TYPE</td>
@@ -353,11 +356,11 @@ sh ${KYLIN_HOME}/bin/kylin.sh org.apache
   </tr>
   <tr>
     <td>QUERY_TYPE</td>
-    <td>users can query on different data sources,CACHE,OLAP,LOOKUP_TABLE,HIVE</td>
+    <td>users can query on different data sources: CACHE, OLAP, LOOKUP_TABLE, HIVE</td>
   </tr>
   <tr>
     <td>EXCEPTION</td>
-    <td>when doing query,exceptions may happen. It's for classifying different exception types</td>
+    <td>when doing query, exceptions may happen. It's for classifying different exception types</td>
   </tr>
 </table>
 
@@ -370,19 +373,19 @@ sh ${KYLIN_HOME}/bin/kylin.sh org.apache
     <td></td>
   </tr>
   <tr>
-    <td>MIN,MAX,SUM of QUERY_TIME_COST</td>
+    <td>MIN, MAX, SUM, PERCENTILE_APPROX of QUERY_TIME_COST</td>
     <td>the time cost for the whole query</td>
   </tr>
   <tr>
-    <td>MAX,SUM of CALCITE_SIZE_RETURN</td>
+    <td>MAX, SUM of CALCITE_SIZE_RETURN</td>
     <td>the row count of the result Calcite returns</td>
   </tr>
   <tr>
-    <td>MAX,SUM of STORAGE_SIZE_RETURN</td>
+    <td>MAX, SUM of STORAGE_SIZE_RETURN</td>
     <td>the row count of the input to Calcite</td>
   </tr>
   <tr>
-    <td>MAX,SUM of CALCITE_SIZE_AGGREGATE_FILTER</td>
+    <td>MAX, SUM of CALCITE_SIZE_AGGREGATE_FILTER</td>
     <td>the row count of Calcite aggregates and filters</td>
   </tr>
   <tr>
@@ -404,11 +407,11 @@ sh ${KYLIN_HOME}/bin/kylin.sh org.apache
   </tr>
   <tr>
     <td>PROJECT</td>
-    <td></td>
+    <td>the project where the query executes</td>
   </tr>
   <tr>
     <td>REALIZATION</td>
-    <td></td>
+    <td>the cube which the query hits</td>
   </tr>
   <tr>
     <td>RPC_SERVER</td>
@@ -416,7 +419,7 @@ sh ${KYLIN_HOME}/bin/kylin.sh org.apache
   </tr>
   <tr>
     <td>EXCEPTION</td>
-    <td>the exception of a rpc call. If no exception,"NULL" is used</td>
+    <td>the exception of a rpc call. If no exception, "NULL" is used</td>
   </tr>
 </table>
 
@@ -429,28 +432,28 @@ sh ${KYLIN_HOME}/bin/kylin.sh org.apache
     <td></td>
   </tr>
   <tr>
-    <td>MAX,SUM of CALL_TIME</td>
+    <td>MAX, SUM, PERCENTILE_APPROX of CALL_TIME</td>
     <td>the time cost of a rpc all</td>
   </tr>
   <tr>
-    <td>MAX,SUM of COUNT_SKIP</td>
-    <td>based on fuzzy filters or else,a few rows will be skiped. This indicates the skipped row count</td>
+    <td>MAX, SUM of COUNT_SKIP</td>
+    <td>based on fuzzy filters or else, a few rows will be skiped. This indicates the skipped row count</td>
   </tr>
   <tr>
-    <td>MAX,SUM of SIZE_SCAN</td>
+    <td>MAX, SUM of SIZE_SCAN</td>
     <td>the row count actually scanned</td>
   </tr>
   <tr>
-    <td>MAX,SUM of SIZE_RETURN</td>
+    <td>MAX, SUM of SIZE_RETURN</td>
     <td>the row count actually returned</td>
   </tr>
   <tr>
-    <td>MAX,SUM of SIZE_AGGREGATE</td>
+    <td>MAX, SUM of SIZE_AGGREGATE</td>
     <td>the row count actually aggregated</td>
   </tr>
   <tr>
-    <td>MAX,SUM of SIZE_AGGREGATE_FILTER</td>
-    <td>the row count actually aggregated and filtered,= SIZE_SCAN - SIZE_RETURN</td>
+    <td>MAX, SUM of SIZE_AGGREGATE_FILTER</td>
+    <td>the row count actually aggregated and filtered, = SIZE_SCAN - SIZE_RETURN</td>
   </tr>
 </table>
 
@@ -466,6 +469,10 @@ sh ${KYLIN_HOME}/bin/kylin.sh org.apache
     <td></td>
   </tr>
   <tr>
+    <td>SEGMENT_NAME</td>
+    <td></td>
+  </tr>
+  <tr>
     <td>CUBOID_SOURCE</td>
     <td>source cuboid parsed based on query and Cube design</td>
   </tr>
@@ -482,7 +489,6 @@ sh ${KYLIN_HOME}/bin/kylin.sh org.apache
     <td>whether a query on this Cube is successful or not</td>
   </tr>
 </table>
-
 <table>
   <tr>
     <th colspan="2">Measure</th>
@@ -492,36 +498,40 @@ sh ${KYLIN_HOME}/bin/kylin.sh org.apache
     <td></td>
   </tr>
   <tr>
-    <td>MAX,SUM of STORAGE_CALL_COUNT</td>
+    <td>WEIGHT_PER_HIT</td>
+    <td></td>
+  </tr>
+  <tr>
+    <td>MAX, SUM of STORAGE_CALL_COUNT</td>
     <td>the number of rpc calls for a query hit on this Cube</td>
   </tr>
   <tr>
-    <td>MAX,SUM of STORAGE_CALL_TIME_SUM</td>
+    <td>MAX, SUM of STORAGE_CALL_TIME_SUM</td>
     <td>sum of time cost for the rpc calls of a query</td>
   </tr>
   <tr>
-    <td>MAX,SUM of STORAGE_CALL_TIME_MAX</td>
+    <td>MAX, SUM of STORAGE_CALL_TIME_MAX</td>
     <td>max of time cost among the rpc calls of a query</td>
   </tr>
   <tr>
-    <td>MAX,SUM of STORAGE_COUNT_SKIP</td>
+    <td>MAX, SUM of STORAGE_COUNT_SKIP</td>
     <td>the sum of row count skipped for the related rpc calls</td>
   </tr>
   <tr>
-    <td>MAX,SUM of STORAGE_SIZE_SCAN</td>
+    <td>MAX, SUM of STORAGE_COUNT_SCAN</td>
     <td>the sum of row count scanned for the related rpc calls</td>
   </tr>
   <tr>
-    <td>MAX,SUM of STORAGE_SIZE_RETURN</td>
+    <td>MAX, SUM of STORAGE_COUNT_RETURN</td>
     <td>the sum of row count returned for the related rpc calls</td>
   </tr>
   <tr>
-    <td>MAX,SUM of STORAGE_SIZE_AGGREGATE</td>
+    <td>MAX, SUM of STORAGE_COUNT_AGGREGATE</td>
     <td>the sum of row count aggregated for the related rpc calls</td>
   </tr>
   <tr>
-    <td>MAX,SUM of STORAGE_SIZE_AGGREGATE_FILTER</td>
-    <td>the sum of row count aggregated and filtered for the related rpc calls,= STORAGE_SIZE_SCAN - STORAGE_SIZE_RETURN</td>
+    <td>MAX, SUM of STORAGE_COUNT_AGGREGATE_FILTER</td>
+    <td>the sum of row count aggregated and filtered for the related rpc calls, = STORAGE_SIZE_SCAN - STORAGE_SIZE_RETURN</td>
   </tr>
 </table>
 
@@ -538,20 +548,28 @@ sh ${KYLIN_HOME}/bin/kylin.sh org.apache
     <th colspan="2">Dimension</th>
   </tr>
   <tr>
+    <td>HOST</td>
+    <td>the host of server for job engine</td>
+  </tr>
+  <tr>
+    <td>KUSER</td>
+    <td>the user who run the job</td>
+  </tr>
+  <tr>
     <td>PROJECT</td>
-    <td></td>
+    <td>the project where the job runs</td>
   </tr>
   <tr>
     <td>CUBE_NAME</td>
-    <td></td>
+    <td>the cube with which the job is related</td>
   </tr>
   <tr>
     <td>JOB_TYPE</td>
-    <td></td>
+    <td>build, merge or optimize</td>
   </tr>
   <tr>
     <td>CUBING_TYPE</td>
-    <td>in kylin,there are two cubing algorithms,Layered &amp; Fast(InMemory)</td>
+    <td>in kylin, there are two cubing algorithms, Layered &amp; Fast(InMemory)</td>
   </tr>
 </table>
 
@@ -564,25 +582,41 @@ sh ${KYLIN_HOME}/bin/kylin.sh org.apache
     <td></td>
   </tr>
   <tr>
-    <td>MIN,MAX,SUM of DURATION</td>
+    <td>MIN, MAX, SUM, PERCENTILE_APPROX of DURATION</td>
     <td>the duration from a job start to finish</td>
   </tr>
   <tr>
-    <td>MIN,MAX,SUM of TABLE_SIZE</td>
+    <td>MIN, MAX, SUM of TABLE_SIZE</td>
     <td>the size of data source in bytes</td>
   </tr>
   <tr>
-    <td>MIN,MAX,SUM of CUBE_SIZE</td>
+    <td>MIN, MAX, SUM of CUBE_SIZE</td>
     <td>the size of created Cube segment in bytes</td>
   </tr>
   <tr>
-    <td>MIN,MAX,SUM of PER_BYTES_TIME_COST</td>
+    <td>MIN, MAX, SUM of PER_BYTES_TIME_COST</td>
     <td>= DURATION / TABLE_SIZE</td>
   </tr>
   <tr>
-    <td>MIN,MAX,SUM of WAIT_RESOURCE_TIME</td>
+    <td>MIN, MAX, SUM of WAIT_RESOURCE_TIME</td>
     <td>a job may includes serveral MR(map reduce) jobs. Those MR jobs may wait because of lack of Hadoop resources.</td>
   </tr>
+  <tr>
+    <td>MAX, SUM of step_duration_distinct_columns</td>
+    <td></td>
+  </tr>
+  <tr>
+    <td>MAX, SUM of step_duration_dictionary</td>
+    <td></td>
+  </tr>
+  <tr>
+    <td>MAX, SUM of step_duration_inmem_cubing</td>
+    <td></td>
+  </tr>
+  <tr>
+    <td>MAX, SUM of step_duration_hfile_convert</td>
+    <td></td>
+  </tr>
 </table>
 
 <h3 id="metricsjobexception">METRICS_JOB_EXCEPTION</h3>
@@ -593,24 +627,32 @@ sh ${KYLIN_HOME}/bin/kylin.sh org.apache
     <th colspan="2">Dimension</th>
   </tr>
   <tr>
+    <td>HOST</td>
+    <td>the host of server for job engine</td>
+  </tr>
+  <tr>
+    <td>KUSER</td>
+    <td>the user who run a job</td>
+  </tr>
+  <tr>
     <td>PROJECT</td>
-    <td></td>
+    <td>the project where the job runs</td>
   </tr>
   <tr>
     <td>CUBE_NAME</td>
-    <td></td>
+    <td>the cube with which the job is related</td>
   </tr>
   <tr>
     <td>JOB_TYPE</td>
-    <td></td>
+    <td>build, merge or optimize</td>
   </tr>
   <tr>
     <td>CUBING_TYPE</td>
-    <td></td>
+    <td>in kylin, there are two cubing algorithms, Layered &amp; Fast(InMemory)</td>
   </tr>
   <tr>
     <td>EXCEPTION</td>
-    <td>when running a job,exceptions may happen. It's for classifying different exception types</td>
+    <td>when running a job, exceptions may happen. It's for classifying different exception types</td>
   </tr>
 </table>
 

Modified: kylin/site/docs/tutorial/setup_systemcube.html
URL: http://svn.apache.org/viewvc/kylin/site/docs/tutorial/setup_systemcube.html?rev=1883250&r1=1883249&r2=1883250&view=diff
==============================================================================
--- kylin/site/docs/tutorial/setup_systemcube.html (original)
+++ kylin/site/docs/tutorial/setup_systemcube.html Tue Nov 10 14:09:19 2020
@@ -8589,13 +8589,25 @@ var _hmt = _hmt || [];
   <p>Available since Apache Kylin v2.3.0</p>
 </blockquote>
 
-<h2 id="what-is-system-cube">What is System Cube</h2>
+<p>Main content of this section:</p>
+
+<ul>
+  <li><a href="#What is System Cube">What is System Cube</a></li>
+  <li><a href="#How to Set Up System Cube">How to Set Up System Cube</a></li>
+  <li><a href="#Automatically create System Cube">Automatically create System Cube</a></li>
+  <li><a href="#Details of System Cube">Details of System Cube</a></li>
+</ul>
+
+<h2 id="span-idwhat-is-system-cubewhat-is-system-cubespan"><span id="What is System Cube">What is System Cube</span></h2>
 
 <p>For better supporting self-monitoring, a set of system Cubes are created under the system project, called “KYLIN_SYSTEM”. Currently, there are five Cubes. Three are for query metrics, “METRICS_QUERY”, “METRICS_QUERY_CUBE”, “METRICS_QUERY_RPC”. And the other two are for job metrics, “METRICS_JOB”, “METRICS_JOB_EXCEPTION”.</p>
 
-<h2 id="how-to-set-up-system-cube">How to Set Up System Cube</h2>
+<h2 id="span-idhow-to-set-up-system-cubehow-to-set-up-system-cubespan"><span id="How to Set Up System Cube">How to Set Up System Cube</span></h2>
+
+<p>In this section, we will introduce the method of manually enabling the system cube. If you want to automatically enable the system cube through shell scripts, please refer to <a href="#Automatically Create System Cube">Automatically Create System Cube</a>.</p>
+
+<h3 id="prepare">1. Prepare</h3>
 
-<h3 id="prepare">Prepare</h3>
 <p>Create a configuration file SCSinkTools.json in KYLIN_HOME directory.</p>
 
 <p>For example:</p>
@@ -8613,7 +8625,7 @@ var _hmt = _hmt || [];
 </code></pre>
 </div>
 
-<h3 id="generate-metadata">1. Generate Metadata</h3>
+<h3 id="generate-metadata">2. Generate Metadata</h3>
 <p>Run the following command in KYLIN_HOME folder to generate related metadata:</p>
 
 <div class="highlighter-rouge"><pre class="highlight"><code>./bin/kylin.sh org.apache.kylin.tool.metrics.systemcube.SCCreator \
@@ -8626,7 +8638,7 @@ var _hmt = _hmt || [];
 
 <p><img src="/images/SystemCube/metadata.png" alt="metadata" /></p>
 
-<h3 id="set-up-datasource">2. Set Up Datasource</h3>
+<h3 id="set-up-datasource">3. Set Up Datasource</h3>
 <p>Running the following command to create source hive tables:</p>
 
 <div class="highlighter-rouge"><pre class="highlight"><code>hive -f &lt;output_folder&gt;/create_hive_tables_for_system_cubes.sql
@@ -8637,28 +8649,24 @@ var _hmt = _hmt || [];
 
 <p><img src="/images/SystemCube/hive_table.png" alt="hive_table" /></p>
 
-<h3 id="upload-metadata-for-system-cubes">3. Upload Metadata for System Cubes</h3>
+<h3 id="upload-metadata-for-system-cubes">4. Upload Metadata for System Cubes</h3>
 <p>Then we need to upload metadata to hbase by the following command:</p>
 
 <div class="highlighter-rouge"><pre class="highlight"><code>./bin/metastore.sh restore &lt;output_folder&gt;
 </code></pre>
 </div>
 
-<h3 id="reload-metadata">4. Reload Metadata</h3>
+<h3 id="reload-metadata">5. Reload Metadata</h3>
 <p>Finally, we need to reload metadata in Kylin web UI.</p>
 
 <p>Then, a set of system Cubes will be created under the system project, called “KYLIN_SYSTEM”.</p>
 
-<h3 id="system-cube-build">5. System Cube build</h3>
+<h3 id="system-cube-build">6. System Cube build</h3>
 <p>When the system Cube is created, we need to build the Cube regularly.</p>
 
-<ol>
-  <li>
-    <p>Create a shell script that builds the system Cube by calling org.apache.kylin.tool.job.CubeBuildingCLI</p>
+<p><strong>Step 1</strong>. Create a shell script that builds the system Cube by calling <code class="highlighter-rouge">org.apache.kylin.tool.job.CubeBuildingCLI</code></p>
 
-    <p>For example:</p>
-  </li>
-</ol>
+<p>For example:</p>
 
 <div class="highlight"><pre><code class="language-groff" data-lang="groff">#!/bin/bash
 
@@ -8677,13 +8685,7 @@ ID="$END"
 echo "building for ${CUBE}_${ID}" &gt;&gt; ${KYLIN_HOME}/logs/build_trace.log
 sh ${KYLIN_HOME}/bin/kylin.sh org.apache.kylin.tool.job.CubeBuildingCLI --cube ${CUBE} --endTime ${END} &gt; ${KYLIN_HOME}/logs/system_cube_${CUBE}_${END}.log 2&gt;&amp;1 &amp;</code></pre></div>
 
-<ol>
-  <li>
-    <p>Then run this shell script regularly</p>
-
-    <p>For example, add a cron job as follows:</p>
-  </li>
-</ol>
+<p><strong>Step 2</strong>. Then run this shell script regularly. For example, add a cron job as follows:</p>
 
 <div class="highlight"><pre><code class="language-groff" data-lang="groff">0 */2 * * * sh ${KYLIN_HOME}/bin/system_cube_build.sh KYLIN_HIVE_METRICS_QUERY_QA 3600000 1200000
 
@@ -8695,7 +8697,7 @@ sh ${KYLIN_HOME}/bin/kylin.sh org.apache
 
 50 */12 * * * sh ${KYLIN_HOME}/bin/system_cube_build.sh KYLIN_HIVE_METRICS_JOB_EXCEPTION_QA 3600000 12000</code></pre></div>
 
-<h2 id="automatically-create-system-cube">Automatically create System Cube</h2>
+<h2 id="span-idautomatically-create-system-cubeautomatically-create-system-cubespan"><span id="Automatically create System Cube">Automatically create System Cube</span></h2>
 
 <p>Kylin provides system-cube.sh from v2.6.0, users can automatically create system cube by executing this script.</p>
 
@@ -8711,7 +8713,7 @@ sh ${KYLIN_HOME}/bin/kylin.sh org.apache
   </li>
 </ul>
 
-<h2 id="details-of-system-cube">Details of System Cube</h2>
+<h2 id="span-iddetails-of-system-cubedetails-of-system-cubespan"><span id="Details of System Cube">Details of System Cube</span></h2>
 
 <h3 id="common-dimension">Common Dimension</h3>
 <p>For all of these Cube, admins can query at four time granularities. From higher level to lower, it’s as follows:</p>
@@ -8747,12 +8749,16 @@ sh ${KYLIN_HOME}/bin/kylin.sh org.apache
     <td>the host of server for query engine</td>
   </tr>
   <tr>
+    <td>KUSER</td>
+    <td>the user who executes the query</td>
+  </tr>
+  <tr>
     <td>PROJECT</td>
-    <td></td>
+    <td>the project where the query executes</td>
   </tr>
   <tr>
     <td>REALIZATION</td>
-    <td>in Kylin, there are two OLAP realizations: Cube, or Hybrid of Cubes</td>
+    <td>the cube which the query hits. In Kylin, there are two OLAP realizations: Cube, or Hybrid of Cubes</td>
   </tr>
   <tr>
     <td>REALIZATION_TYPE</td>
@@ -8760,7 +8766,7 @@ sh ${KYLIN_HOME}/bin/kylin.sh org.apache
   </tr>
   <tr>
     <td>QUERY_TYPE</td>
-    <td>users can query on different data sources, CACHE, OLAP, LOOKUP_TABLE, HIVE</td>
+    <td>users can query on different data sources: CACHE, OLAP, LOOKUP_TABLE, HIVE</td>
   </tr>
   <tr>
     <td>EXCEPTION</td>
@@ -8777,7 +8783,7 @@ sh ${KYLIN_HOME}/bin/kylin.sh org.apache
     <td></td>
   </tr>
   <tr>
-    <td>MIN, MAX, SUM of QUERY_TIME_COST</td>
+    <td>MIN, MAX, SUM, PERCENTILE_APPROX of QUERY_TIME_COST</td>
     <td>the time cost for the whole query</td>
   </tr>
   <tr>
@@ -8811,11 +8817,11 @@ sh ${KYLIN_HOME}/bin/kylin.sh org.apache
   </tr>
   <tr>
     <td>PROJECT</td>
-    <td></td>
+    <td>the project where the query executes</td>
   </tr>
   <tr>
     <td>REALIZATION</td>
-    <td></td>
+    <td>the cube which the query hits</td>
   </tr>
   <tr>
     <td>RPC_SERVER</td>
@@ -8823,7 +8829,7 @@ sh ${KYLIN_HOME}/bin/kylin.sh org.apache
   </tr>
   <tr>
     <td>EXCEPTION</td>
-    <td>the exception of a rpc call. If no exception, "NULL" is used</td>
+    <td>the exception of a rpc call. If no exception,"NULL" is used</td>
   </tr>
 </table>
 
@@ -8836,12 +8842,12 @@ sh ${KYLIN_HOME}/bin/kylin.sh org.apache
     <td></td>
   </tr>
   <tr>
-    <td>MAX, SUM of CALL_TIME</td>
+    <td>MAX, SUM, PERCENTILE_APPROX of CALL_TIME</td>
     <td>the time cost of a rpc all</td>
   </tr>
   <tr>
     <td>MAX, SUM of COUNT_SKIP</td>
-    <td>based on fuzzy filters or else, a few rows will be skipped. This indicates the skipped row count</td>
+    <td>based on fuzzy filters or else, a few rows will be skiped. This indicates the skipped row count</td>
   </tr>
   <tr>
     <td>MAX, SUM of SIZE_SCAN</td>
@@ -8873,12 +8879,16 @@ sh ${KYLIN_HOME}/bin/kylin.sh org.apache
     <td></td>
   </tr>
   <tr>
+    <td>SEGMENT_NAME</td>
+    <td></td>
+  </tr>
+  <tr>
     <td>CUBOID_SOURCE</td>
     <td>source cuboid parsed based on query and Cube design</td>
   </tr>
   <tr>
     <td>CUBOID_TARGET</td>
-    <td>target cuboid already pre-calculated and served for source cuboid</td>
+    <td>target cuboid already precalculated and served for source cuboid</td>
   </tr>
   <tr>
     <td>IF_MATCH</td>
@@ -8889,7 +8899,6 @@ sh ${KYLIN_HOME}/bin/kylin.sh org.apache
     <td>whether a query on this Cube is successful or not</td>
   </tr>
 </table>
-
 <table>
   <tr>
     <th colspan="2">Measure</th>
@@ -8899,6 +8908,10 @@ sh ${KYLIN_HOME}/bin/kylin.sh org.apache
     <td></td>
   </tr>
   <tr>
+    <td>WEIGHT_PER_HIT</td>
+    <td></td>
+  </tr>
+  <tr>
     <td>MAX, SUM of STORAGE_CALL_COUNT</td>
     <td>the number of rpc calls for a query hit on this Cube</td>
   </tr>
@@ -8915,19 +8928,19 @@ sh ${KYLIN_HOME}/bin/kylin.sh org.apache
     <td>the sum of row count skipped for the related rpc calls</td>
   </tr>
   <tr>
-    <td>MAX, SUM of STORAGE_SIZE_SCAN</td>
+    <td>MAX, SUM of STORAGE_COUNT_SCAN</td>
     <td>the sum of row count scanned for the related rpc calls</td>
   </tr>
   <tr>
-    <td>MAX, SUM of STORAGE_SIZE_RETURN</td>
+    <td>MAX, SUM of STORAGE_COUNT_RETURN</td>
     <td>the sum of row count returned for the related rpc calls</td>
   </tr>
   <tr>
-    <td>MAX, SUM of STORAGE_SIZE_AGGREGATE</td>
+    <td>MAX, SUM of STORAGE_COUNT_AGGREGATE</td>
     <td>the sum of row count aggregated for the related rpc calls</td>
   </tr>
   <tr>
-    <td>MAX, SUM of STORAGE_SIZE_AGGREGATE_FILTER</td>
+    <td>MAX, SUM of STORAGE_COUNT_AGGREGATE_FILTER</td>
     <td>the sum of row count aggregated and filtered for the related rpc calls, = STORAGE_SIZE_SCAN - STORAGE_SIZE_RETURN</td>
   </tr>
 </table>
@@ -8945,16 +8958,24 @@ sh ${KYLIN_HOME}/bin/kylin.sh org.apache
     <th colspan="2">Dimension</th>
   </tr>
   <tr>
+    <td>HOST</td>
+    <td>the host of server for job engine</td>
+  </tr>
+  <tr>
+    <td>KUSER</td>
+    <td>the user who run the job</td>
+  </tr>
+  <tr>
     <td>PROJECT</td>
-    <td></td>
+    <td>the project where the job runs</td>
   </tr>
   <tr>
     <td>CUBE_NAME</td>
-    <td></td>
+    <td>the cube with which the job is related</td>
   </tr>
   <tr>
     <td>JOB_TYPE</td>
-    <td></td>
+    <td>build, merge or optimize</td>
   </tr>
   <tr>
     <td>CUBING_TYPE</td>
@@ -8971,7 +8992,7 @@ sh ${KYLIN_HOME}/bin/kylin.sh org.apache
     <td></td>
   </tr>
   <tr>
-    <td>MIN, MAX, SUM of DURATION</td>
+    <td>MIN, MAX, SUM, PERCENTILE_APPROX of DURATION</td>
     <td>the duration from a job start to finish</td>
   </tr>
   <tr>
@@ -8988,7 +9009,23 @@ sh ${KYLIN_HOME}/bin/kylin.sh org.apache
   </tr>
   <tr>
     <td>MIN, MAX, SUM of WAIT_RESOURCE_TIME</td>
-    <td>a job may includes several MR(map reduce) jobs. Those MR jobs may wait because of lack of Hadoop resources.</td>
+    <td>a job may includes serveral MR(map reduce) jobs. Those MR jobs may wait because of lack of Hadoop resources.</td>
+  </tr>
+  <tr>
+    <td>MAX, SUM of step_duration_distinct_columns</td>
+    <td></td>
+  </tr>
+  <tr>
+    <td>MAX, SUM of step_duration_dictionary</td>
+    <td></td>
+  </tr>
+  <tr>
+    <td>MAX, SUM of step_duration_inmem_cubing</td>
+    <td></td>
+  </tr>
+  <tr>
+    <td>MAX, SUM of step_duration_hfile_convert</td>
+    <td></td>
   </tr>
 </table>
 
@@ -9000,20 +9037,28 @@ sh ${KYLIN_HOME}/bin/kylin.sh org.apache
     <th colspan="2">Dimension</th>
   </tr>
   <tr>
+    <td>HOST</td>
+    <td>the host of server for job engine</td>
+  </tr>
+  <tr>
+    <td>KUSER</td>
+    <td>the user who run a job</td>
+  </tr>
+  <tr>
     <td>PROJECT</td>
-    <td></td>
+    <td>the project where the job runs</td>
   </tr>
   <tr>
     <td>CUBE_NAME</td>
-    <td></td>
+    <td>the cube with which the job is related</td>
   </tr>
   <tr>
     <td>JOB_TYPE</td>
-    <td></td>
+    <td>build, merge or optimize</td>
   </tr>
   <tr>
     <td>CUBING_TYPE</td>
-    <td></td>
+    <td>in kylin, there are two cubing algorithms, Layered &amp; Fast(InMemory)</td>
   </tr>
   <tr>
     <td>EXCEPTION</td>

Modified: kylin/site/feed.xml
URL: http://svn.apache.org/viewvc/kylin/site/feed.xml?rev=1883250&r1=1883249&r2=1883250&view=diff
==============================================================================
--- kylin/site/feed.xml (original)
+++ kylin/site/feed.xml Tue Nov 10 14:09:19 2020
@@ -19,8 +19,8 @@
     <description>Apache Kylin Home</description>
     <link>http://kylin.apache.org/</link>
     <atom:link href="http://kylin.apache.org/feed.xml" rel="self" type="application/rss+xml"/>
-    <pubDate>Sun, 18 Oct 2020 05:37:07 -0700</pubDate>
-    <lastBuildDate>Sun, 18 Oct 2020 05:37:07 -0700</lastBuildDate>
+    <pubDate>Tue, 10 Nov 2020 05:59:14 -0800</pubDate>
+    <lastBuildDate>Tue, 10 Nov 2020 05:59:14 -0800</lastBuildDate>
     <generator>Jekyll v2.5.3</generator>
     
       <item>