You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@inlong.apache.org by do...@apache.org on 2022/11/15 14:15:27 UTC

[inlong-website] 01/01: [INLONG-600][Doc] Update the concept and introduction guide

This is an automated email from the ASF dual-hosted git repository.

dockerzhang pushed a commit to branch fix-600
in repository https://gitbox.apache.org/repos/asf/inlong-website.git

commit aa1f59dcb3cc1fb979ea4bcf123f79f553faf710
Author: dockerzhang <do...@apache.org>
AuthorDate: Tue Nov 15 22:15:15 2022 +0800

    [INLONG-600][Doc] Update the concept and introduction guide
---
 docs/design_and_concept/basic_concept.md           | 28 ++++++++++++----------
 docs/introduction.md                               |  6 ++---
 .../current/design_and_concept/basic_concept.md    | 28 ++++++++++++----------
 .../current/introduction.md                        | 12 +++++-----
 4 files changed, 39 insertions(+), 35 deletions(-)

diff --git a/docs/design_and_concept/basic_concept.md b/docs/design_and_concept/basic_concept.md
index 823e96dee4..cf41f67583 100644
--- a/docs/design_and_concept/basic_concept.md
+++ b/docs/design_and_concept/basic_concept.md
@@ -3,16 +3,18 @@ title: Basic Concept
 sidebar_position: 1
 ---
 
-| Name                     | Description                                                                                                                | Other                                                                              |
-|--------------------------|----------------------------------------------------------------------------------------------------------------------------|------------------------------------------------------------------------------------|
-| Standard Architecture    | Contains all InLong components such as InLong Agent/Manager/MQ/Sort/Dashboard                                              | Suitable for massive data and large-scale production environments                  |
-| Lightweight Architecture | Contains only one component of InLong Sort, which can be used with Manager/Dashboard                                       | The lightweight architecture is simple and flexible, suitable for small-scale data |
-| Group                    | Data Streams Group, it contains multiple data streams, and one Group represents one data ingestion.                        | Group has attributes such as ID and Name.                                          |
-| Stream                   | Data Stream, a stream has a specific flow direction.                                                                       | Stream has attributes such as ID, Name, and data fields.                           |
-| Node                     | Data Node, including `Extract Node` and `Load Node`,  stands for the data source and sink types separately.                |                                                                                    |
-| InLongMsg                | InLong data format, if you consume message directly from the message queue, you need to perform `InLongMsg` parsing first. |                                                                                    |
-| Agent                    | The standard architecture uses Agent for data collection, and Agent represents different types of collection capabilities. | It contains File Agent, SQL Agent, Binlog Agent, etc.                              |
-| DataProxy                | Forward received data to different message queues.                                                                         | Supports data transmission blocking, placing retransmission.                       |
-| Sort                     | Data stream sorting.                                                                                                       | Sort-flink based on Flink, sort-standalone for local sorting.                      |
-| TubeMQ                   | InLong's self-developed message queuing service                                                                            | It can also be called Tube, with low-cost, high-performance features.              |
-| Pulsar                   | [Apache Pulsar](https://pulsar.apache.org/), a high-performance, high-consistency message queue service                    |                                                                                    |
\ No newline at end of file
+| Name        | Description                                                                                                                | Other                                                                              |
+|-------------|----------------------------------------------------------------------------------------------------------------------------|------------------------------------------------------------------------------------|
+| Standard    | Standard Architecture, contains all InLong components such as InLong Agent, Manager, MQ, Sort, Dashboard                   | Suitable for massive data and large-scale production environments                  |
+| Lightweight | Lightweight Architecture, contains only one component of InLong Sort, which also can be used with Manager, Dashboard       | The lightweight architecture is simple and flexible, suitable for small-scale data |
+| Group       | Data Streams Group, it contains multiple data streams, and one Group represents one data business unit.                    | Group has attributes such as ID and Name.                                          |
+| Stream      | Data Stream, a stream has a specific data source, data format and data sink.                                               | Stream has attributes such as ID, Name, and data fields.                           |
+| Node        | Data Node, including `Extract Node` and `Load Node`,  stands for the data source and sink types separately.                |                                                                                    |
+| InLongMsg   | InLong data format, if you consume message directly from the message queue, you need to perform `InLongMsg` parsing first. |                                                                                    |
+| Cluster     | Each component can form a single cluster.                                                                                  | Contains cluster name, label, necessary information for each component, etc.       |
+| Tag         | Clusters of different components can use the same tag to represent a set of data stream execution units.                   | Currently tag are only available for clusters                                      |
+| Agent       | The standard architecture uses Agent for data collection, and Agent represents different types of collection capabilities. | It contains File Agent, SQL Agent, Binlog Agent, etc.                              |
+| DataProxy   | Forward received data to different message queues.                                                                         | Supports data transmission blocking, placing retransmission.                       |
+| Sort        | Data stream sorting.                                                                                                       | Sort-flink based on Flink, sort-standalone for local sorting.                      |
+| TubeMQ      | InLong's self-developed message queuing service                                                                            | It can also be called Tube, with low-cost, high-performance features.              |
+| Pulsar      | [Apache Pulsar](https://pulsar.apache.org/), a high-performance, high-consistency message queue service                    |                                                                                    |
\ No newline at end of file
diff --git a/docs/introduction.md b/docs/introduction.md
index 685e1ef1a9..77699c6d8f 100644
--- a/docs/introduction.md
+++ b/docs/introduction.md
@@ -1,5 +1,5 @@
 ---
-title: InLong Introduction
+title: Introduction
 sidebar_position: 1
 ---
 
@@ -45,10 +45,10 @@ values={[
 {label: 'Standard', value: 'standard'},
 {label: 'Lightweight', value: 'lightweight'},
 ]}>
-<TabItem value="standard">
+<TabItem value="standard"> Standard Architecture: contains all InLong components such as InLong Agent, Manager, MQ, Sort, Dashboard, which is suitable for massive data and large-scale production environments.
 <img src="/img/inlong-structure-en.png" align="center" alt="Apache InLong"/>
 </TabItem>
-<TabItem value="lightweight">
+<TabItem value="lightweight"> Lightweight Architecture: contains only one component of InLong Sort, which also can be used with Manager, Dashboard, and it is simple and flexible, suitable for small-scale data.
 <img src="/img/inlong-structure-light.png" align="center" style={{objectFit: 'cover', height: '460px', width: '860px'}} alt="Apache InLong"/>
 </TabItem>
 </Tabs>
diff --git a/i18n/zh-CN/docusaurus-plugin-content-docs/current/design_and_concept/basic_concept.md b/i18n/zh-CN/docusaurus-plugin-content-docs/current/design_and_concept/basic_concept.md
index 3195b2850a..2f7e1733d2 100644
--- a/i18n/zh-CN/docusaurus-plugin-content-docs/current/design_and_concept/basic_concept.md
+++ b/i18n/zh-CN/docusaurus-plugin-content-docs/current/design_and_concept/basic_concept.md
@@ -3,16 +3,18 @@ title: 基本概念
 sidebar_position: 1
 ---
 
-| Name                      | Description                                                  | Other                                         |
-|---------------------------|--------------------------------------------------------------|-----------------------------------------------|
-| Standard Architecture     | 标准架构,包含 InLong Agent/Manager/MQ/Sort/Dashboard 等所有 InLong 组件 | 适合海量数据、大规模生产环境                                |
-| Lightweight Architecture  | 轻量化架构,只包含 InLong Sort 一个组件,可以搭配 Manager/Dashboard 一起使用       | 轻量化架构简单、灵活,适合小规模数据                            |
-| Group                     | 数据流组,包含多个数据流,一个Group 代表一个数据接入                                | Group 有ID、Name 等属性                            |
-| Stream                    | 数据流,一个数据流有具体的流向                                              | Stream 有ID、Name、数据字段等属性                       |
-| Node                      | 数据节点,包括`Extract Node` 和 `Load Node`,分别代表数据源类型和数据流向目标类型       |                                               |
-| InLongMsg                 | InLong 数据格式,如果从消息队列中直接消费,需要先进行 `InLongMsg` 解析                |                                               |
-| Agent                     | 标准架构使用 Agent 进行数据采集,Agent 代表不同类型的采集能力                        | 包含文件 Agent、SQL Agent、Binlog Agent 等           |
-| DataProxy                 | 将接收到的数据转发到不同的消息队列                                            | 支持数据发送阻塞和落盘重发                                 |
-| Sort                      | 数据流分拣                                                        | 主要有基于 Flink 的 sort-flink,sort-standalone 本地分拣 |
-| TubeMQ                    | InLong 自带的消息队列服务                                             | 也可以叫 Tube,拥有低成本、高性能特性                         |
-| Pulsar                    | 即[Apache Pulsar](https://pulsar.apache.org/), 高性能、高一致性消息队列服务 |                                               |
\ No newline at end of file
+| Name        | Description                                                  | Other                                         |
+|-------------|--------------------------------------------------------------|-----------------------------------------------|
+| Standard    | 标准架构,包含 InLong Agent、Manager、MQ、Sort、Dashboard 等所有 InLong 组件 | 适合海量数据、大规模生产环境                                |
+| Lightweight | 轻量化架构,只包含 InLong Sort 一个组件,也可以搭配 Manager,Dashboard 一起使用      | 轻量化架构简单、灵活,适合小规模数据                            |
+| Group       | 数据流组,包含多个数据流,一个 Group 代表一个数据业务单元                             | Group 有 ID、Name 等属性                           |
+| Stream      | 数据流,一个数据流有具体的数据源、数据格式和数据流向                                   | Stream 有 ID、Name、数据字段等属性                      |
+| Node        | 数据节点,包括 `Extract Node` 和 `Load Node`,分别代表数据源类型和数据流向目标类型      |                                               |
+| InLongMsg   | InLong 数据格式,如果从消息队列中直接消费,需要先进行 `InLongMsg` 解析                |                                               |
+| Cluster     | 集群,每个组件都可以构成单一集群。                                            | 包含集群名、标签,每个组件必要信息等                            |
+| Tag         | 标签,不同组件的集群可以使用相同标签,代表一组数据流执行单元                               | 目前标签只适用于集群                                    |
+| Agent       | 标准架构使用 Agent 进行数据采集,Agent 代表不同类型的采集能力                        | 包含文件 Agent、SQL Agent、Binlog Agent 等           |
+| DataProxy   | 将接收到的数据转发到不同的消息队列                                            | 支持数据发送阻塞和落盘重发                                 |
+| Sort        | 数据流分拣                                                        | 主要有基于 Flink 的 sort-flink,sort-standalone 本地分拣 |
+| TubeMQ      | InLong 自带的消息队列服务                                             | 也可以叫 Tube,拥有低成本、高性能特性                         |
+| Pulsar      | 即 [Apache Pulsar](https://pulsar.apache.org/),高性能、高一致性消息队列服务 |                                               |
\ No newline at end of file
diff --git a/i18n/zh-CN/docusaurus-plugin-content-docs/current/introduction.md b/i18n/zh-CN/docusaurus-plugin-content-docs/current/introduction.md
index 2db4d30874..b452256aa2 100644
--- a/i18n/zh-CN/docusaurus-plugin-content-docs/current/introduction.md
+++ b/i18n/zh-CN/docusaurus-plugin-content-docs/current/introduction.md
@@ -1,5 +1,5 @@
 ---
-title: InLong 简介
+title: 简介
 sidebar_position: 1
 ---
 
@@ -39,13 +39,13 @@ Apache InLong 依托 10 万亿级别的数据接入和处理能力,整合了
 <Tabs
 defaultValue="standard"
 values={[
-{label: 'Standard', value: 'standard'},
-{label: 'Lightweight', value: 'lightweight'},
+{label: '标准', value: 'standard'},
+{label: '轻量化', value: 'lightweight'},
 ]}>
-<TabItem value="standard">
+<TabItem value="standard"> 标准架构:包含 InLong Agent、Manager、MQ、Sort、Dashboard 等所有 InLong 组件,适合海量数据、大规模生产环境。
 <img src="/img/inlong-structure-zh.png" align="center" alt="Apache InLong"/>
 </TabItem>
-<TabItem value="lightweight">
+<TabItem value="lightweight"> 轻量化架构:只包含 InLong Sort 一个组件,也可以搭配 Manager,Dashboard 一起使用。轻量化架构简单、灵活,适合小规模数据。
 <img src="/img/inlong-structure-light.png" align="center" style={{objectFit: 'cover', height: '460px', width: '860px'}} alt="Apache InLong"/>
 </TabItem>
 </Tabs>
@@ -53,7 +53,7 @@ values={[
 ## 模块
 Apache InLong 服务于数据采集到落地的整个生命周期,按数据的不同阶段提供不同的处理模块,主要包括:
 
-- **inlong-agent**,数据采集服务,包括文件采集、DB采集等。
+- **inlong-agent**,数据采集服务,包括文件采集、DB 采集等。
 - **inlong-dataproxy**,一个基于 Flume-ng 的 Proxy 组件,支持数据发送阻塞和落盘重发,拥有将接收到的数据转发到不同 MQ(消息队列)的能力。
 - **inlong-tubemq**,腾讯自研的消息队列服务,专注于大数据场景下海量数据的高性能存储和传输,在海量实践和低成本方面有着良好的核心优势。
 - **inlong-sort**,对从不同的 MQ 消费到的数据进行 ETL 处理,然后汇聚并写入 Hive、ClickHouse、Hbase、Iceberg 等存储系统。