You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@inlong.apache.org by do...@apache.org on 2022/06/18 04:17:02 UTC

[incubator-inlong-website] branch master updated: [INLONG-4696][Doc] Update the supported data nodes (#421)

This is an automated email from the ASF dual-hosted git repository.

dockerzhang pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/incubator-inlong-website.git


The following commit(s) were added to refs/heads/master by this push:
     new 7372c0c86 [INLONG-4696][Doc] Update the supported data nodes (#421)
7372c0c86 is described below

commit 7372c0c866b4f1bd9ceeea52a081bf0537604bcc
Author: Charles Zhang <do...@apache.org>
AuthorDate: Sat Jun 18 12:16:58 2022 +0800

    [INLONG-4696][Doc] Update the supported data nodes (#421)
---
 docs/design_and_concept/basic_concept.md           | 26 ++++++++--------
 docs/introduction.md                               | 36 +++++++++++++++-------
 .../current/design_and_concept/basic_concept.md    | 26 ++++++++--------
 .../current/introduction.md                        | 36 +++++++++++++++-------
 4 files changed, 76 insertions(+), 48 deletions(-)

diff --git a/docs/design_and_concept/basic_concept.md b/docs/design_and_concept/basic_concept.md
index 7f7e9a7bf..3329aa979 100644
--- a/docs/design_and_concept/basic_concept.md
+++ b/docs/design_and_concept/basic_concept.md
@@ -3,16 +3,16 @@ title: Basic Concept
 sidebar_position: 1
 ---
 
-| Name       | Description                                                                                                                | Other                                                                 |
-|------------|----------------------------------------------------------------------------------------------------------------------------|-----------------------------------------------------------------------|
-| Group      | Data Streams Group, it contains multiple data streams, and one Group represents one data ingestion.                        | Group has attributes such as ID and Name.                             |
-| Stream     | Data Stream, a stream has a specific flow direction.                                                                       | Stream has attributes such as ID, Name, and data fields.              |
-| Node       | Data Node, including `Extract Node` and `Load Node`,  stands for the data source and sink types separately.                |                                                                       |
-| InLongMsg  | InLong data format, if you consume message directly from the message queue, you need to perform `InLongMsg` parsing first. |                                                                       |
-| Agent      | Represents various collection capabilities.                                                                                | It contains File Agent, SQL Agent, Binlog Agent, etc.                 |
-| DataProxy  | Forward received data to different message queues.                                                                         | Supports data transmission blocking, placing retransmission.          |
-| Sort       | Data stream sorting                                                                                                        | Sort-flink based on Flink, sort-standalone for local sorting.         |
-| TubeMQ     | InLong's self-developed message queuing service                                                                            | It can also be called Tube, with low-cost, high-performance features. |
-| Pulsar     | [Apache Pulsar](https://pulsar.apache.org/), a high-performance, high-consistency message queue service                    |                                                                       |
-| Hive       | [Apache Hive](https://hive.apache.org/), a data warehouse built on the Hadoop architecture                                 |                                                                       |
-| ClickHouse | [ClickHouse](https://clickhouse.com/), a high performance columnar OLAP database                                           |                                                                       |
\ No newline at end of file
+| Name                     | Description                                                                                                                 | Other                                                                              |
+|--------------------------|-----------------------------------------------------------------------------------------------------------------------------|------------------------------------------------------------------------------------|
+| Standard Architecture    | Contains all InLong components such as InLong Agent/Manager/MQ/Sort/Dashboard                                               | Suitable for massive data and large-scale production environments                  |
+| Lightweight Architecture | Contains only one component of InLong Sort, which can be used with Manager/Dashboard                                        | The lightweight architecture is simple and flexible, suitable for small-scale data |
+| Group                    | Data Streams Group, it contains multiple data streams, and one Group represents one data ingestion.                         | Group has attributes such as ID and Name.                                          |
+| Stream                   | Data Stream, a stream has a specific flow direction.                                                                        | Stream has attributes such as ID, Name, and data fields.                           |
+| Node                     | Data Node, including `Extract Node` and `Load Node`,  stands for the data source and sink types separately.                 |                                                                                    |
+| InLongMsg                | InLong data format, if you consume message directly from the message queue, you need to perform `InLongMsg` parsing first.  |                                                                                    |
+| Agent                    | Represents various collection capabilities.                                                                                 | It contains File Agent, SQL Agent, Binlog Agent, etc.                              |
+| DataProxy                | Forward received data to different message queues.                                                                          | Supports data transmission blocking, placing retransmission.                       |
+| Sort                     | Data stream sorting                                                                                                         | Sort-flink based on Flink, sort-standalone for local sorting.                      |
+| TubeMQ                   | InLong's self-developed message queuing service                                                                             | It can also be called Tube, with low-cost, high-performance features.              |
+| Pulsar                   | [Apache Pulsar](https://pulsar.apache.org/), a high-performance, high-consistency message queue service                     |                                                                                    |
\ No newline at end of file
diff --git a/docs/introduction.md b/docs/introduction.md
index 8c339ec81..fe0bdd45f 100644
--- a/docs/introduction.md
+++ b/docs/introduction.md
@@ -52,14 +52,28 @@ Apache InLong serves the entire life cycle from data collection to landing,  and
 - **inlong-audit**, performs real-time audit and reconciliation on the incoming and outgoing traffic of the Agent, DataProxy, and Sort modules of the InLong system.
 
 ## Supported Data Nodes (Updating)
-| Type         | Name             | Version      | Other                                                                                                             |
-|--------------|------------------|--------------|-------------------------------------------------------------------------------------------------------------------|
-| Extract Node | Auto Push        | None         | Using [SDK](https://inlong.apache.org/docs/next/sdk/dataproxy-sdk/example) to send                                |
-|              | File             | None         | CSV, Key-Value, JSON, Avro                                                                                        |
-|              | Kafka            | 2.x          | Canal JSON                                                                                                        |
-|              | MySQL            | 5.x, 8.x     | Debezium JSON                                                                                                     |
-| Load Node    | Auto Consumption | None         | Using MQ SDK consume messages and [Parse InLongMsg](https://inlong.apache.org/docs/next/development/inlong_msg)   |
-|              | Hive             | 2.x          | TextFile, SequenceFile,OrcFile, Parquet, Avro                                                                     |
-|              | Iceberg          | 0.12.x       | Parquet, Orc, Avro                                                                                                |
-|              | ClickHouse       | v20+         | Canal JSON                                                                                                        |
-|              | Kafka            | 2.x          | JSON, Canal, Avro                                                                                                 |
\ No newline at end of file
+| Type         | Name              | Version                      | Architecture          |
+|--------------|-------------------|------------------------------|-----------------------|
+| Extract Node | Auto Push         | None                         | Standard              |
+|              | File              | None                         | Standard              |
+|              | Kafka             | 2.x                          | Lightweight, Standard |
+|              | MySQL             | 5.6, 5.7, 8.0.x              | Lightweight, Standard |
+|              | MongoDB           | >= 3.6                       | Lightweight           |
+|              | Oracle            | 11,12,19                     | Lightweight           |
+|              | PostgreSQL        | 9.6, 10, 11, 12              | Lightweight           |
+|              | Pulsar            | 2.8.x                        | Lightweight           |
+|              | SQLServer         | 2012, 2014, 2016, 2017, 2019 | Lightweight           |
+| Load Node    | Auto Consumption  | None                         | Standard              |
+|              | Hive              | 1.x, 2.x, 3.x                | Lightweight, Standard |
+|              | Iceberg           | 0.12.x                       | Lightweight, Standard |
+|              | ClickHouse        | 20.7+                        | Lightweight, Standard |
+|              | Kafka             | 2.x                          | Lightweight, Standard |
+|              | HBase             | 2.2.x                        | Lightweight, Standard |
+|              | PostgreSQL        | 9.6, 10, 11, 12              | Lightweight, Standard |
+|              | Oracle            | 11, 12, 19                   | Lightweight, Standard |
+|              | MySQL             | 5.6, 5.7, 8.0.x              | Lightweight, Standard |
+|              | TDSQL-PostgreSQL  | 10.17                        | Lightweight, Standard |
+|              | Greenplum         | 4.x, 5.x, 6.x                | Lightweight, Standard |
+|              | Elasticsearch     | 6.x, 7.x                     | Lightweight, Standard |
+|              | SQLServer         | 2012, 2014, 2016, 2017, 2019 | Lightweight, Standard |
+|              | HDFS              | 2.x, 3.x                     | Lightweight, Standard |
\ No newline at end of file
diff --git a/i18n/zh-CN/docusaurus-plugin-content-docs/current/design_and_concept/basic_concept.md b/i18n/zh-CN/docusaurus-plugin-content-docs/current/design_and_concept/basic_concept.md
index b3a7e3a6f..b3fd1266b 100644
--- a/i18n/zh-CN/docusaurus-plugin-content-docs/current/design_and_concept/basic_concept.md
+++ b/i18n/zh-CN/docusaurus-plugin-content-docs/current/design_and_concept/basic_concept.md
@@ -3,16 +3,16 @@ title: 基本概念
 sidebar_position: 1
 ---
 
-| Name       | Description                                                   | Other                                      |
-|------------|---------------------------------------------------------------|--------------------------------------------|
-| Group      | 数据流组,包含多个数据流,一个Group 代表一个数据接入                                 | Group 有ID、Name 等属性                         |
-| Stream     | 数据流,一个数据流有具体的流向                                               | Stream 有ID、Name、数据字段等属性                    |
-| Node       | 数据节点,包括`Extract Node` 和 `Load Node`,分别代表数据源类型和数据流向目标类型        |                                            |
-| InLongMsg  | InLong 数据格式,如果从消息队列中直接消费,需要先进行`InLongMsg` 解析                  |                                            |
-| Agent      | 代表各种采集能力                                                      | 包含文件Agent、SQL Agent、Binlog Agent 等         |
-| DataProxy  | 将接收到的数据转发到不同的消息队列                                             | 支持数据发送阻塞和落盘重发                              |
-| Sort       | 数据流分拣                                                         | 主要有基于Flink的sort-flink,sort-standalone 本地分拣 |
-| TubeMQ     | InLong自带的消息队列服务                                               | 也可以叫Tube,拥有低成本、高性能特性                       |
-| Pulsar     | 即[Apache Pulsar](https://pulsar.apache.org/), 高性能、高一致性消息队列服务  |                                            |
-| Hive       | 即[Apache Hive](https://hive.apache.org/),一个建立在Hadoop架构之上的数据仓库 |                                            |
-| ClickHouse | [ClickHouse](https://clickhouse.com/),高性能列式OLAP 数据库           |                                            |
\ No newline at end of file
+| Name                      | Description                                                   | Other                                      |
+|---------------------------|---------------------------------------------------------------|--------------------------------------------|
+| Standard Architecture     | 标准架构,包含 InLong Agent/Manager/MQ/Sort/Dashboard 等所有 InLong 组件  | 适合海量数据、大规模生产环境                             |
+| Lightweight Architecture  | 轻量化架构,只包含 InLong Sort 一个组件,可以搭配 Manager/Dashboard 一起使用        | 轻量化架构简单、灵活,适合小规模数据                         |
+| Group                     | 数据流组,包含多个数据流,一个Group 代表一个数据接入                                 | Group 有ID、Name 等属性                         |
+| Stream                    | 数据流,一个数据流有具体的流向                                               | Stream 有ID、Name、数据字段等属性                    |
+| Node                      | 数据节点,包括`Extract Node` 和 `Load Node`,分别代表数据源类型和数据流向目标类型        |                                            |
+| InLongMsg                 | InLong 数据格式,如果从消息队列中直接消费,需要先进行`InLongMsg` 解析                  |                                            |
+| Agent                     | 代表各种采集能力                                                      | 包含文件Agent、SQL Agent、Binlog Agent 等         |
+| DataProxy                 | 将接收到的数据转发到不同的消息队列                                             | 支持数据发送阻塞和落盘重发                              |
+| Sort                      | 数据流分拣                                                         | 主要有基于Flink的sort-flink,sort-standalone 本地分拣 |
+| TubeMQ                    | InLong自带的消息队列服务                                               | 也可以叫Tube,拥有低成本、高性能特性                       |
+| Pulsar                    | 即[Apache Pulsar](https://pulsar.apache.org/), 高性能、高一致性消息队列服务  |                                            |
\ No newline at end of file
diff --git a/i18n/zh-CN/docusaurus-plugin-content-docs/current/introduction.md b/i18n/zh-CN/docusaurus-plugin-content-docs/current/introduction.md
index d2545a237..6feb46dea 100644
--- a/i18n/zh-CN/docusaurus-plugin-content-docs/current/introduction.md
+++ b/i18n/zh-CN/docusaurus-plugin-content-docs/current/introduction.md
@@ -46,16 +46,30 @@ Apache InLong 服务于数据采集到落地的整个生命周期,按数据的
 - **inlong-audit**,对InLong系统的Agent、DataProxy、Sort模块的入流量、出流量进行实时审计对账。
 
 ## 已支持数据节点(更新中)
-| 类型           | 名称            | 版本           | 备注                                                                                                       |
-|--------------|---------------|--------------|---------------------------------------------------------------------------------------------------------------|
-| Extract Node | 自主推送          | 无            | 使用 [SDK](https://inlong.apache.org/zh-CN/docs/next/sdk/dataproxy-sdk/example) 发送                         |
-|              | File          | 无            | CSV, Key-Value, JSON, Avro                                                                                   |
-|              | Kafka         | 2.x          | Canal JSON                                                                                                    |
-|              | MySQL         | 5.x, 8.x     | Debezium JSON                                                                                                 |
-| Load Node    | 自主消费          | 无            | 使用 MQ SDK 消费后再[解析 InLongMsg](https://inlong.apache.org/zh-CN/docs/next/development/inlong_msg)        |
-|              | Hive          | 2.x          | TextFile, SequenceFile,OrcFile, Parquet, Avro                                                                 |
-|              | Iceberg       | 0.12.x       | Parquet, Orc, Avro                                                                                            |
-|              | ClickHouse    | v20+         | Canal JSON                                                                                                    |
-|              | Kafka         | 2.x          | JSON, Canal, Avro                                                                                             |
+| Type         | Name              | Version                      | Architecture          |
+|--------------|-------------------|------------------------------|-----------------------|
+| Extract Node | Auto Push         | None                         | Standard              |
+|              | File              | None                         | Standard              |
+|              | Kafka             | 2.x                          | Lightweight, Standard |
+|              | MySQL             | 5.6, 5.7, 8.0.x              | Lightweight, Standard |
+|              | MongoDB           | >= 3.6                       | Lightweight           |
+|              | Oracle            | 11,12,19                     | Lightweight           |
+|              | PostgreSQL        | 9.6, 10, 11, 12              | Lightweight           |
+|              | Pulsar            | 2.8.x                        | Lightweight           |
+|              | SQLServer         | 2012, 2014, 2016, 2017, 2019 | Lightweight           |
+| Load Node    | Auto Consumption  | None                         | Standard              |
+|              | Hive              | 1.x, 2.x, 3.x                | Lightweight, Standard |
+|              | Iceberg           | 0.12.x                       | Lightweight, Standard |
+|              | ClickHouse        | 20.7+                        | Lightweight, Standard |
+|              | Kafka             | 2.x                          | Lightweight, Standard |
+|              | HBase             | 2.2.x                        | Lightweight, Standard |
+|              | PostgreSQL        | 9.6, 10, 11, 12              | Lightweight, Standard |
+|              | Oracle            | 11, 12, 19                   | Lightweight, Standard |
+|              | MySQL             | 5.6, 5.7, 8.0.x              | Lightweight, Standard |
+|              | TDSQL-PostgreSQL  | 10.17                        | Lightweight, Standard |
+|              | Greenplum         | 4.x, 5.x, 6.x                | Lightweight, Standard |
+|              | Elasticsearch     | 6.x, 7.x                     | Lightweight, Standard |
+|              | SQLServer         | 2012, 2014, 2016, 2017, 2019 | Lightweight, Standard |
+|              | HDFS              | 2.x, 3.x                     | Lightweight, Standard |