You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@hudi.apache.org by GitBox <gi...@apache.org> on 2019/10/09 01:47:58 UTC

[GitHub] [incubator-hudi] leesf commented on a change in pull request #946: [docs][chinese] revert the PR(#900)

leesf commented on a change in pull request #946: [docs][chinese] revert the PR(#900)
URL: https://github.com/apache/incubator-hudi/pull/946#discussion_r332800188
 
 

 ##########
 File path: docs/use_cases.cn.md
 ##########
 @@ -65,4 +65,4 @@ Hadoop的一个流行用例是压缩数据,然后将其分发回在线服务
 例如,一个Spark管道可以[确定Hadoop上的紧急制动事件](https://eng.uber.com/telematics/)并将它们加载到服务存储层(如ElasticSearch)中,供Uber应用程序使用以增加安全驾驶。这种用例中,通常架构会在Hadoop和服务存储之间引入`队列`,以防止目标服务存储被压垮。
 对于队列的选择,一种流行的选择是Kafka,这个模型经常导致__在DFS上存储相同数据的冗余(用于计算结果的离线分析)和Kafka(用于分发)__
 
-通过将每次运行的Spark管道更新插入的输出转换为Hudi数据集,Hudi可以再次有效地解决这个问题,然后可以以增量方式获取尾部数据(就像Kafka主题一样)然后写入服务存储层。
\ No newline at end of file
+通过将每次运行的Spark管道更新插入的输出转换为Hudi数据集,Hudi可以再次有效地解决这个问题,然后可以以增量方式获取尾部数据(就像Kafka主题一样)然后写入服务存储层。
 
 Review comment:
   This line is automatically changed by Intellij IDEA, and it does not make sence, just ignore.

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
users@infra.apache.org


With regards,
Apache Git Services