Posted to issues@flink.apache.org by GitBox <gi...@apache.org> on 2022/03/16 14:06:01 UTC

[GitHub] [flink] pnowojski commented on a change in pull request #19021: [hotfix][docs] Clarify semantic of tolerable checkpoint failure number

pnowojski commented on a change in pull request #19021:
URL: https://github.com/apache/flink/pull/19021#discussion_r828045995



##########
File path: docs/content.zh/docs/dev/datastream/fault-tolerance/checkpointing.md
##########
@@ -61,6 +61,10 @@ Other checkpoint properties include:
 
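   - *tolerable number of consecutive checkpoint failures*: This property defines how many consecutive checkpoint failures are tolerated. Once this threshold is exceeded, the job fails over.
     The default is "0", which means no checkpoint failure is tolerated and the job fails over on the first checkpoint failure.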
+    This applies only to the following failure reasons: an IOException on the Job Manager, failures in
+    the async phase on the Task Managers, and checkpoint expiration due to a timeout. Failures
+    originating from the sync phase on the Task Managers always force failover of the affected
+    task. Other types of checkpoint failures (such as a checkpoint being subsumed) are ignored.
     

Review comment:
       Thanks!
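
For readers following the thread, the counting rule described in the added documentation lines can be sketched as a small standalone model. This is not Flink code: the class `TolerableFailureSketch`, its enums, and its method names are hypothetical and exist only for illustration. It also assumes that a successful checkpoint resets the counter (one reading of "consecutive") and that a sync-phase failure leaves the counter untouched, neither of which the diff states explicitly.

```java
import java.util.EnumSet;
import java.util.Set;

// Hypothetical model of the documented semantics; not part of Flink.
public class TolerableFailureSketch {

    // Failure categories named after the reasons listed in the diff.
    public enum Failure {
        JOB_MANAGER_IO_EXCEPTION,
        TASK_ASYNC_PHASE,
        CHECKPOINT_EXPIRED,
        TASK_SYNC_PHASE,
        CHECKPOINT_SUBSUMED
    }

    public enum Outcome { CONTINUE, FAIL_OVER }

    // Only these reasons count against the tolerable-failure threshold.
    private static final Set<Failure> COUNTED = EnumSet.of(
            Failure.JOB_MANAGER_IO_EXCEPTION,
            Failure.TASK_ASYNC_PHASE,
            Failure.CHECKPOINT_EXPIRED);

    private final int tolerable;
    private int consecutive = 0;

    public TolerableFailureSketch(int tolerable) {
        this.tolerable = tolerable;
    }

    // Assumption: a completed checkpoint resets the consecutive counter.
    public void onSuccess() {
        consecutive = 0;
    }

    public Outcome onFailure(Failure failure) {
        if (failure == Failure.TASK_SYNC_PHASE) {
            // Sync-phase failures force failover regardless of the threshold.
            return Outcome.FAIL_OVER;
        }
        if (!COUNTED.contains(failure)) {
            // Other failures (for example a subsumed checkpoint) are ignored.
            return Outcome.CONTINUE;
        }
        consecutive++;
        return consecutive > tolerable ? Outcome.FAIL_OVER : Outcome.CONTINUE;
    }

    public static void main(String[] args) {
        TolerableFailureSketch tracker = new TolerableFailureSketch(1);
        System.out.println(tracker.onFailure(Failure.CHECKPOINT_EXPIRED));
        System.out.println(tracker.onFailure(Failure.CHECKPOINT_SUBSUMED));
        System.out.println(tracker.onFailure(Failure.TASK_ASYNC_PHASE));
    }
}
```

With a threshold of 0 (the documented default), the first counted failure already yields `FAIL_OVER` in this model, which matches the statement that the job fails over on the first checkpoint failure.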



