Posted to issues@streampark.apache.org by "xujiangfeng001 (via GitHub)" <gi...@apache.org> on 2023/05/06 06:44:13 UTC

[GitHub] [incubator-streampark] xujiangfeng001 commented on issue #2423: [Feature] flink cluster failure alarm&failover

xujiangfeng001 commented on issue #2423:
URL: https://github.com/apache/incubator-streampark/issues/2423#issuecomment-1537068906

   Hello everyone, after discussion, our solution for this issue is as follows:
   Requirement:
   
   - When a Flink cluster encounters an exception and can no longer run, alert the user and suppress the alert notifications for the jobs running in that cluster.
   
   Detailed logic:
   
   1. The Flink cluster implements health detection and status updates, as detailed in: https://github.com/apache/incubator-streampark/pull/2675
   2. When an exception occurs in a job, first determine whether the job's deployment mode is remote, yarn session, or k8s session:
      - If not, send the job alarm directly.
      - If so, obtain the Flink cluster status through the job's `Flink Cluster ID`:
        - If the status is `STOP` or `LOST`, block the job alarm and wait for the Flink cluster alarm.
        - If the status is `RUNNING`, actively trigger a Flink cluster status update request to refresh the cluster's status. If the refreshed status is `STOP` or `LOST`, block the job alarm; if it is still `RUNNING`, send the alarm notification for the job.
   3. The Flink cluster alarm template reuses the job alarm template and adds one piece of information: the number of affected jobs.
   4. Abstract the shared alarm template code to avoid duplication.
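
The decision in step 2 can be sketched roughly as below. This is only an illustration of the proposed logic, not StreamPark's actual code: `JobAlertGate`, `ExecutionMode`, `ClusterState`, `usesSharedCluster`, `isDown` and `shouldSendJobAlert` are all hypothetical names, the two application modes in the enum are assumed examples of non-session deployments, and the `refresh` supplier stands in for whatever call triggers the cluster status update.

```java
import java.util.function.Supplier;

/** Hypothetical sketch of the job-alarm gating described in step 2. */
final class JobAlertGate {

    /** Assumed deployment modes; only the first three run on a shared Flink cluster. */
    enum ExecutionMode {
        REMOTE, YARN_SESSION, KUBERNETES_SESSION, YARN_APPLICATION, KUBERNETES_APPLICATION;

        boolean usesSharedCluster() {
            return this == REMOTE || this == YARN_SESSION || this == KUBERNETES_SESSION;
        }
    }

    /** Only the three cluster states named in the proposal are modelled here. */
    enum ClusterState {
        RUNNING, STOP, LOST;

        boolean isDown() {
            return this == STOP || this == LOST;
        }
    }

    /**
     * Returns true if the job alarm should be sent, false if it should be
     * blocked in favour of the Flink cluster alarm.
     *
     * @param mode    deployment mode of the failed job
     * @param cached  last known status of the job's Flink cluster
     * @param refresh triggers a cluster status update and returns the new status;
     *                only invoked when the cached status is not already down
     */
    static boolean shouldSendJobAlert(ExecutionMode mode,
                                      ClusterState cached,
                                      Supplier<ClusterState> refresh) {
        // Jobs that do not run on a shared cluster always alert directly.
        if (!mode.usesSharedCluster()) {
            return true;
        }
        // Cluster already known to be down: block and wait for the cluster alarm.
        if (cached.isDown()) {
            return false;
        }
        // Cached status says RUNNING: re-check before trusting it.
        ClusterState latest = refresh.get();
        return !latest.isDown();
    }
}
```

For example, `JobAlertGate.shouldSendJobAlert(ExecutionMode.YARN_SESSION, ClusterState.RUNNING, () -> ClusterState.LOST)` returns `false`, because the refreshed status shows the cluster is down. Passing the refresh as a supplier keeps the status update lazy, so it is only triggered in the one branch where the proposal calls for it.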
   
       
   
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscribe@streampark.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org