Posted to issues@ratis.apache.org by "Tsz-wo Sze (Jira)" <ji...@apache.org> on 2022/07/21 18:16:00 UTC

[jira] [Resolved] (RATIS-1465) Use separate channel for group heartbeat

     [ https://issues.apache.org/jira/browse/RATIS-1465?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Tsz-wo Sze resolved RATIS-1465.
-------------------------------
    Fix Version/s: 3.0.0
       Resolution: Fixed

The pull request is now merged.  Thanks, [~Sammi]!

> Use separate channel for group heartbeat
> ----------------------------------------
>
>                 Key: RATIS-1465
>                 URL: https://issues.apache.org/jira/browse/RATIS-1465
>             Project: Ratis
>          Issue Type: Improvement
>          Components: server
>            Reporter: Sammi Chen
>            Assignee: Sammi Chen
>            Priority: Major
>             Fix For: 3.0.0
>
>         Attachments: 561_review.patch, follower-hb-process-latency-with-patch.png, follower-hb-process-latency.png, leader-hb-receive-latency-1.png, leader-hb-receive-latency-with-patch.png
>
>          Time Spent: 5.5h
>  Remaining Estimate: 0h
>
> In a cluster under heavy read/write load, the leader is observed to step down frequently because it loses the heartbeat from the majority.
> The investigation shows that heartbeat processing on the follower side is very quick, while heartbeat latency on the leader side is high.  See the attached metrics diagrams.
> This task aims to use a separate gRPC channel for heartbeats to reduce the latency introduced by network queuing.
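
The queuing effect described above can be illustrated with a toy model. The sketch below is not the Ratis implementation and uses no gRPC API; the class HeartbeatChannelSketch, its methods, and all numbers are hypothetical and chosen only for illustration. It assumes each channel is a simple FIFO queue in which every message takes a fixed time to send, and compares the heartbeat latency when heartbeats share a queue with pending append requests against the latency when they have a queue of their own.

```java
import java.util.ArrayDeque;
import java.util.Queue;

public class HeartbeatChannelSketch {
    /** Toy FIFO channel (hypothetical): each queued message has a fixed send cost in ms. */
    private static final class FifoChannel {
        private final Queue<Long> pendingCostsMs = new ArrayDeque<>();

        void enqueue(long costMs) {
            pendingCostsMs.add(costMs);
        }

        /** Time until a message enqueued now is sent: all costs ahead of it plus its own. */
        long completionTimeMs(long costMs) {
            long total = costMs;
            for (long cost : pendingCostsMs) {
                total += cost;
            }
            return total;
        }
    }

    /** Heartbeat latency when it waits behind the pending appends on one shared channel. */
    public static long heartbeatLatencyShared(int pendingAppends, long appendCostMs, long heartbeatCostMs) {
        FifoChannel shared = new FifoChannel();
        for (int i = 0; i < pendingAppends; i++) {
            shared.enqueue(appendCostMs);
        }
        return shared.completionTimeMs(heartbeatCostMs);
    }

    /** Heartbeat latency when the appends go to a data channel and heartbeats have their own. */
    public static long heartbeatLatencyDedicated(int pendingAppends, long appendCostMs, long heartbeatCostMs) {
        FifoChannel data = new FifoChannel();
        FifoChannel heartbeat = new FifoChannel();
        for (int i = 0; i < pendingAppends; i++) {
            data.enqueue(appendCostMs); // the backlog builds up on the data channel only
        }
        return heartbeat.completionTimeMs(heartbeatCostMs);
    }

    public static void main(String[] args) {
        // Illustrative numbers only: 50 pending appends at 20 ms each, heartbeat at 1 ms.
        System.out.println("shared:    " + heartbeatLatencyShared(50, 20, 1) + " ms");    // 1001 ms
        System.out.println("dedicated: " + heartbeatLatencyDedicated(50, 20, 1) + " ms"); // 1 ms
    }
}
```

In this model the heartbeat latency on the shared channel grows with the backlog of append requests, while on the dedicated channel it depends only on other heartbeats. Real gRPC channels have flow control and multiplexing that this sketch ignores.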


