You are viewing a plain text version of this content. The canonical link for it is here.

Posted to dev@hbase.apache.org by "Andrew Kyle Purtell (Jira)" <ji...@apache.org> on 2022/06/17 15:31:00 UTC

[jira] [Resolved] (HBASE-10121) Abort wedged Calls after a timeout

     [ https://issues.apache.org/jira/browse/HBASE-10121?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Andrew Kyle Purtell resolved HBASE-10121.
-----------------------------------------
    Resolution: Invalid

> Abort wedged Calls after a timeout
> ----------------------------------
>
>                 Key: HBASE-10121
>                 URL: https://issues.apache.org/jira/browse/HBASE-10121
>             Project: HBase
>          Issue Type: Bug
>    Affects Versions: 0.94.11
>            Reporter: Andrew Kyle Purtell
>            Priority: Major
>         Attachments: screenshot.jpg
>
>
> Saw this on a mail to user@. 
> "REPL IPC Server handler $N on $PORT WAITING Waiting for a call (since 22 hrs, 57mins, 38sec ago)"
> I don't think this is a TCP level issue. We are enabling keepalives on connections by default. Either we failed to remove the call upon exception or the remote is alive but not sending.
> Looking at the IPC server code, I don't see where we abort and clean up wedged Calls after some timeout. Regardless of the other issues here, should we do that?



--
This message was sent by Atlassian Jira
(v8.20.7#820007)