You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@hbase.apache.org by "Todd Lipcon (JIRA)" <ji...@apache.org> on 2011/08/10 04:41:27 UTC

[jira] [Resolved] (HBASE-3331) Kill -STOP of RS hosting META does not recover

     [ https://issues.apache.org/jira/browse/HBASE-3331?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Todd Lipcon resolved HBASE-3331.
--------------------------------

    Resolution: Cannot Reproduce

Thanks for trying to reproduce, Ming. I'll resolve it as Cannot Reproduce.

> Kill -STOP of RS hosting META does not recover
> ----------------------------------------------
>
>                 Key: HBASE-3331
>                 URL: https://issues.apache.org/jira/browse/HBASE-3331
>             Project: HBase
>          Issue Type: Bug
>    Affects Versions: 0.90.0
>            Reporter: Todd Lipcon
>            Priority: Critical
>             Fix For: 0.92.0
>
>         Attachments: timeouts.log.txt
>
>
> If you find the server hosting META and kill -STOP its region server, it will eventually lose its ZK session and the master will split its logs and try to reassign. However, at some point along here it tries to access the old META, and gets SocketTimeoutExceptions, which cause it to keep retrying forever. Once I kill -9ed the stopped server, things came back to life.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira