Posted to issues@hbase.apache.org by "Sergey Shelukhin (JIRA)" <ji...@apache.org> on 2019/02/11 22:59:00 UTC

[jira] [Commented] (HBASE-21577) do not close regions when RS is dying due to a broken WAL

    [ https://issues.apache.org/jira/browse/HBASE-21577?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16765494#comment-16765494 ] 

Sergey Shelukhin commented on HBASE-21577:
------------------------------------------

[~busbey] does this patch make sense to you? It's a small patch. We see the RS taking a very long time to shut down when HDFS produces a lot of failures.

> do not close regions when RS is dying due to a broken WAL
> ---------------------------------------------------------
>
>                 Key: HBASE-21577
>                 URL: https://issues.apache.org/jira/browse/HBASE-21577
>             Project: HBase
>          Issue Type: Improvement
>          Components: regionserver, wal
>            Reporter: Sergey Shelukhin
>            Assignee: Sergey Shelukhin
>            Priority: Critical
>         Attachments: HBASE-21577.master.001.patch, HBASE-21577.master.002.patch
>
>
> See HBASE-21576. DroppedSnapshot can be an FS failure; also, when the WAL is broken, some regions whose flushes are already in flight keep retrying, resulting in minutes-long shutdown times. Since the WAL will be replayed anyway, flushing regions doesn't provide much benefit.


