You are viewing a plain text version of this content. The canonical link for it is here.

Posted to issues@hbase.apache.org by "Duo Zhang (JIRA)" <ji...@apache.org> on 2019/01/24 00:38:00 UTC

[jira] [Commented] (HBASE-21767) findRegionsToForceFlush could be improved

    [ https://issues.apache.org/jira/browse/HBASE-21767?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16750584#comment-16750584 ] 

Duo Zhang commented on HBASE-21767:
-----------------------------------

Agree.

IIRC for now we only consider the oldest file, maybe we should check all the files over the limit to find out regions to flush?

> findRegionsToForceFlush could be improved
> -----------------------------------------
>
>                 Key: HBASE-21767
>                 URL: https://issues.apache.org/jira/browse/HBASE-21767
>             Project: HBase
>          Issue Type: Improvement
>            Reporter: Sergey Shelukhin
>            Priority: Minor
>         Attachments: image-2019-01-23-16-07-42-166.png
>
>
> We see Too many WALs message logged a lot, and it seems like it usually fails to bring the WAL count back down for a long time, sometimes hours. This results in large steady state WAL volume on some region servers, and slower recovery.
> Based on the functionality in HBASE-21626, it should be possible to add an option to be aggressive in this method, and to determine the minimum set of regions that will actually bring WAL count close to a limit in one operation.
> An example of too-many-WALs log statements reporting WAL count over ~2.5 hours, with a limit of 79. You can see when the count is coming down; it often doesn't come anywhere close to the limit so another flush is requested, etc. for a long time.
>  !image-2019-01-23-16-07-42-166.png! 



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)