You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@hbase.apache.org by "Xiaolin Ha (Jira)" <ji...@apache.org> on 2021/03/31 11:53:00 UTC

[jira] [Created] (HBASE-25720) Sync WAL stuck when prepare flush cache will prevent flush cache and cause OOM

Xiaolin Ha created HBASE-25720:
----------------------------------

             Summary: Sync WAL stuck when prepare flush cache will prevent flush cache and cause OOM
                 Key: HBASE-25720
                 URL: https://issues.apache.org/jira/browse/HBASE-25720
             Project: HBase
          Issue Type: Improvement
    Affects Versions: 1.4.13
            Reporter: Xiaolin Ha
            Assignee: Xiaolin Ha
         Attachments: prepare-flush-cache-stuck.png

We call HRegion#doSyncOfUnflushedWALChanges when preparing to flush cache. But this WAL sync may stuck, and abort the flush of cache. 

!prepare-flush-cache-stuck.png|width=519,height=246!

If we cannot aware of this problem in time, RS will OOM kill.

I think we should force abort RS when sync stuck in preparing, like in committing snapshots.

 

 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)