You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@hbase.apache.org by "Duo Zhang (Jira)" <ji...@apache.org> on 2022/03/17 15:12:00 UTC

[jira] [Reopened] (HBASE-26833) Avoid waiting to clear buffer usage of ReplicationSourceShipper when aborting the RS

     [ https://issues.apache.org/jira/browse/HBASE-26833?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Duo Zhang reopened HBASE-26833:
-------------------------------

Reopen for applying an addendum.

It introduces NPE in TestReplicationSource.testTerminateClearsBuffer. Just a test issue, we do not have server set up when mocking.

> Avoid waiting to clear buffer usage of ReplicationSourceShipper when aborting the RS
> ------------------------------------------------------------------------------------
>
>                 Key: HBASE-26833
>                 URL: https://issues.apache.org/jira/browse/HBASE-26833
>             Project: HBase
>          Issue Type: Improvement
>          Components: regionserver, Replication
>    Affects Versions: 2.4.10
>            Reporter: Xiaolin Ha
>            Assignee: Xiaolin Ha
>            Priority: Major
>             Fix For: 2.5.0, 2.6.0, 3.0.0-alpha-3, 2.4.11
>
>
> HBASE-24813 introduced the clear of buffer used in replication source shipper, but there is sleep in the method, if the variable sleepForRetries has a large value, and there are many wal groups, the aborting of RS may last a long time, but we should only do some necessary things in the aborting progress.
> {code:java}
> void clearWALEntryBatch() {
>   long timeout = System.currentTimeMillis() + this.shipEditsTimeout;
>   while(this.isAlive() || this.entryReader.isAlive()){
>     try {
>       if (System.currentTimeMillis() >= timeout) {
>         LOG.warn("Shipper clearWALEntryBatch method timed out whilst waiting reader/shipper "
>           + "thread to stop. Not cleaning buffer usage. Shipper alive: {}; Reader alive: {}",
>           this.source.getPeerId(), this.isAlive(), this.entryReader.isAlive());
>         return;
>       } else {
>         // Wait both shipper and reader threads to stop
>         Thread.sleep(this.sleepForRetries);
>       }
>     } catch (InterruptedException e) {
>       LOG.warn("{} Interrupted while waiting {} to stop on clearWALEntryBatch. "
>           + "Not cleaning buffer usage: {}", this.source.getPeerId(), this.getName(), e);
>       return;
>     }
>   }
> ...... {code}



--
This message was sent by Atlassian Jira
(v8.20.1#820001)