You are viewing a plain text version of this content. The canonical link for it is here.

Posted to dev@hbase.apache.org by "Bryan Beaudreault (Jira)" <ji...@apache.org> on 2022/12/13 14:03:00 UTC

[jira] [Created] (HBASE-27531) AsyncRequestFutureImpl unnecessarily clears meta cache for full server

Bryan Beaudreault created HBASE-27531:
-----------------------------------------

Summary: AsyncRequestFutureImpl unnecessarily clears meta cache for full server
Key: HBASE-27531
URL: https://issues.apache.org/jira/browse/HBASE-27531
Project: HBase
Issue Type: Improvement
Reporter: Bryan Beaudreault

In AsyncRequestFutureImpl.receiveGlobalFailure, the meta cache is cleared twice. First the entire cache is cleared for the serverName, then each individual region's cache is further cleared. See [here.|https://github.com/apache/hbase/blob/branch-2/hbase-client/src/main/java/org/apache/hadoop/hbase/client/AsyncRequestFutureImpl.java#L766-L780]

This behavior was introduced in HBASE-19900, which was a Jira more related to return value corruption rather than meta cache. I believe the change was an accident in the refactoring. Take a look at the commit [here.|https://github.com/apache/hbase/commit/e2636c81f7700a34368d73079375dbce6e180547#diff-7c58ffd83c150488599591ed5a3a068599646ebdbbbfdcd2233386e5472cca35L709]

In the commit, the cleanServerCache call should have been wrapped in tableName == null. Then the updateCachedLocations call should have been wrapped in tableName != null. So we should only go down one path or another based on the existence of a tableName.

Full server cache clears are extremely expensive in a high concurrency client. This is especially true in AsyncProcess, where a single request might hit many servers. We should reinstate the tableName checks so that we only do one or the other.

--
This message was sent by Atlassian Jira
(v8.20.10#820010)