You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@hbase.apache.org by "Karthik Palanisamy (Jira)" <ji...@apache.org> on 2019/09/30 05:08:00 UTC
[jira] [Created] (HBASE-23095) Reuse FileStatus in StoreFileInfo
Karthik Palanisamy created HBASE-23095:
------------------------------------------
Summary: Reuse FileStatus in StoreFileInfo
Key: HBASE-23095
URL: https://issues.apache.org/jira/browse/HBASE-23095
Project: HBase
Issue Type: Improvement
Components: mob, snapshots
Affects Versions: 2.2.1
Reporter: Karthik Palanisamy
Assignee: Karthik Palanisamy
Fix For: 3.0.0
Attachments: PerformanceComparision.pdf
The performance of create snapshot on large MOB table reasonably slow because there are two unnecessary calls to namenode on each Hfile, this while we create snapshot manifest. The first namenode call for getting StoreFile modification time [link|[https://github.com/apache/hbase/blob/master/hbase-server/src/main/java/org/apache/hadoop/hbase/regionserver/StoreFileInfo.java#L139]] which used for metrics and another namenode call for getting StoreFile size [link|[https://github.com/apache/hbase/blob/master/hbase-server/src/main/java/org/apache/hadoop/hbase/snapshot/SnapshotManifestV2.java#L132]] which used in snapshot manifest. Both calls can be avoided and this info can be fetched from existing FileStatus [link|[https://github.com/apache/hbase/blob/master/hbase-server/src/main/java/org/apache/hadoop/hbase/regionserver/StoreFileInfo.java#L155]].
PFA. 2x performance is seen after reusing existing FileStatus.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)