You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@hbase.apache.org by "Andrew Kyle Purtell (Jira)" <ji...@apache.org> on 2020/11/23 19:12:00 UTC
[jira] [Resolved] (HBASE-24664) Some changing of split region by
overall region size rather than only one store size
[ https://issues.apache.org/jira/browse/HBASE-24664?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Andrew Kyle Purtell resolved HBASE-24664.
-----------------------------------------
Hadoop Flags: Reviewed
Resolution: Fixed
> Some changing of split region by overall region size rather than only one store size
> ------------------------------------------------------------------------------------
>
> Key: HBASE-24664
> URL: https://issues.apache.org/jira/browse/HBASE-24664
> Project: HBase
> Issue Type: Improvement
> Components: regionserver
> Affects Versions: 3.0.0-alpha-1, 2.4.0
> Reporter: Zheng Wang
> Assignee: Zheng Wang
> Priority: Major
> Fix For: 3.0.0-alpha-1, 2.4.0
>
>
> As a distributed cluster, HBase distribute loads in unit of region, so if region grows too big,
> it will bring some negative effects, such as:
> 1. Harder to homogenize disk usage(consider locality)
> 2. Might cost more time on region opening
> 3. After split, the daughter region might lead to more io cost on compaction in a short time(if write evenly)
> I tried to introduce a new SteppingAllStoresSizeSplitPolicy in HBASE-24530, but after discussed in comments and related [thread|https://lists.apache.org/thread.html/r08a8103e2532eb667a0fcb4efa8a4117b3f82e6251bc4bd0bc157c26%40%3Cdev.hbase.apache.org%3E], finally we decide to change the existing split policy with a new option that if it should count all store files, and for master it would be true, else false.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)