Posted to issues@hbase.apache.org by "Clara Xiong (Jira)" <ji...@apache.org> on 2020/07/09 20:58:00 UTC

[jira] [Commented] (HBASE-24633) Remove data locality and StoreFileCostFunction for replica regions out of balancer's cost calculation

    [ https://issues.apache.org/jira/browse/HBASE-24633?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17154924#comment-17154924 ] 

Clara Xiong commented on HBASE-24633:
-------------------------------------

[StochasticLoadBalancer|https://github.com/apache/hbase/pull/2003/files#diff-0a0a26a744f6fa61f913310da4adbc8f] is terribly slow, so it is nice to simplify it. I wonder if we can overhaul it completely and rely more on heuristics than on pure randomness. That will need some brainstorming.

> Remove data locality and StoreFileCostFunction for replica regions out of balancer's cost calculation
> -----------------------------------------------------------------------------------------------------
>
>                 Key: HBASE-24633
>                 URL: https://issues.apache.org/jira/browse/HBASE-24633
>             Project: HBase
>          Issue Type: Improvement
>          Components: Balancer
>    Affects Versions: 2.3.0
>            Reporter: Huaxiang Sun
>            Assignee: Huaxiang Sun
>            Priority: Major
>
> We found that one of our clusters with read replicas enabled always moves a lot of replica regions when balancing. Going through the balancer's cost functions, we found that data locality and StoreFileCost have the same multiplier for both primary and replica regions. That is something we can improve. Data locality for replica regions should not be a dominant factor for the balancer. We can either remove it from the balancer's cost calculation for now or give it a small multiplier.
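
The two options in the description (drop replica regions from the locality cost, or give them a small multiplier) can be illustrated with a minimal sketch. This is not the HBase implementation: the class name, method name, and the replicaWeight parameter are hypothetical, and locality is assumed to be a fraction in [0, 1] per region. The cost is a weighted average of (1 - locality), where primary regions carry weight 1.0 and replica regions carry replicaWeight.

```java
public class ReplicaAwareLocalityCostSketch {

    /**
     * Hypothetical locality cost in [0, 1]: a weighted average of (1 - locality).
     * Primary regions have weight 1.0; replica regions have weight replicaWeight.
     * replicaWeight = 1.0 treats replicas like primaries (the behavior the issue
     * describes), 0.0 removes them from the cost, and a small value such as 0.1
     * keeps them as a minor factor.
     */
    static double localityCost(double[] locality, boolean[] isReplica, double replicaWeight) {
        double weightedCost = 0.0;
        double totalWeight = 0.0;
        for (int i = 0; i < locality.length; i++) {
            double weight = isReplica[i] ? replicaWeight : 1.0;
            weightedCost += weight * (1.0 - locality[i]);
            totalWeight += weight;
        }
        // With no weighted regions there is nothing to improve, so the cost is zero.
        return totalWeight == 0.0 ? 0.0 : weightedCost / totalWeight;
    }

    public static void main(String[] args) {
        // One fully local primary region and one fully remote replica region.
        double[] locality = {1.0, 0.0};
        boolean[] isReplica = {false, true};

        System.out.println(localityCost(locality, isReplica, 1.0)); // same multiplier for both
        System.out.println(localityCost(locality, isReplica, 0.1)); // small replica multiplier
        System.out.println(localityCost(locality, isReplica, 0.0)); // replicas removed
    }
}
```

In this toy input, the remote replica alone accounts for the whole locality cost when it is weighted like a primary, which is the kind of pressure that could make a balancer keep moving replica regions. Lowering replicaWeight shrinks that contribution without touching the cost of primary regions.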


