You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@hbase.apache.org by "Ted Yu (JIRA)" <ji...@apache.org> on 2011/03/22 06:42:05 UTC

[jira] [Updated] (HBASE-3681) Check the sloppiness of the region load before balancing

     [ https://issues.apache.org/jira/browse/HBASE-3681?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Ted Yu updated HBASE-3681:
--------------------------

    Attachment: hbase-3681.txt

Initial attempt.
If slop is greater than 0, some tests fail.
e.g.
{code}
2011-03-21 22:18:44,886 INFO  [main] master.TestLoadBalancer(186): Mock Cluster : { 1 , 1 , 1123 , 133 , 138 , 12 , 1444 , 0 , 0 , 144 , 1 , 1 } [srvr=12 rgns=2998 avg=249.83333 max=250 min=249]
2011-03-21 22:18:44,887 WARN  [main] master.LoadBalancer(322): regionidx=2027, regionsToMove=2027, numServers=12, serversOverloaded=2, serversUnderloaded=10
2011-03-21 22:18:44,887 WARN  [main] master.LoadBalancer(332): Input 127.0.0.1,14173,-4503661418262151144 1, 127.0.0.1,16709,5660298165359772741 1, 127.0.0.1,16753,6033378928668609711 1123, 127.0.0.1,21997,-5339821095735763805 133, 127.0.0.1,25701,-2236437849271081337 138, 127.0.0.1,2735,5768301853710328270 12, 127.0.0.1,28795,1395900819118748885 1444, 127.0.0.1,41820,-7080037238717148768 0, 127.0.0.1,42315,-7850691226193331732 0, 127.0.0.1,55335,-5765170823713071701 144, 127.0.0.1,55580,-8175369880198350855 1, 127.0.0.1,8395,850167680932222284 1
2011-03-21 22:18:44,887 INFO  [main] master.LoadBalancer(336): Calculated a load balance in 1ms. Moving 2027 regions off of 2 overloaded servers onto 10 less loaded servers
2011-03-21 22:18:44,889 INFO  [main] master.TestLoadBalancer(189): Mock Balance : { 248 , 248 , 274 , 248 , 248 , 248 , 274 , 249 , 249 , 216 , 248 , 248 }
{code}


> Check the sloppiness of the region load before balancing
> --------------------------------------------------------
>
>                 Key: HBASE-3681
>                 URL: https://issues.apache.org/jira/browse/HBASE-3681
>             Project: HBase
>          Issue Type: Improvement
>    Affects Versions: 0.90.1
>            Reporter: Jean-Daniel Cryans
>            Assignee: Ted Yu
>             Fix For: 0.92.0
>
>         Attachments: hbase-3681.txt
>
>
> Per our discussion at the hackathon today, it seems that it would be more helpful to add a sloppiness check before doing the normal balancing.
> The current situation is that the balancer always tries to get the region load even, meaning that there can be some very frequent regions movement.
> Setting the balancer to run less often (like every 4 hours) isn't much better since the load could get out of whack easily.
> This is why running the normal balancer frequently, but first checking for some sloppiness in the region load across the RS, seems like a more viable option.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira