You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@hbase.apache.org by "Michael Stack (Jira)" <ji...@apache.org> on 2021/03/12 05:24:00 UTC

[jira] [Resolved] (HBASE-25566) RoundRobinTableInputFormat

     [ https://issues.apache.org/jira/browse/HBASE-25566?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Michael Stack resolved HBASE-25566.
-----------------------------------
    Fix Version/s: 2.4.2
                   2.3.5
                   3.0.0-alpha-1
     Hadoop Flags: Reviewed
     Release Note: Adds RoundRobinTableInputFormat, a subclass of TableInputFormat, that takes the TIF#getSplits list and resorts it so as to spread the InputFormats as broadly about the cluster as possible. RRTIF works to frustrate bunching of InputSplits on RegionServers to avoid the scenario where a few RegionServers are working hard fielding many InputSplits while others idle hosting a few or none.
       Resolution: Fixed

> RoundRobinTableInputFormat
> --------------------------
>
>                 Key: HBASE-25566
>                 URL: https://issues.apache.org/jira/browse/HBASE-25566
>             Project: HBase
>          Issue Type: Improvement
>          Components: mapreduce
>            Reporter: Michael Stack
>            Assignee: Sudhir Reddy
>            Priority: Major
>             Fix For: 3.0.0-alpha-1, 2.3.5, 2.4.2
>
>
> [~huaxiang] figured we needed a round robin on the result returned out TableInputFormat to better distribute task on a production cluster at our place-of-employ.
> This seems to be an old problem. See this helpful, old Flipkart blog: https://tech.flipkart.com/is-data-locality-always-out-of-the-box-in-hadoop-not-really-2ae9c95163cb



--
This message was sent by Atlassian Jira
(v8.3.4#803005)