You are viewing a plain text version of this content. The canonical link for it is here.
Posted to mapreduce-issues@hadoop.apache.org by "Mike Roark (JIRA)" <ji...@apache.org> on 2013/03/21 18:01:17 UTC
[jira] [Commented] (MAPREDUCE-3859) CapacityScheduler incorrectly
utilizes extra-resources of queue for high-memory jobs
[ https://issues.apache.org/jira/browse/MAPREDUCE-3859?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13609117#comment-13609117 ]
Mike Roark commented on MAPREDUCE-3859:
---------------------------------------
Any updates on this bug? It is affecting us as well.. it has a pretty bad effect on cluster utilization for these kinds of jobs. I tested this locally to see if it was this issue, results below:
10 datanodes, each with 6 slots for reducers, 60 reducer slots total. No other jobs running.
Running jobs in a queue which has "Reduce tasks, Capacity: 4 slots, Maximum capacity: 60 slots"
Reducer sleeps for a while. This allows me to check steady state reducer slot allocation.
||mapred.reduce.tasks||slots per reducer||expected running reduce tasks||running reduce tasks||using slots||percent of capacity||notes||
|30|1|30|30|30|750| |
|30|2|30|{color:red}16{color}|32|800| |
|30|3|20|{color:red}1{color}|3|75| really bad |
|30|4|10|{color:red}8{color}|32|800| |
|30|5|10|{color:red}8{color}|40|1000| |
|30|6|10|{color:red}8{color}|48|1200| |
|30|7(err)|0|0|0|0| job hangs, but I expected this |
> CapacityScheduler incorrectly utilizes extra-resources of queue for high-memory jobs
> ------------------------------------------------------------------------------------
>
> Key: MAPREDUCE-3859
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-3859
> Project: Hadoop Map/Reduce
> Issue Type: Bug
> Components: capacity-sched
> Affects Versions: 1.0.0
> Environment: CDH3u1
> Reporter: Sergey Tryuber
> Attachments: test-to-fail.patch.txt
>
>
> Imagine, we have a queue A with capacity 10 slots and 20 as extra-capacity, jobs which use 3 map slots will never consume more than 9 slots, regardless how many free slots on a cluster.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira