Posted to issues@tez.apache.org by "Jeff Zhang (JIRA)" <ji...@apache.org> on 2014/11/19 14:10:33 UTC
[jira] [Commented] (TEZ-1793) Setting different container sizes for
jobs in the same tez session can cause cluster underutilization
[ https://issues.apache.org/jira/browse/TEZ-1793?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14217878#comment-14217878 ]
Jeff Zhang commented on TEZ-1793:
---------------------------------
I think this can be resolved once YARN supports container resizing. Not sure when YARN-1197 will be resolved.
> Setting different container sizes for jobs in the same tez session can cause cluster underutilization
> -----------------------------------------------------------------------------------------------------
>
> Key: TEZ-1793
> URL: https://issues.apache.org/jira/browse/TEZ-1793
> Project: Apache Tez
> Issue Type: Bug
> Reporter: Rajesh Balamohan
>
> set tez.am.container.idle.release-timeout-min.millis to 200000
> set tez.am.container.idle.release-timeout-max.millis to 500000
> set hive.tez.container.size to 4096 (similar to tez.task.resource.memory.mb)
> - Ran a Hive query (i.e. with container size 4096) in Tez mode. On a small cluster (with 720 GB total memory), this launched 180 containers.
> - Reset container size to 8192 and reran the query. This time, it launched 90 containers.
> - Reset container size to 4096 and reran the query. This time, it reused the same 90 containers.
> This could lead to cluster underutilization, where a large job's settings adversely affect subsequent small jobs (e.g. by reducing their concurrency).
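The container counts in the report follow from simple division, and the same arithmetic shows how much capacity the third run leaves idle. Below is a minimal sketch of that calculation, assuming the whole 720 GB is available for task containers (AM and other overhead ignored) and that reused containers keep their original 8192 MB allocation. The function and variable names are hypothetical and not part of Tez or YARN.

```python
# Back-of-the-envelope model of the scenario in the report.
# Assumption: all 720 GB is available for task containers.

CLUSTER_MB = 720 * 1024  # 720 GB total memory, in MB


def max_containers(cluster_mb: int, container_mb: int) -> int:
    """Number of containers of a given size that fit in the cluster."""
    return cluster_mb // container_mb


# Run 1: container size 4096 MB
run1 = max_containers(CLUSTER_MB, 4096)

# Run 2: container size 8192 MB
run2 = max_containers(CLUSTER_MB, 8192)

# Run 3: container size reset to 4096 MB, but the session reuses the
# containers held from run 2, so concurrency stays at run 2's count.
run3 = run2

# Memory the run-3 tasks actually need, as a fraction of the cluster.
run3_needed_fraction = run3 * 4096 / CLUSTER_MB

print(run1, run2, run3, run3_needed_fraction)
```

Under these assumptions the third run has half the task concurrency of the first run even though its per-task memory requirement is identical.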
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)