You are viewing a plain text version of this content. The canonical link for it is here.
Posted to yarn-issues@hadoop.apache.org by "Krishna Kishore Bonagiri (JIRA)" <ji...@apache.org> on 2013/04/16 10:07:17 UTC
[jira] [Updated] (YARN-541) getAllocatedContainers() is not
returning all the allocated containers
[ https://issues.apache.org/jira/browse/YARN-541?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Krishna Kishore Bonagiri updated YARN-541:
------------------------------------------
Attachment: yarn-dsadm-resourcemanager-isredeng.out
yarn-dsadm-nodemanager-isredeng.out
AppMaster.stdout
Hi Hitesh,
I am attaching the logs for AM, RM, and NM. I have an application being
run in a loop, which requires 5 containers. The 8th run has failed with
this issue of getAllocatedContainers(). The Application Master couldn't get
all the 5 containers it required, the getAllocatedContainers() method
returned only 4. The RM's log is saying that the 5th container is also
allocated thro' the message,
2013-04-16 03:32:54,701 INFO [ResourceManager Event Processor]
rmcontainer.RMContainerImpl (RMContainerImpl.java:handle(220)) -
container_1366096597608_0008_01_000006 Container Transitioned from NEW to
ALLOCATED
In RM's log, you can see that this kind of for the remaining 4 containers
also, i.e. container_1366096597608_0008_01_000002 to
container_1366096597608_0008_01_000005.
Also, as I said before this issue is seen randomly.
Thanks,
Kishore
> getAllocatedContainers() is not returning all the allocated containers
> ----------------------------------------------------------------------
>
> Key: YARN-541
> URL: https://issues.apache.org/jira/browse/YARN-541
> Project: Hadoop YARN
> Issue Type: Bug
> Components: resourcemanager
> Affects Versions: 2.0.3-alpha
> Environment: Redhat Linux 64-bit
> Reporter: Krishna Kishore Bonagiri
> Attachments: AppMaster.stdout, yarn-dsadm-nodemanager-isredeng.out, yarn-dsadm-resourcemanager-isredeng.out
>
>
> I am running an application that was written and working well with the hadoop-2.0.0-alpha but when I am running the same against 2.0.3-alpha, the getAllocatedContainers() method called on AMResponse is not returning all the containers allocated sometimes. For example, I request for 10 containers and this method gives me only 9 containers sometimes, and when I looked at the log of Resource Manager, the 10th container is also allocated. It happens only sometimes randomly and works fine all other times. If I send one more request for the remaining container to RM after it failed to give them the first time(and before releasing already acquired ones), it could allocate that container. I am running only one application at a time, but 1000s of them one after another.
> My main worry is, even though the RM's log is saying that all 10 requested containers are allocated, the getAllocatedContainers() method is not returning me all of them, it returned only 9 surprisingly. I never saw this kind of issue in the previous version, i.e. hadoop-2.0.0-alpha.
> Thanks,
> Kishore
>
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira