Posted to dev@reef.apache.org by "Dhruv Mahajan (JIRA)" <ji...@apache.org> on 2016/10/25 00:26:58 UTC
[jira] [Created] (REEF-1653) Request for evaluator with multiple cores always return evaluator with 1 core only in IMRU
Dhruv Mahajan created REEF-1653:
-----------------------------------
Summary: Request for evaluator with multiple cores always return evaluator with 1 core only in IMRU
Key: REEF-1653
URL: https://issues.apache.org/jira/browse/REEF-1653
Project: REEF
Issue Type: Bug
Components: REEF.NET
Environment: C#
Reporter: Dhruv Mahajan
Currently, when IMRU submits a request to a YARN cluster for evaluators with multiple cores, the evaluators we get back always have only 1 core. The allocated memory is correct: it is the requested value rounded up, which is the desired behavior. The number of cores, however, is wrong.
For example, I asked for an evaluator with 7000 MB of memory and 7 cores. In return I got one with around 9000 MB of memory (9216 MB in the log) and only 1 core, which does not seem right. The relevant driver logs are below:
Org.Apache.REEF.IMRU.OnREEF.Driver.IMRUDriver`4[[TLCPP.LBFGS.MapFunctionInputOutput.MapFunctionInput, TLCPP, Version=1.0.0.0, Culture=neutral, PublicKeyToken=null],[TLCPP.LBFGS.MapFunctionInputOutput.MapFunctionOutput, TLCPP, Version=1.0.0.0, Culture=neutral, PublicKeyToken=null],[Microsoft.MachineLearning.Data.VBuffer`1[[System.Single, mscorlib, Version=4.0.0.0, Culture=neutral, PublicKeyToken=b77a5c561934e089]], Microsoft.MachineLearning.Core, Version=3.4.15.0, Culture=neutral, PublicKeyToken=d353f9ba84f0e281],[Microsoft.MachineLearning.Data.RoleMappedData, Microsoft.MachineLearning.Core, Version=3.4.15.0, Culture=neutral, PublicKeyToken=d353f9ba84f0e281]] Information: 0 : 2016-10-20T20:32:04.0144450+00:00 0001
{color:red}
INFO: map task memory:7000, update task memory:7000, map task cores:7, update task cores:7, maxRetry 10, allowedFailedEvaluators 4.
{color}
INFO: *** Start time is 10/20/2016 8:32:04 PM
Org.Apache.REEF.Driver.Bridge.Events.EvaluatorRequestor Information: 0 : 2016-10-20T20:32:04.0925881+00:00 0001
{color:red}
INFO: Submitting request for 1 evaluators and 7000 MB memory and 7 core to rack and runtime .
{color}
INFO: Allocated Evaluator: container_1475922122639_0016_01_000002, total running running 0
Oct 20, 2016 8:32:06 PM org.apache.reef.javabridge.AllocatedEvaluatorBridge getEvaluatorDescriptorString
{color:red}
INFO: allocated evaluator - serialized evaluator descriptor: IP=xyz, Port=45454, HostName=xyz, Memory=9216, Core=1, RuntimeName=Yarn
{color}
Oct 20, 2016 8:32:08 PM org.apache.hadoop.yarn.client.api.impl.AMRMClientImpl populateNMTokens
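The mismatch in the logs above can be checked mechanically. The following is only a minimal sketch in Python, not part of REEF: the helper names `parse_descriptor` and `check_allocation` are hypothetical, and it assumes the serialized evaluator descriptor always has the comma-separated `Key=Value` form shown in the log line above.

```python
def parse_descriptor(descriptor):
    """Split a 'Key=Value, Key=Value' descriptor string into a dict of strings.

    Assumes the format seen in the driver log; this is not a REEF API.
    """
    fields = {}
    for part in descriptor.split(","):
        key, _, value = part.strip().partition("=")
        fields[key] = value
    return fields


def check_allocation(descriptor, requested_mb, requested_cores):
    """Return a list of human-readable problems with the allocation."""
    fields = parse_descriptor(descriptor)
    allocated_mb = int(fields["Memory"])
    allocated_cores = int(fields["Core"])
    problems = []
    # Memory may legitimately be rounded up, so only flag a shortfall.
    if allocated_mb < requested_mb:
        problems.append(
            "memory: requested %d MB, allocated %d MB" % (requested_mb, allocated_mb)
        )
    if allocated_cores < requested_cores:
        problems.append(
            "cores: requested %d, allocated %d" % (requested_cores, allocated_cores)
        )
    return problems


# Values copied from the log lines in this report.
descriptor = "IP=xyz, Port=45454, HostName=xyz, Memory=9216, Core=1, RuntimeName=Yarn"
problems = check_allocation(descriptor, requested_mb=7000, requested_cores=7)
for problem in problems:
    print(problem)
```

With the values from this report, the memory check passes (9216 MB covers the 7000 MB requested) and only the core count is flagged.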
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)