You are viewing a plain text version of this content. The canonical link for it is here.
Posted to yarn-issues@hadoop.apache.org by "MENG DING (JIRA)" <ji...@apache.org> on 2015/11/03 01:09:27 UTC

[jira] [Updated] (YARN-4326) TestDistributedShell timeout as AHS in MiniYarnCluster no longer binds to default port 8188

     [ https://issues.apache.org/jira/browse/YARN-4326?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

MENG DING updated YARN-4326:
----------------------------
    Attachment: YARN-4326.patch

Fix the problem by setting the {{TIMELINE_SERVICE_WEBAPP_ADDRESS}} after MiniYARNCluster is started.

The TestDistributedShell tests are passed now:
{code}
-------------------------------------------------------
 T E S T S
-------------------------------------------------------
Running org.apache.hadoop.yarn.applications.distributedshell.TestDistributedShell
Tests run: 11, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 364.886 sec - in org.apache.hadoop.yarn.applications.distributedshell.TestDistributedShell
Running org.apache.hadoop.yarn.applications.distributedshell.TestDistributedShellWithNodeLabels
Tests run: 1, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 37.699 sec - in org.apache.hadoop.yarn.applications.distributedshell.TestDistributedShellWithNodeLabels

Results :

Tests run: 12, Failures: 0, Errors: 0, Skipped: 0
{code}

> TestDistributedShell timeout as AHS in MiniYarnCluster no longer binds to default port 8188
> -------------------------------------------------------------------------------------------
>
>                 Key: YARN-4326
>                 URL: https://issues.apache.org/jira/browse/YARN-4326
>             Project: Hadoop YARN
>          Issue Type: Bug
>            Reporter: MENG DING
>            Assignee: MENG DING
>         Attachments: YARN-4326.patch
>
>
> The timeout originates in ApplicationMaster, where it fails to connect to timeline server, and retry exceeds limits:
> {code}
> 2015-11-02 21:57:38,066 INFO  [main] impl.TimelineClientImpl (TimelineClientImpl.java:serviceInit(299)) - Timeline service address: http://mdinglin02:0/ws/v1/timeline/
> 2015-11-02 21:57:38,099 INFO  [main] impl.TimelineClientImpl (TimelineClientImpl.java:logException(213)) - Exception caught by TimelineClientConnectionRetry, will try 30 more time(s).
> ...
> ...
> java.lang.RuntimeException: Failed to connect to timeline server. Connection retries limit exceeded. The posted timeline event may be missing
>         at org.apache.hadoop.yarn.client.api.impl.TimelineClientImpl$TimelineClientConnectionRetry.retryOn(TimelineClientImpl.java:206)
>         at org.apache.hadoop.yarn.client.api.impl.TimelineClientImpl$TimelineJerseyRetryFilter.handle(TimelineClientImpl.java:245)
>         at com.sun.jersey.api.client.Client.handle(Client.java:648)
>         at com.sun.jersey.api.client.WebResource.handle(WebResource.java:670)
>         at com.sun.jersey.api.client.WebResource.access$200(WebResource.java:74)
>         at com.sun.jersey.api.client.WebResource$Builder.post(WebResource.java:563)
>         at org.apache.hadoop.yarn.client.api.impl.TimelineClientImpl.doPostingObject(TimelineClientImpl.java:477)
>         at org.apache.hadoop.yarn.client.api.impl.TimelineClientImpl$1.run(TimelineClientImpl.java:326)
>         at org.apache.hadoop.yarn.client.api.impl.TimelineClientImpl$1.run(TimelineClientImpl.java:323)
>         at java.security.AccessController.doPrivileged(Native Method)
>         at javax.security.auth.Subject.doAs(Subject.java:415)
>         at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1669)
>         at org.apache.hadoop.yarn.client.api.impl.TimelineClientImpl.doPosting(TimelineClientImpl.java:323)
>         at org.apache.hadoop.yarn.client.api.impl.TimelineClientImpl.putEntities(TimelineClientImpl.java:308)
>         at org.apache.hadoop.yarn.applications.distributedshell.ApplicationMaster.publishApplicationAttemptEvent(ApplicationMaster.java:1184)
>         at org.apache.hadoop.yarn.applications.distributedshell.ApplicationMaster.run(ApplicationMaster.java:571)
>         at org.apache.hadoop.yarn.applications.distributedshell.ApplicationMaster.main(ApplicationMaster.java:302)
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)