You are viewing a plain text version of this content. The canonical link for it is here.
Posted to yarn-dev@hadoop.apache.org by "Wang, Xinglong (Jira)" <ji...@apache.org> on 2019/09/25 04:06:00 UTC
[jira] [Created] (YARN-9854) RM jetty hang due to
WebAppProxyServlet lacks of timeout while doing proxyLink
Wang, Xinglong created YARN-9854:
------------------------------------
Summary: RM jetty hang due to WebAppProxyServlet lacks of timeout while doing proxyLink
Key: YARN-9854
URL: https://issues.apache.org/jira/browse/YARN-9854
Project: Hadoop YARN
Issue Type: Improvement
Components: amrmproxy, resourcemanager, webapp
Reporter: Wang, Xinglong
Assignee: Wang, Xinglong
RM will proxy url request to [http://rm:port/proxy/application_xxxxx] to AM or related history server.
Recently we met an issue https://issues.apache.org/jira/browse/SPARK-26961 which will cause Spark AM hang forever.
And we have a monitor tool to access [http://rm:port/proxy/application_xxxxx] periodically. Thus all proxied connection to the hang spark AM will also hang forever due to WebAppProxyServlet is lacking of socket connection timeout setting while initialize httpclient towards this spark AM.
The jetty server holding RM servlets is with limited threads. In this case, each time one such thread will hang due to waiting for Spark AM response. Eventually all jetty threads serving http traffic hang and caused all RM web links not responsive.
If we give timeout config to httpclient, we will be free of this issue.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)
---------------------------------------------------------------------
To unsubscribe, e-mail: yarn-dev-unsubscribe@hadoop.apache.org
For additional commands, e-mail: yarn-dev-help@hadoop.apache.org