Posted to yarn-issues@hadoop.apache.org by "Wang, Xinglong (Jira)" <ji...@apache.org> on 2019/09/25 04:06:00 UTC

[jira] [Created] (YARN-9854) RM jetty hang due to WebAppProxyServlet lacks of timeout while doing proxyLink

Wang, Xinglong created YARN-9854:
------------------------------------

             Summary: RM jetty hang due to WebAppProxyServlet lacks of timeout while doing proxyLink
                 Key: YARN-9854
                 URL: https://issues.apache.org/jira/browse/YARN-9854
             Project: Hadoop YARN
          Issue Type: Improvement
          Components: amrmproxy, resourcemanager, webapp
            Reporter: Wang, Xinglong
            Assignee: Wang, Xinglong


The RM proxies URL requests for [http://rm:port/proxy/application_xxxxx] to the AM or the related history server.

Recently we hit https://issues.apache.org/jira/browse/SPARK-26961, which can cause the Spark AM to hang forever.

We also have a monitoring tool that accesses [http://rm:port/proxy/application_xxxxx] periodically. Every proxied connection to the hung Spark AM therefore also hangs forever, because WebAppProxyServlet does not set a socket connection timeout when it initializes the httpclient for this Spark AM.

The Jetty server hosting the RM servlets has a limited number of threads. Each such request leaves one of those threads blocked, waiting for a response from the Spark AM. Eventually all Jetty threads serving HTTP traffic are blocked, and all RM web links become unresponsive.

If we configure connection and socket timeouts on the httpclient, we will be free of this issue.
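
To illustrate the effect of such a timeout, here is a minimal, self-contained sketch. It is not the WebAppProxyServlet code and it does not use the httpclient library from the servlet; it uses the JDK's HttpURLConnection as a stand-in so it runs without dependencies. The class name ProxyTimeoutSketch, the helper fetchStatus, the -1 return value and the 1000 ms timeouts are all hypothetical choices for this example. The hung Spark AM is simulated by a server socket that accepts TCP connections but never sends a response.

```java
import java.io.IOException;
import java.net.HttpURLConnection;
import java.net.InetAddress;
import java.net.ServerSocket;
import java.net.SocketTimeoutException;
import java.net.URI;

public class ProxyTimeoutSketch {

    /**
     * Hypothetical helper: requests the given URL and returns the HTTP status
     * code, or -1 if connecting or reading exceeds the given timeouts
     * (both in milliseconds). Without the two timeouts the call could block
     * for as long as the target stays silent.
     */
    static int fetchStatus(String url, int connectTimeoutMs, int readTimeoutMs)
            throws IOException {
        HttpURLConnection conn =
                (HttpURLConnection) URI.create(url).toURL().openConnection();
        conn.setConnectTimeout(connectTimeoutMs); // limit for establishing the connection
        conn.setReadTimeout(readTimeoutMs);       // limit for waiting on response data
        try {
            return conn.getResponseCode();
        } catch (SocketTimeoutException e) {
            return -1; // target did not answer in time; the calling thread is released
        } finally {
            conn.disconnect();
        }
    }

    public static void main(String[] args) throws IOException {
        // Stand-in for a hung AM: the OS completes the TCP handshake through
        // the listen backlog, but nothing ever reads the request or replies.
        try (ServerSocket hungAm =
                     new ServerSocket(0, 50, InetAddress.getByName("127.0.0.1"))) {
            String url = "http://127.0.0.1:" + hungAm.getLocalPort() + "/";
            int status = fetchStatus(url, 1000, 1000);
            System.out.println("status=" + status);
        }
    }
}
```

With the timeouts set, the request against the silent server gives up and returns -1 instead of holding the calling thread indefinitely, which is the behaviour we would want for the Jetty threads serving the proxy.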


