You are viewing a plain text version of this content. The canonical link for it is here.
Posted to yarn-issues@hadoop.apache.org by "Qi Zhu (Jira)" <ji...@apache.org> on 2021/03/30 05:42:00 UTC

[jira] [Comment Edited] (YARN-10720) YARN WebAppProxyServlet should support connection timeout to prevent too many abnormal connections.

    [ https://issues.apache.org/jira/browse/YARN-10720?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17310481#comment-17310481 ] 

Qi Zhu edited comment on YARN-10720 at 3/30/21, 5:41 AM:
---------------------------------------------------------

cc  [~pbacsko] [~ebadger] [~Jim_Brennan]  [~ztang]  [~epayne] [~gandras]  [~bteke] [~brahmareddy]

 

Could you help review this?

Tested in our test cluster, works well.

Thanks.


was (Author: zhuqi):
cc  [~pbacsko] [~ebadger] [~Jim_Brennan]  [~ztang]  [~epayne] [~gandras]  [~bteke]

Could you help review this?

Tested in our test cluster, works well.

Thanks.

> YARN WebAppProxyServlet should support connection timeout to prevent too many abnormal connections.
> ---------------------------------------------------------------------------------------------------
>
>                 Key: YARN-10720
>                 URL: https://issues.apache.org/jira/browse/YARN-10720
>             Project: Hadoop YARN
>          Issue Type: Bug
>            Reporter: Qi Zhu
>            Assignee: Qi Zhu
>            Priority: Critical
>         Attachments: YARN-10720.001.patch, YARN-10720.002.patch, YARN-10720.003.patch, image-2021-03-29-14-04-33-776.png, image-2021-03-29-14-05-32-708.png
>
>
> Following is proxy server show, {color:#de350b}too many connections from one client{color}, this caused the proxy server hang, and the yarn web can't jump to web proxy.
> !image-2021-03-29-14-04-33-776.png|width=632,height=57!
> Following is the AM which is abnormal, but proxy server don't know it is abnormal already, so the connections can't be closed, we should add time out support in proxy server to prevent this. And one abnormal AM may cause hundreds even thousands of connections, it is very heavy.
> !image-2021-03-29-14-05-32-708.png|width=669,height=101!
>  
> After i kill the abnormal AM, the proxy server become healthy. This case happened many times in our production clusters, our clusters are huge, and the abnormal AM will be existed in a regular case.
>  
> I will add timeout supported in web proxy server in this jira.
>  
> cc  [~pbacsko] [~ebadger] [~Jim_Brennan]  [~ztang]  [~epayne] [~gandras]  [~bteke]
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: yarn-issues-unsubscribe@hadoop.apache.org
For additional commands, e-mail: yarn-issues-help@hadoop.apache.org