Posted to yarn-issues@hadoop.apache.org by "Song Jiacheng (Jira)" <ji...@apache.org> on 2021/06/23 07:30:00 UTC

[jira] [Commented] (YARN-10794) Submitting jobs to a single subcluster will fail while AMRMProxy is enabled

    [ https://issues.apache.org/jira/browse/YARN-10794?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17367921#comment-17367921 ] 

Song Jiacheng commented on YARN-10794:
--------------------------------------

https://issues.apache.org/jira/browse/YARN-9693

> Submitting jobs to a single subcluster will fail while AMRMProxy is enabled
> ---------------------------------------------------------------------------
>
>                 Key: YARN-10794
>                 URL: https://issues.apache.org/jira/browse/YARN-10794
>             Project: Hadoop YARN
>          Issue Type: Bug
>          Components: nodemanager
>    Affects Versions: 3.2.1
>            Reporter: Song Jiacheng
>            Priority: Major
>         Attachments: YARN-10794.v1.patch, YARN-10794.v2.patch
>
>
> Sorry, I do not know how to link an issue properly, so here is the URL:
> https://issues.apache.org/jira/browse/YARN-9693
> That issue already raised this problem, but it seems that I cannot submit jobs through the federation client while using its patch.
> The root cause is that the NM sets a local AMRMToken for the AM when AMRMProxy is enabled, so the AM fails if it contacts the RM directly.
> This makes a rolling upgrade to federation impossible, because we cannot upgrade all the NMs and clients at the same moment.
> So I developed another patch; with it I can submit jobs both ways.
> My solution is to hold two tokens at the same time and choose the right one when building the RPC client.
> I tested this patch in scenarios such as AM recovery and NM recovery and found no errors.
> Still, I cannot be sure this patch is good, so I wonder whether there is a better solution.
>  
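
The two-token idea described in the issue can be sketched roughly as below. This is only an illustration of the selection step, not the code in the attached patches and not a Hadoop API: the names DualTokenSketch, TokenPair and selectFor are hypothetical, tokens are modelled as plain strings, and the addresses are placeholder values assumed for the example.

```java
import java.util.Objects;

/** Sketch only: hypothetical names, not the YARN-10794 patch or any Hadoop class. */
final class DualTokenSketch {

    /** Holds both tokens an AM may need while AMRMProxy is enabled. */
    static final class TokenPair {
        private final String proxyToken;   // local AMRMToken set by the NM for the AMRMProxy
        private final String rmToken;      // AMRMToken accepted by the RM itself
        private final String proxyAddress; // host:port of the local AMRMProxy (assumed value)

        TokenPair(String proxyToken, String rmToken, String proxyAddress) {
            this.proxyToken = Objects.requireNonNull(proxyToken);
            this.rmToken = Objects.requireNonNull(rmToken);
            this.proxyAddress = Objects.requireNonNull(proxyAddress);
        }

        /** Chooses the token that matches the endpoint the RPC client is built for. */
        String selectFor(String targetAddress) {
            return proxyAddress.equals(targetAddress) ? proxyToken : rmToken;
        }
    }

    public static void main(String[] args) {
        // Placeholder tokens and addresses, chosen only for this example.
        TokenPair tokens = new TokenPair("local-amrm-token", "rm-amrm-token", "localhost:8049");
        System.out.println(tokens.selectFor("localhost:8049")); // AM talks to the AMRMProxy
        System.out.println(tokens.selectFor("rm-host:8030"));   // AM talks to the RM directly
    }
}
```

Keeping both tokens and deciding per connection means the same AM can work whether or not its client goes through the proxy, which is the property a rolling upgrade needs. How the real patch identifies the target endpoint may differ from the simple address comparison assumed here.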



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: yarn-issues-unsubscribe@hadoop.apache.org
For additional commands, e-mail: yarn-issues-help@hadoop.apache.org