You are viewing a plain text version of this content. The canonical link for it is here.
Posted to yarn-issues@hadoop.apache.org by "Jian He (JIRA)" <ji...@apache.org> on 2014/06/25 06:59:26 UTC

[jira] [Commented] (YARN-1366) AM should implement Resync with the ApplicationMasterService instead of shutting down

    [ https://issues.apache.org/jira/browse/YARN-1366?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14043044#comment-14043044 ] 

Jian He commented on YARN-1366:
-------------------------------

Hi [~rohithsharma],  start looking at the patch. it''s been a while,  Mind updating the patch please ? thanks!

some comment in the meanwhile: 
Skim through testAMRMClientResendsRequestsOnRMRestart, can you add comment above the test method to explain the steps involved?
Ideally, the test should be 1.AMRMClient allocate some containers, 2. RM restarted before containers are allocated to AM. 3. On RM restart, previous container requests are automatically re-sent by AMRMClient on re-register. 4. assert the containers are allocated by the new RM. similarly for releaseList, blacklist.

> AM should implement Resync with the ApplicationMasterService instead of shutting down
> -------------------------------------------------------------------------------------
>
>                 Key: YARN-1366
>                 URL: https://issues.apache.org/jira/browse/YARN-1366
>             Project: Hadoop YARN
>          Issue Type: Sub-task
>          Components: resourcemanager
>            Reporter: Bikas Saha
>            Assignee: Rohith
>         Attachments: YARN-1366.1.patch, YARN-1366.2.patch, YARN-1366.3.patch, YARN-1366.4.patch, YARN-1366.patch, YARN-1366.prototype.patch, YARN-1366.prototype.patch
>
>
> The ApplicationMasterService currently sends a resync response to which the AM responds by shutting down. The AM behavior is expected to change to calling resyncing with the RM. Resync means resetting the allocate RPC sequence number to 0 and the AM should send its entire outstanding request to the RM. Note that if the AM is making its first allocate call to the RM then things should proceed like normal without needing a resync. The RM will return all containers that have completed since the RM last synced with the AM. Some container completions may be reported more than once.



--
This message was sent by Atlassian JIRA
(v6.2#6252)