Posted to yarn-issues@hadoop.apache.org by "Bibin A Chundatt (JIRA)" <ji...@apache.org> on 2018/07/23 06:38:00 UTC

[jira] [Comment Edited] (YARN-8558) NM recovery level db not cleaned up properly on container finish

    [ https://issues.apache.org/jira/browse/YARN-8558?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16552393#comment-16552393 ] 

Bibin A Chundatt edited comment on YARN-8558 at 7/23/18 6:37 AM:
-----------------------------------------------------------------

Thank you [~sunilg]

Per secret manager there can be only one current and one previous master key.
The same path is updated at each expiry interval.

These keys are not based on container ID. IIUC, there is no need to handle them.
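
To illustrate the point about the master keys, here is a minimal sketch. It is not the actual NMLeveldbStateStoreService code: it assumes a sorted in-memory map standing in for leveldb, and the key names are hypothetical. It only shows that overwriting two fixed paths keeps the entry count constant across any number of key rollovers.

```java
import java.util.TreeMap;

// Toy stand-in for the NM state store: a sorted in-memory map, not leveldb.
class MasterKeySketch {
    // Hypothetical key names, chosen only for this illustration.
    static final String CURRENT_KEY = "tokens/CurrentMasterKey";
    static final String PREVIOUS_KEY = "tokens/PreviousMasterKey";

    final TreeMap<String, Integer> store = new TreeMap<>();

    // On rollover the old current key moves to the previous slot and the
    // new key overwrites the current slot. No new path is created.
    void rollMasterKey(int newKeyId) {
        Integer old = store.get(CURRENT_KEY);
        if (old != null) {
            store.put(PREVIOUS_KEY, old);
        }
        store.put(CURRENT_KEY, newKeyId);
    }

    public static void main(String[] args) {
        MasterKeySketch sketch = new MasterKeySketch();
        for (int id = 1; id <= 1000; id++) {
            sketch.rollMasterKey(id);
        }
        // Only the two fixed paths exist, however many rollovers happened.
        System.out.println("entries: " + sketch.store.size());
    }
}
```

Since the entry count stays bounded, nothing accumulates here the way per-container records can.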


was (Author: bibinchundatt):
Thank you [~sunilg]

Per secret manager there can be only one current and one previous master key.
These keys are not based on container. IIUC, there is no need to handle them.

> NM recovery level db not cleaned up properly on container finish
> ----------------------------------------------------------------
>
>                 Key: YARN-8558
>                 URL: https://issues.apache.org/jira/browse/YARN-8558
>             Project: Hadoop YARN
>          Issue Type: Bug
>    Affects Versions: 3.0.0, 3.1.0
>            Reporter: Bibin A Chundatt
>            Assignee: Bibin A Chundatt
>            Priority: Critical
>         Attachments: YARN-8558.001.patch
>
>
> {code}
> 2018-07-20 16:49:23,117 INFO org.apache.hadoop.yarn.server.nodemanager.containermanager.application.ApplicationImpl: Application application_1531994217928_0054 transitioned from NEW to INITING
> 2018-07-20 16:49:23,204 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_000018 with incomplete records
> 2018-07-20 16:49:23,204 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_000019 with incomplete records
> 2018-07-20 16:49:23,204 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_000020 with incomplete records
> 2018-07-20 16:49:23,205 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_000021 with incomplete records
> 2018-07-20 16:49:23,205 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_000022 with incomplete records
> 2018-07-20 16:49:23,205 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_000023 with incomplete records
> 2018-07-20 16:49:23,205 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_000024 with incomplete records
> 2018-07-20 16:49:23,205 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_000025 with incomplete records
> 2018-07-20 16:49:23,205 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_000038 with incomplete records
> 2018-07-20 16:49:23,205 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_000039 with incomplete records
> 2018-07-20 16:49:23,206 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_000041 with incomplete records
> 2018-07-20 16:49:23,206 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_000044 with incomplete records
> 2018-07-20 16:49:23,206 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_000046 with incomplete records
> 2018-07-20 16:49:23,206 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_000049 with incomplete records
> 2018-07-20 16:49:23,206 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_000052 with incomplete records
> 2018-07-20 16:49:23,206 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_000054 with incomplete records
> 2018-07-20 16:49:23,206 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_000073 with incomplete records
> 2018-07-20 16:49:23,207 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_000074 with incomplete records
> 2018-07-20 16:49:23,207 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_000075 with incomplete records
> 2018-07-20 16:49:23,207 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_000078 with incomplete records
> 2018-07-20 16:49:23,207 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_000079 with incomplete records
> 2018-07-20 16:49:23,207 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_000082 with incomplete records
> 2018-07-20 16:49:23,207 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_000083 with incomplete records
> 2018-07-20 16:49:23,207 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_000085 with incomplete records
> 2018-07-20 16:49:23,208 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_1099511627738 with incomplete records
> 2018-07-20 16:49:23,208 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_1099511627742 with incomplete records
> 2018-07-20 16:49:23,208 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_1099511627746 with incomplete records
> 2018-07-20 16:49:23,208 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_1099511627749 with incomplete records
> 2018-07-20 16:49:23,208 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_1099511627753 with incomplete records
> 2018-07-20 16:49:23,208 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_1099511627757 with incomplete records
> 2018-07-20 16:49:23,208 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_1099511627761 with incomplete records
> 2018-07-20 16:49:23,209 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_1099511627765 with incomplete records
> 2018-07-20 16:49:23,209 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_1099511627769 with incomplete records
> 2018-07-20 16:49:23,209 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_1099511627773 with incomplete records
> 2018-07-20 16:49:23,210 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0002_01_1099511627679 with incomplete records
> 2018-07-20 16:49:23,210 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0002_01_1099511627681 with incomplete records
> 2018-07-20 16:49:23,210 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0002_01_1099511627684 with incomplete records
> 2018-07-20 16:49:23,210 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0002_01_1099511627690 with incomplete records
> 2018-07-20 16:49:23,210 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0002_01_1099511627695 with incomplete records
> 2018-07-20 16:49:23,210 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0002_01_1099511627696 with incomplete records
> 2018-07-20 16:49:23,210 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0002_01_1099511627702 with incomplete records
> 2018-07-20 16:49:23,210 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0002_01_1099511627706 with incomplete records
> 2018-07-20 16:49:23,210 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0002_01_1099511627710 with incomplete records
> 2018-07-20 16:49:23,211 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0002_01_1099511627712 with incomplete records
> {code}
> The NM state store size could keep growing in long-running scenarios, and recovery could become slow.
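
A rough sketch of why leftover per-container records make the store grow, and what a full cleanup on container finish looks like. This is an illustration only, not the code in the attached patch: the map stands in for leveldb, and the key layout and field names are hypothetical.

```java
import java.util.TreeMap;

// Toy stand-in for the NM state store: a sorted in-memory map, not leveldb.
class ContainerStoreSketch {
    // Hypothetical key layout: <PREFIX><containerId>/<field>
    static final String PREFIX = "containers/";

    final TreeMap<String, String> store = new TreeMap<>();

    // Each container writes several records under its own prefix.
    void recordContainer(String containerId) {
        store.put(PREFIX + containerId + "/request", "...");
        store.put(PREFIX + containerId + "/launched", "");
        store.put(PREFIX + containerId + "/exitcode", "0");
    }

    // Remove every record under the container's prefix. If any field is
    // missed, it stays in the store and has to be scanned again on recovery.
    void removeContainer(String containerId) {
        String from = PREFIX + containerId + "/";
        // '0' is the character right after '/', so [from, to) covers
        // exactly the keys that start with "<PREFIX><containerId>/".
        String to = PREFIX + containerId + "0";
        store.subMap(from, to).clear();
    }

    public static void main(String[] args) {
        ContainerStoreSketch sketch = new ContainerStoreSketch();
        sketch.recordContainer("container_1531994217928_0001_01_000018");
        sketch.recordContainer("container_1531994217928_0001_01_000019");
        sketch.removeContainer("container_1531994217928_0001_01_000018");
        System.out.println("entries left: " + sketch.store.size());
    }
}
```

Deleting by key range rather than field by field means a newly added field under the container prefix is also removed without further changes.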



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
