You are viewing a plain text version of this content. The canonical link for it is here.
Posted to yarn-issues@hadoop.apache.org by "Bibin A Chundatt (JIRA)" <ji...@apache.org> on 2018/07/20 12:01:00 UTC

[jira] [Commented] (YARN-8558) NM recovery level db not cleaned up properly on container finish

    [ https://issues.apache.org/jira/browse/YARN-8558?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16550703#comment-16550703 ] 

Bibin A Chundatt commented on YARN-8558:
----------------------------------------

On container removal the following are missing . {{CONTAINER_START_TIME_KEY_SUFFIX}} is creating  the major  problem. Gets added on every container start call.
{code}
CONTAINER_START_TIME_KEY_SUFFIX
CONTAINER_VERSION_KEY_SUFFIX
CONTAINER_REMAIN_RETRIES_KEY_SUFFIX
CONTAINER_RESTART_TIMES_SUFFIX
CONTAINER_WORK_DIR_KEY_SUFFIX
CONTAINER_LOG_DIR_KEY_SUFFIX
{code}

> NM recovery level db not cleaned up properly on container finish
> ----------------------------------------------------------------
>
>                 Key: YARN-8558
>                 URL: https://issues.apache.org/jira/browse/YARN-8558
>             Project: Hadoop YARN
>          Issue Type: Bug
>    Affects Versions: 3.0.0, 3.1.0
>            Reporter: Bibin A Chundatt
>            Assignee: Bibin A Chundatt
>            Priority: Critical
>
> {code}
> 2018-07-20 16:49:23,117 INFO org.apache.hadoop.yarn.server.nodemanager.containermanager.application.ApplicationImpl: Application application_1531994217928_0054 transitioned from NEW to INITING
> 2018-07-20 16:49:23,204 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_000018 with incomplete records
> 2018-07-20 16:49:23,204 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_000019 with incomplete records
> 2018-07-20 16:49:23,204 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_000020 with incomplete records
> 2018-07-20 16:49:23,205 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_000021 with incomplete records
> 2018-07-20 16:49:23,205 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_000022 with incomplete records
> 2018-07-20 16:49:23,205 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_000023 with incomplete records
> 2018-07-20 16:49:23,205 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_000024 with incomplete records
> 2018-07-20 16:49:23,205 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_000025 with incomplete records
> 2018-07-20 16:49:23,205 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_000038 with incomplete records
> 2018-07-20 16:49:23,205 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_000039 with incomplete records
> 2018-07-20 16:49:23,206 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_000041 with incomplete records
> 2018-07-20 16:49:23,206 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_000044 with incomplete records
> 2018-07-20 16:49:23,206 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_000046 with incomplete records
> 2018-07-20 16:49:23,206 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_000049 with incomplete records
> 2018-07-20 16:49:23,206 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_000052 with incomplete records
> 2018-07-20 16:49:23,206 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_000054 with incomplete records
> 2018-07-20 16:49:23,206 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_000073 with incomplete records
> 2018-07-20 16:49:23,207 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_000074 with incomplete records
> 2018-07-20 16:49:23,207 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_000075 with incomplete records
> 2018-07-20 16:49:23,207 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_000078 with incomplete records
> 2018-07-20 16:49:23,207 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_000079 with incomplete records
> 2018-07-20 16:49:23,207 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_000082 with incomplete records
> 2018-07-20 16:49:23,207 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_000083 with incomplete records
> 2018-07-20 16:49:23,207 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_000085 with incomplete records
> 2018-07-20 16:49:23,208 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_1099511627738 with incomplete records
> 2018-07-20 16:49:23,208 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_1099511627742 with incomplete records
> 2018-07-20 16:49:23,208 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_1099511627746 with incomplete records
> 2018-07-20 16:49:23,208 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_1099511627749 with incomplete records
> 2018-07-20 16:49:23,208 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_1099511627753 with incomplete records
> 2018-07-20 16:49:23,208 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_1099511627757 with incomplete records
> 2018-07-20 16:49:23,208 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_1099511627761 with incomplete records
> 2018-07-20 16:49:23,209 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_1099511627765 with incomplete records
> 2018-07-20 16:49:23,209 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_1099511627769 with incomplete records
> 2018-07-20 16:49:23,209 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_1099511627773 with incomplete records
> 2018-07-20 16:49:23,210 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0002_01_1099511627679 with incomplete records
> 2018-07-20 16:49:23,210 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0002_01_1099511627681 with incomplete records
> 2018-07-20 16:49:23,210 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0002_01_1099511627684 with incomplete records
> 2018-07-20 16:49:23,210 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0002_01_1099511627690 with incomplete records
> 2018-07-20 16:49:23,210 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0002_01_1099511627695 with incomplete records
> 2018-07-20 16:49:23,210 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0002_01_1099511627696 with incomplete records
> 2018-07-20 16:49:23,210 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0002_01_1099511627702 with incomplete records
> 2018-07-20 16:49:23,210 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0002_01_1099511627706 with incomplete records
> 2018-07-20 16:49:23,210 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0002_01_1099511627710 with incomplete records
> 2018-07-20 16:49:23,211 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0002_01_1099511627712 with incomplete records
> {code}
> NM state store size could increase in long running scenarios, and recovery could be slow



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: yarn-issues-unsubscribe@hadoop.apache.org
For additional commands, e-mail: yarn-issues-help@hadoop.apache.org