Posted to yarn-dev@hadoop.apache.org by "Prabhu Joseph (Jira)" <ji...@apache.org> on 2023/01/11 18:18:00 UTC

[jira] [Created] (YARN-11417) RM Crashes when changing Node Label of a Node in Distributed Configuration

Prabhu Joseph created YARN-11417:
------------------------------------

             Summary: RM Crashes when changing Node Label of a Node in Distributed Configuration
                 Key: YARN-11417
                 URL: https://issues.apache.org/jira/browse/YARN-11417
             Project: Hadoop YARN
          Issue Type: Bug
          Components: resourcemanager
    Affects Versions: 3.3.3
            Reporter: Prabhu Joseph
            Assignee: Prabhu Joseph


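The ResourceManager (RM) crashes with a NullPointerException while handling a NODE_REMOVED event after the node label of a node is changed under the distributed node label configuration.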

{code}
2023-01-11 16:25:50,986 ERROR org.apache.hadoop.yarn.event.EventDispatcher (SchedulerEventDispatcher:Event Processor): Error in handling event type NODE_REMOVED to the Event Dispatcher
java.lang.NullPointerException
        at org.apache.hadoop.yarn.server.resourcemanager.scheduler.ClusterNodeTracker.removeNode(ClusterNodeTracker.java:194)
        at org.apache.hadoop.yarn.server.resourcemanager.scheduler.capacity.CapacityScheduler.removeNode(CapacityScheduler.java:2145)
        at org.apache.hadoop.yarn.server.resourcemanager.scheduler.capacity.CapacityScheduler.handle(CapacityScheduler.java:1833)
        at org.apache.hadoop.yarn.server.resourcemanager.scheduler.capacity.CapacityScheduler.handle(CapacityScheduler.java:171)
        at org.apache.hadoop.yarn.event.EventDispatcher$EventProcessor.run(EventDispatcher.java:83)
        at java.lang.Thread.run(Thread.java:750)

{code}


*Repro*

1. Start two NodeManagers configured with the CORE node label:

{code}
yarn.nodemanager.node-labels.provider.configured-node-partition = CORE
yarn.node-labels.enabled = true
yarn.node-labels.configuration-type = distributed
yarn.nodemanager.node-labels.provider = config
{code}

2. Change the node label of one of the nodes to TASK and restart that NodeManager.
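
The stack trace points at a removal path in ClusterNodeTracker, but this report does not identify the exact null reference. As an illustration only, the sketch below shows one way such a failure can arise: a tracker that indexes nodes by partition and looks up the node's current partition on removal. All class, field, and method names here (LabelIndexSketch, Node, removeNodeUnsafe, removeNodeSafe) are hypothetical and do not reflect the real YARN code. The assumption that the label changes before the removal is processed is a guess, not a confirmed root cause.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Hypothetical model of a per-partition node index; NOT the real ClusterNodeTracker.
class LabelIndexSketch {

    // Minimal stand-in for a scheduler node whose partition (label) can change.
    static final class Node {
        final String id;
        String partition;

        Node(String id, String partition) {
            this.id = id;
            this.partition = partition;
        }
    }

    // Index: partition name -> nodes registered under that partition.
    final Map<String, List<Node>> nodesPerPartition = new HashMap<>();

    void addNode(Node node) {
        nodesPerPartition
            .computeIfAbsent(node.partition, k -> new ArrayList<>())
            .add(node);
    }

    // Unsafe: assumes the node's current partition is still a key in the index.
    // Throws NullPointerException when no list exists for that partition.
    void removeNodeUnsafe(Node node) {
        nodesPerPartition.get(node.partition).remove(node);
    }

    // Defensive variant: returns false instead of throwing when the partition
    // has no entry. Note that it does not clean up the stale entry under the
    // old partition; a real fix would have to handle that as well.
    boolean removeNodeSafe(Node node) {
        List<Node> nodes = nodesPerPartition.get(node.partition);
        if (nodes == null) {
            return false;
        }
        return nodes.remove(node);
    }

    public static void main(String[] args) {
        LabelIndexSketch tracker = new LabelIndexSketch();
        Node nm1 = new Node("nm1", "CORE");
        Node nm2 = new Node("nm2", "CORE");
        tracker.addNode(nm1);
        tracker.addNode(nm2);

        // Assumed sequence: the label changes before the removal is handled.
        nm1.partition = "TASK";

        try {
            tracker.removeNodeUnsafe(nm1);
        } catch (NullPointerException e) {
            System.out.println("unsafe removal failed: no index entry for " + nm1.partition);
        }
        System.out.println("safe removal result: " + tracker.removeNodeSafe(nm1));
    }
}
```

In this model the index is keyed by the label the node had at registration time, so a lookup by the new label finds nothing. Whether the actual crash follows this pattern would need to be checked against ClusterNodeTracker.java:194 in the affected version.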

 


