You are viewing a plain text version of this content. The canonical link for it is here.
Posted to yarn-issues@hadoop.apache.org by "Yesha Vora (JIRA)" <ji...@apache.org> on 2018/10/17 22:36:00 UTC

[jira] [Created] (YARN-8901) Restart "NEVER" policy does not work with component dependency

Yesha Vora created YARN-8901:
--------------------------------

             Summary: Restart "NEVER" policy does not work with component dependency
                 Key: YARN-8901
                 URL: https://issues.apache.org/jira/browse/YARN-8901
             Project: Hadoop YARN
          Issue Type: Bug
            Reporter: Yesha Vora


Scenario:

1) Launch an application with two components. master and worker. Here, worker is dependent on master. ( Worker should be launched only after master is launched )
2) Set restart_policy = NEVER for both master and worker. 

{code:title=sample launch.json}
{
	"name": "mawo-hadoop-ut",
        "artifact": {
                "type": "DOCKER",
                "id": "xxx"
        },
        "configuration": {
                "env": {
                       "YARN_CONTAINER_RUNTIME_DOCKER_CONTAINER_NETWORK": "hadoop"
                 },
                "properties": {
                       "docker.network": "hadoop"
                }
        },
	"components": [{
		"dependencies": [],
		"resource": {
			"memory": "2048",
			"cpus": "1"
		},
		"name": "master",
                "run_privileged_container": true,
		"number_of_containers": 1,
		"launch_command": "start master",
                "restart_policy": "NEVER",
	}, {
		"dependencies": ["master"],
		"resource": {
			"memory": "8072",
			"cpus": "1"
		},
		"name": "worker",
                "run_privileged_container": true,
		"number_of_containers": 10,
		"launch_command": "start worker",
                "restart_policy": "NEVER",
	}],
	"lifetime": -1,
	"version": 1.0
}{code}

When restart policy is selected to NEVER, AM never launches Worker component. It get stuck with below message. 
{code}
2018-10-17 15:11:58,560 [Component  dispatcher] INFO  component.Component - [COMPONENT master] Transitioned from FLEXING to STABLE on CHECK_STABLE event.
2018-10-17 15:11:58,560 [pool-7-thread-1] INFO  instance.ComponentInstance - [COMPINSTANCE master-0 : container_e41_1539027682947_0020_01_000002] Transitioned from STARTED to READY on BECOME_READY event
2018-10-17 15:11:58,560 [pool-7-thread-1] INFO  component.Component - [COMPONENT worker]: Dependency master not satisfied, only 1 of 1 instances are ready or the dependent component has not completed 
2018-10-17 15:12:28,556 [pool-7-thread-1] INFO  component.Component - [COMPONENT worker]: Dependency master not satisfied, only 1 of 1 instances are ready or the dependent component has not completed 
2018-10-17 15:12:58,556 [pool-7-thread-1] INFO  component.Component - [COMPONENT worker]: Dependency master not satisfied, only 1 of 1 instances are ready or the dependent component has not completed 
2018-10-17 15:13:28,556 [pool-7-thread-1] INFO  component.Component - [COMPONENT worker]: Dependency master not satisfied, only 1 of 1 instances are ready or the dependent component has not completed 
2018-10-17 15:13:58,556 [pool-7-thread-1] INFO  component.Component - [COMPONENT worker]: Dependency master not satisfied, only 1 of 1 instances are ready or the dependent component has not completed 
2018-10-17 15:14:28,556 [pool-7-thread-1] INFO  component.Component - [COMPONENT worker]: Dependency master not satisfied, only 1 of 1 instances are ready or the dependent component has not completed {code}

'NEVER' restart policy expects master component to be finished before starting workers. Master component can not finish the job without workers. Thus, it create a deadlock.

The logic for 'NEVER' restart policy should be fixed to allow worker components to be launched as soon as master component is in READY state. 



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: yarn-issues-unsubscribe@hadoop.apache.org
For additional commands, e-mail: yarn-issues-help@hadoop.apache.org