You are viewing a plain text version of this content. The canonical link for it is here.
Posted to yarn-dev@hadoop.apache.org by "fox (JIRA)" <ji...@apache.org> on 2018/08/23 03:00:00 UTC

[jira] [Created] (YARN-8700) Application cannot un-registered

fox created YARN-8700:
-------------------------

             Summary: Application cannot un-registered
                 Key: YARN-8700
                 URL: https://issues.apache.org/jira/browse/YARN-8700
             Project: Hadoop YARN
          Issue Type: Bug
          Components: resourcemanager
    Affects Versions: 2.7.3
            Reporter: fox


Dear all, 

I found a problem with application unregistration in AWS EMR environment (emr-5.8.0, hadoop 2.7.3, spark 2.2.0). 

Application Type: Both Yarn and Spark

State: RUNNING

Inside the job logs, I got 

07:00:07.190 [main] INFO c.w.c.e.a.n.b.AbstractNormalBatchMain - [EDP2] Ready to run Tear Down
07:00:07.192 [main] INFO c.w.c.e.a.n.b.AbstractNormalBatchMain - [EDP2] Ready to run Tear Down
07:00:07.192 [main] INFO c.w.c.e.a.n.b.AbstractNormalBatchMain - [EDP2] Job Finish
07:00:07.195 [main] INFO o.s.c.a.AnnotationConfigApplicationContext - Closing org.springframework.context.annotation.AnnotationConfigApplicationContext@144ab54: startup date [Tue Aug 21 06:59:23 UTC 2018]; root of context hierarchy
07:00:07.306 [main] INFO o.s.s.c.ThreadPoolTaskExecutor - Shutting down ExecutorService 'redisClusterExecutor'
07:00:07.551 [main] INFO o.a.k.clients.producer.KafkaProducer - Closing the Kafka producer with timeoutMillis = 9223372036854775807 ms.
07:00:07.565 [main] INFO c.w.c.f.m.MessageQueueKafkaProducerImpl - Closed all the producer's connections for tenant: 7fd0356c-1258-11e8-abfd-0242ac110002.
07:00:09.869 [main] INFO c.w.c.edp2.normal.batch.AppMaster - finish run main method
07:00:09.870 [main] INFO c.w.c.edp2.normal.batch.AppMaster - delete temp file /tmp/aa33f388-f591-44a8-9aa3-13e2f8427c5d2802069659156113885.jar
07:00:10.112 [main] INFO o.a.h.y.c.api.impl.AMRMClientImpl - Waiting for application to be successfully unregistered.
07:00:10.215 [main] INFO o.a.h.y.c.api.impl.AMRMClientImpl - Waiting for application to be successfully unregistered.
07:00:10.319 [main] INFO o.a.h.y.c.api.impl.AMRMClientImpl - Waiting for application to be successfully unregistered.
07:00:10.422 [main] INFO o.a.h.y.c.api.impl.AMRMClientImpl - Waiting for application to be successfully unregistered.
07:00:10.528 [main] INFO o.a.h.y.c.api.impl.AMRMClientImpl - Waiting for application to be successfully unregistered.

 

and it keeps more than one day until I stopped the whole cluster. 

I also try to kill the application by yarn command, which also keeps forever waiting for application to be killed. 

hadoop@ip-10-100-2-124 ~]$ yarn application -kill application_1534810852740_0721
18/08/22 12:24:28 INFO impl.TimelineClientImpl: Timeline service address: http://ip-10-100-2-124.ap-northeast-1.compute.internal:8188/ws/v1/timeline/
18/08/22 12:24:29 INFO client.RMProxy: Connecting to ResourceManager at ip-10-100-2-124.ap-northeast-1.compute.internal/10.100.2.124:8032
Killing application application_1534810852740_0721
18/08/22 12:24:32 INFO impl.YarnClientImpl: Waiting for application application_1534810852740_0721 to be killed.
18/08/22 12:24:34 INFO impl.YarnClientImpl: Waiting for application application_1534810852740_0721 to be killed.
18/08/22 12:24:36 INFO impl.YarnClientImpl: Waiting for application application_1534810852740_0721 to be killed.

 



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: yarn-dev-unsubscribe@hadoop.apache.org
For additional commands, e-mail: yarn-dev-help@hadoop.apache.org