You are viewing a plain text version of this content. The canonical link for it is here.
Posted to yarn-issues@hadoop.apache.org by "fox (JIRA)" <ji...@apache.org> on 2018/08/27 02:37:00 UTC

[jira] [Resolved] (YARN-8700) Application cannot un-registered

     [ https://issues.apache.org/jira/browse/YARN-8700?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

fox resolved YARN-8700.
-----------------------
    Resolution: Fixed

> Application cannot un-registered
> --------------------------------
>
>                 Key: YARN-8700
>                 URL: https://issues.apache.org/jira/browse/YARN-8700
>             Project: Hadoop YARN
>          Issue Type: Bug
>          Components: resourcemanager
>    Affects Versions: 2.7.3
>            Reporter: fox
>            Priority: Major
>
> Dear all, 
> I found a problem with application unregistration in AWS EMR environment (emr-5.8.0, hadoop 2.7.3, spark 2.2.0). 
> Application Type: Both Yarn and Spark
> State: RUNNING
> Inside the job logs, I got 
> 07:00:07.190 [main] INFO c.w.c.e.a.n.b.AbstractNormalBatchMain - [EDP2] Ready to run Tear Down
> 07:00:07.192 [main] INFO c.w.c.e.a.n.b.AbstractNormalBatchMain - [EDP2] Ready to run Tear Down
> 07:00:07.192 [main] INFO c.w.c.e.a.n.b.AbstractNormalBatchMain - [EDP2] Job Finish
> 07:00:07.195 [main] INFO o.s.c.a.AnnotationConfigApplicationContext - Closing org.springframework.context.annotation.AnnotationConfigApplicationContext@144ab54: startup date [Tue Aug 21 06:59:23 UTC 2018]; root of context hierarchy
> 07:00:07.306 [main] INFO o.s.s.c.ThreadPoolTaskExecutor - Shutting down ExecutorService 'redisClusterExecutor'
> 07:00:07.551 [main] INFO o.a.k.clients.producer.KafkaProducer - Closing the Kafka producer with timeoutMillis = 9223372036854775807 ms.
> 07:00:07.565 [main] INFO c.w.c.f.m.MessageQueueKafkaProducerImpl - Closed all the producer's connections for tenant: 7fd0356c-1258-11e8-abfd-0242ac110002.
> 07:00:09.869 [main] INFO c.w.c.edp2.normal.batch.AppMaster - finish run main method
> 07:00:09.870 [main] INFO c.w.c.edp2.normal.batch.AppMaster - delete temp file /tmp/aa33f388-f591-44a8-9aa3-13e2f8427c5d2802069659156113885.jar
> 07:00:10.112 [main] INFO o.a.h.y.c.api.impl.AMRMClientImpl - Waiting for application to be successfully unregistered.
> 07:00:10.215 [main] INFO o.a.h.y.c.api.impl.AMRMClientImpl - Waiting for application to be successfully unregistered.
> 07:00:10.319 [main] INFO o.a.h.y.c.api.impl.AMRMClientImpl - Waiting for application to be successfully unregistered.
> 07:00:10.422 [main] INFO o.a.h.y.c.api.impl.AMRMClientImpl - Waiting for application to be successfully unregistered.
> 07:00:10.528 [main] INFO o.a.h.y.c.api.impl.AMRMClientImpl - Waiting for application to be successfully unregistered.
>  
> and it keeps more than one day until I stopped the whole cluster. 
> I also try to kill the application by yarn command, which also keeps forever waiting for application to be killed. 
> hadoop@ip-10-100-2-124 ~]$ yarn application -kill application_1534810852740_0721
> 18/08/22 12:24:28 INFO impl.TimelineClientImpl: Timeline service address: http://ip-10-100-2-124.ap-northeast-1.compute.internal:8188/ws/v1/timeline/
> 18/08/22 12:24:29 INFO client.RMProxy: Connecting to ResourceManager at ip-10-100-2-124.ap-northeast-1.compute.internal/10.100.2.124:8032
> Killing application application_1534810852740_0721
> 18/08/22 12:24:32 INFO impl.YarnClientImpl: Waiting for application application_1534810852740_0721 to be killed.
> 18/08/22 12:24:34 INFO impl.YarnClientImpl: Waiting for application application_1534810852740_0721 to be killed.
> 18/08/22 12:24:36 INFO impl.YarnClientImpl: Waiting for application application_1534810852740_0721 to be killed.
>  



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: yarn-issues-unsubscribe@hadoop.apache.org
For additional commands, e-mail: yarn-issues-help@hadoop.apache.org