You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@flink.apache.org by Anchit Jatana <de...@gmail.com> on 2016/11/10 01:01:46 UTC

Flink 1.1.3 | Shutting down YarnClusterClient from the client shutdown hook | happening frequently

Hi All,

I'm running my flink application on YARN. It's frequently getting suspended,
though gracefully. Below is the snippet of the error, attaching full
jobmanager log to help debug. Please help me identify the cause and resolve
the issue.

Thank you

Regards,
Anchit

Error snippet:

2016-11-09 03:15:01,238 INFO  org.apache.flink.yarn.YarnClusterClient                      
- Shutting down YarnClusterClient from the client shutdown hook
2016-11-09 03:15:01,242 INFO  org.apache.flink.yarn.YarnClusterClient                      
- Sending shutdown request to the Application Master
2016-11-09 03:15:01,248 INFO  org.apache.flink.yarn.YarnClusterClient                      
- Start application client.
2016-11-09 03:15:01,260 INFO  org.apache.flink.yarn.ApplicationClient                      
- Notification about new leader address
akka.tcp://flink@10.60.200.106:36465/user/jobmanager with session ID null.
2016-11-09 03:15:01,263 INFO  org.apache.flink.yarn.ApplicationClient                      
- Sending StopCluster request to JobManager.
2016-11-09 03:15:01,264 INFO  org.apache.flink.yarn.ApplicationClient                      
- Received address of new leader
akka.tcp://flink@10.60.200.106:36465/user/jobmanager with session ID null.
2016-11-09 03:15:01,265 INFO  org.apache.flink.yarn.ApplicationClient                      
- Disconnect from JobManager null.
2016-11-09 03:15:01,269 INFO  org.apache.flink.yarn.ApplicationClient                      
- Trying to register at JobManager
akka.tcp://flink@10.60.200.106:36465/user/jobmanager.
2016-11-09 03:15:01,279 INFO  org.apache.flink.yarn.ApplicationClient                      
- Successfully registered at the ResourceManager using JobManager
Actor[akka.tcp://flink@10.60.200.106:36465/user/jobmanager#918758350]
2016-11-09 03:15:02,282 INFO  org.apache.flink.yarn.ApplicationClient                      
- Sending StopCluster request to JobManager.
2016-11-09 03:15:02,295 INFO  org.apache.flink.yarn.YarnClusterClient                      
- Deleting files in
hdfs://ldnsns/user/a12345/.flink/application_1478099802210_11790
2016-11-09 03:15:02,319 INFO  org.apache.flink.runtime.client.JobClientActor               
- 11/09/2016 03:15:02	Job execution switched to status SUSPENDED.





--
View this message in context: http://apache-flink-user-mailing-list-archive.2336050.n4.nabble.com/Flink-1-1-3-Shutting-down-YarnClusterClient-from-the-client-shutdown-hook-happening-frequently-tp10019.html
Sent from the Apache Flink User Mailing List archive. mailing list archive at Nabble.com.

Re: Flink 1.1.3 | Shutting down YarnClusterClient from the client shutdown hook | happening frequently

Posted by Till Rohrmann <tr...@apache.org>.
HI Anchit,

the ML does not support attachments. To understand your problem, the JM
logs would be really helpful.

Cheers,
Till

On Thu, Nov 10, 2016 at 2:01 AM, Anchit Jatana <development.anchit@gmail.com
> wrote:

> Hi All,
>
> I'm running my flink application on YARN. It's frequently getting
> suspended,
> though gracefully. Below is the snippet of the error, attaching full
> jobmanager log to help debug. Please help me identify the cause and resolve
> the issue.
>
> Thank you
>
> Regards,
> Anchit
>
> Error snippet:
>
> 2016-11-09 03:15:01,238 INFO  org.apache.flink.yarn.YarnClusterClient
> - Shutting down YarnClusterClient from the client shutdown hook
> 2016-11-09 03:15:01,242 INFO  org.apache.flink.yarn.YarnClusterClient
> - Sending shutdown request to the Application Master
> 2016-11-09 03:15:01,248 INFO  org.apache.flink.yarn.YarnClusterClient
> - Start application client.
> 2016-11-09 03:15:01,260 INFO  org.apache.flink.yarn.ApplicationClient
> - Notification about new leader address
> akka.tcp://flink@10.60.200.106:36465/user/jobmanager with session ID null.
> 2016-11-09 03:15:01,263 INFO  org.apache.flink.yarn.ApplicationClient
> - Sending StopCluster request to JobManager.
> 2016-11-09 03:15:01,264 INFO  org.apache.flink.yarn.ApplicationClient
> - Received address of new leader
> akka.tcp://flink@10.60.200.106:36465/user/jobmanager with session ID null.
> 2016-11-09 03:15:01,265 INFO  org.apache.flink.yarn.ApplicationClient
> - Disconnect from JobManager null.
> 2016-11-09 03:15:01,269 INFO  org.apache.flink.yarn.ApplicationClient
> - Trying to register at JobManager
> akka.tcp://flink@10.60.200.106:36465/user/jobmanager.
> 2016-11-09 03:15:01,279 INFO  org.apache.flink.yarn.ApplicationClient
> - Successfully registered at the ResourceManager using JobManager
> Actor[akka.tcp://flink@10.60.200.106:36465/user/jobmanager#918758350]
> 2016-11-09 03:15:02,282 INFO  org.apache.flink.yarn.ApplicationClient
> - Sending StopCluster request to JobManager.
> 2016-11-09 03:15:02,295 INFO  org.apache.flink.yarn.YarnClusterClient
> - Deleting files in
> hdfs://ldnsns/user/a12345/.flink/application_1478099802210_11790
> 2016-11-09 03:15:02,319 INFO  org.apache.flink.runtime.
> client.JobClientActor
> - 11/09/2016 03:15:02   Job execution switched to status SUSPENDED.
>
>
>
>
>
> --
> View this message in context: http://apache-flink-user-
> mailing-list-archive.2336050.n4.nabble.com/Flink-1-1-3-Shutting-down-
> YarnClusterClient-from-the-client-shutdown-hook-
> happening-frequently-tp10019.html
> Sent from the Apache Flink User Mailing List archive. mailing list archive
> at Nabble.com.
>