You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@flink.apache.org by Márton Balassi <ba...@gmail.com> on 2022/05/02 08:00:11 UTC
Re: flink operator sometimes cannot start jobmanager after upgrading
Hi ChangZhuo,
Thanks for reporting this, I think I have just run into this myself too.
Will try to reproduce it, but I do not fully comprehend it yet. If anyone
has a way to reproduce it is more than welcome. :-)
On Fri, Apr 29, 2022 at 12:16 PM ChangZhuo Chen (陳昌倬) <cz...@czchen.org>
wrote:
> Hi,
>
> We found that flink operator [0] sometimes cannot start jobmanager after
> upgrading FlinkDeployment. We need to recreate FlinkDeployment to fix
> the problem. Anyone has this issue?
>
> The following is redacted log from flink operator. After status becomes
> MISSING, it keeps in MISSING status for at least 15 minutes.
>
>
> 2022-04-29 09:41:15,141 o.a.f.c.d.a.c.ApplicationClusterDeployer [INFO
> ][namespace/flink-deployment-name] Submitting application in 'Application
> Mode'.
> 2022-04-29 09:41:15,145 o.a.f.r.u.c.m.ProcessMemoryUtils [INFO
> ][namespace/flink-deployment-name] The derived from fraction jvm overhead
> memory (2.400gb (2576980416 bytes)) is greater than its max value
> 1024.000mb (1073741824 bytes), max value will be used instead
> 2022-04-29 09:41:15,146 o.a.f.r.u.c.m.ProcessMemoryUtils [INFO
> ][namespace/flink-deployment-name] The derived from fraction jvm overhead
> memory (5.200gb (5583457568 bytes)) is greater than its max value
> 1024.000mb (1073741824 bytes), max value will be used instead
> 2022-04-29 09:41:15,146 o.a.f.r.u.c.m.ProcessMemoryUtils [INFO
> ][namespace/flink-deployment-name] The derived from fraction network memory
> (5.050gb (5422396292 bytes)) is greater than its max value 4.000gb
> (4294967296 bytes), max value will be used instead
> 2022-04-29 09:41:15,237 o.a.f.k.u.KubernetesUtils [INFO
> ][namespace/flink-deployment-name] Kubernetes deployment requires a fixed
> port. Configuration high-availability.jobmanager.port will be set to 6123
> 2022-04-29 09:41:15,508 o.a.f.k.KubernetesClusterDescriptor [WARN
> ][namespace/flink-deployment-name] Please note that Flink client
> operations(e.g. cancel, list, stop, savepoint, etc.) won't work from
> outside the Kubernetes cluster since 'kubernetes.rest-service.exposed.type'
> has been set to ClusterIP.
> 2022-04-29 09:41:15,508 o.a.f.k.KubernetesClusterDescriptor [INFO
> ][namespace/flink-deployment-name] Create flink application cluster
> flink-deployment-name successfully, JobManager Web Interface:
> http://flink-deployment-name.namespace:8081
> 2022-04-29 09:41:15,510 o.a.f.k.o.s.FlinkService [INFO
> ][namespace/flink-deployment-name] Application cluster successfully deployed
> 2022-04-29 09:41:15,583 o.a.f.k.o.c.FlinkDeploymentController [INFO
> ][namespace/flink-deployment-name] Reconciliation successfully completed
> 2022-04-29 09:41:15,684 o.a.f.k.o.c.FlinkDeploymentController [INFO
> ][namespace/flink-deployment-name] Starting reconciliation
> 2022-04-29 09:41:15,686 o.a.f.k.o.o.JobObserver [INFO
> ][namespace/flink-deployment-name] Observing JobManager deployment.
> Previous status: DEPLOYING
> 2022-04-29 09:41:15,792 o.a.f.k.o.o.JobObserver [INFO
> ][namespace/flink-deployment-name] JobManager is being deployed
> 2022-04-29 09:41:15,792 o.a.f.k.o.c.FlinkDeploymentController [INFO
> ][namespace/flink-deployment-name] Reconciliation successfully completed
> 2022-04-29 09:41:20,795 o.a.f.k.o.c.FlinkDeploymentController [INFO
> ][namespace/flink-deployment-name] Starting reconciliation
> 2022-04-29 09:41:20,797 o.a.f.k.o.o.JobObserver [INFO
> ][namespace/flink-deployment-name] Observing JobManager deployment.
> Previous status: DEPLOYING
> 2022-04-29 09:41:20,896 o.a.f.k.o.o.JobObserver [INFO
> ][namespace/flink-deployment-name] JobManager is being deployed
> 2022-04-29 09:41:20,897 o.a.f.k.o.c.FlinkDeploymentController [INFO
> ][namespace/flink-deployment-name] Reconciliation successfully completed
> 2022-04-29 09:41:25,899 o.a.f.k.o.c.FlinkDeploymentController [INFO
> ][namespace/flink-deployment-name] Starting reconciliation
> 2022-04-29 09:41:25,901 o.a.f.k.o.o.JobObserver [INFO
> ][namespace/flink-deployment-name] Observing JobManager deployment.
> Previous status: DEPLOYING
> 2022-04-29 09:41:25,997 o.a.f.k.o.o.JobObserver [INFO
> ][namespace/flink-deployment-name] JobManager is being deployed
> 2022-04-29 09:41:25,998 o.a.f.k.o.c.FlinkDeploymentController [INFO
> ][namespace/flink-deployment-name] Reconciliation successfully completed
> 2022-04-29 09:41:29,518 o.a.f.k.o.c.FlinkDeploymentController [INFO
> ][namespace/flink-deployment-name] Starting reconciliation
> 2022-04-29 09:41:29,520 o.a.f.k.o.o.JobObserver [INFO
> ][namespace/flink-deployment-name] Observing JobManager deployment.
> Previous status: DEPLOYING
> 2022-04-29 09:41:30,631 o.a.f.k.o.o.JobObserver [INFO
> ][namespace/flink-deployment-name] JobManager is being deployed
> 2022-04-29 09:41:30,631 o.a.f.k.o.c.FlinkDeploymentController [INFO
> ][namespace/flink-deployment-name] Reconciliation successfully completed
> 2022-04-29 09:41:35,639 o.a.f.k.o.c.FlinkDeploymentController [INFO
> ][namespace/flink-deployment-name] Starting reconciliation
> 2022-04-29 09:41:35,640 o.a.f.k.o.o.JobObserver [INFO
> ][namespace/flink-deployment-name] Observing JobManager deployment.
> Previous status: DEPLOYING
> 2022-04-29 09:41:35,756 o.a.f.k.o.o.JobObserver [INFO
> ][namespace/flink-deployment-name] JobManager is being deployed
> 2022-04-29 09:41:35,756 o.a.f.k.o.c.FlinkDeploymentController [INFO
> ][namespace/flink-deployment-name] Reconciliation successfully completed
> 2022-04-29 09:41:40,759 o.a.f.k.o.c.FlinkDeploymentController [INFO
> ][namespace/flink-deployment-name] Starting reconciliation
> 2022-04-29 09:41:40,760 o.a.f.k.o.o.JobObserver [INFO
> ][namespace/flink-deployment-name] Observing JobManager deployment.
> Previous status: DEPLOYING
> 2022-04-29 09:41:40,864 o.a.f.k.o.o.JobObserver [INFO
> ][namespace/flink-deployment-name] JobManager is being deployed
> 2022-04-29 09:41:40,864 o.a.f.k.o.c.FlinkDeploymentController [INFO
> ][namespace/flink-deployment-name] Reconciliation successfully completed
> 2022-04-29 09:41:45,867 o.a.f.k.o.c.FlinkDeploymentController [INFO
> ][namespace/flink-deployment-name] Starting reconciliation
> 2022-04-29 09:41:45,868 o.a.f.k.o.o.JobObserver [INFO
> ][namespace/flink-deployment-name] Observing JobManager deployment.
> Previous status: DEPLOYING
> 2022-04-29 09:41:45,870 o.a.f.k.o.o.JobObserver [INFO
> ][namespace/flink-deployment-name] JobManager deployment port is ready,
> waiting for the Flink REST API...
> 2022-04-29 09:41:45,870 o.a.f.k.o.c.FlinkDeploymentController [INFO
> ][namespace/flink-deployment-name] Reconciliation successfully completed
> 2022-04-29 09:41:55,901 o.a.f.k.o.c.FlinkDeploymentController [INFO
> ][namespace/flink-deployment-name] Starting reconciliation
> 2022-04-29 09:41:55,902 o.a.f.k.o.o.JobObserver [INFO
> ][namespace/flink-deployment-name] Observing JobManager deployment.
> Previous status: DEPLOYED_NOT_READY
> 2022-04-29 09:41:55,902 o.a.f.k.o.o.JobObserver [INFO
> ][namespace/flink-deployment-name] JobManager deployment is ready
> 2022-04-29 09:41:55,902 o.a.f.k.o.o.JobObserver [INFO
> ][namespace/flink-deployment-name] Observing job status
> 2022-04-29 09:41:56,294 o.a.f.k.o.o.JobObserver [INFO
> ][namespace/flink-deployment-name] No job found on cluster yet
> 2022-04-29 09:41:56,294 o.a.f.k.o.c.FlinkDeploymentController [INFO
> ][namespace/flink-deployment-name] Reconciliation successfully completed
> 2022-04-29 09:41:58,443 o.a.f.k.o.c.FlinkDeploymentController [INFO
> ][namespace/flink-deployment-name] Starting reconciliation
> 2022-04-29 09:41:58,445 o.a.f.k.o.o.JobObserver [INFO
> ][namespace/flink-deployment-name] Observing job status
> 2022-04-29 09:42:10,489 o.a.f.k.o.o.JobObserver
> [ERROR][namespace/flink-deployment-name] Exception while listing jobs
> 2022-04-29 09:42:10,489 o.a.f.k.o.o.JobObserver [INFO
> ][namespace/flink-deployment-name] Observing JobManager deployment.
> Previous status: READY
> 2022-04-29 09:42:10,489 o.a.f.k.o.o.JobObserver [INFO
> ][namespace/flink-deployment-name] JobManager deployment does not exist
> 2022-04-29 09:42:10,490 o.a.f.k.o.c.FlinkDeploymentController [INFO
> ][namespace/flink-deployment-name] Reconciliation successfully completed
> 2022-04-29 09:42:25,521 o.a.f.k.o.c.FlinkDeploymentController [INFO
> ][namespace/flink-deployment-name] Starting reconciliation
> 2022-04-29 09:42:25,522 o.a.f.k.o.o.JobObserver [INFO
> ][namespace/flink-deployment-name] Observing JobManager deployment.
> Previous status: MISSING
> 2022-04-29 09:42:25,522 o.a.f.k.o.o.JobObserver [INFO
> ][namespace/flink-deployment-name] JobManager deployment does not exist
> 2022-04-29 09:42:25,522 o.a.f.k.o.c.FlinkDeploymentController [INFO
> ][namespace/flink-deployment-name] Reconciliation successfully completed
> 2022-04-29 09:42:40,526 o.a.f.k.o.c.FlinkDeploymentController [INFO
> ][namespace/flink-deployment-name] Starting reconciliation
> 2022-04-29 09:42:40,527 o.a.f.k.o.o.JobObserver [INFO
> ][namespace/flink-deployment-name] Observing JobManager deployment.
> Previous status: MISSING
> 2022-04-29 09:42:40,527 o.a.f.k.o.o.JobObserver [INFO
> ][namespace/flink-deployment-name] JobManager deployment does not exist
> 2022-04-29 09:42:40,527 o.a.f.k.o.c.FlinkDeploymentController [INFO
> ][namespace/flink-deployment-name] Reconciliation successfully completed
> ...
>
> 2022-04-29 10:00:55,862 o.a.f.k.o.c.FlinkDeploymentController [INFO
> ][namespace/flink-deployment-name] Starting reconciliation
> 2022-04-29 10:00:55,863 o.a.f.k.o.o.JobObserver [INFO
> ][namespace/flink-deployment-name] Observing JobManager deployment.
> Previous status: MISSING
> 2022-04-29 10:00:55,863 o.a.f.k.o.o.JobObserver [INFO
> ][namespace/flink-deployment-name] JobManager deployment does not exist
> 2022-04-29 10:00:55,863 o.a.f.k.o.c.FlinkDeploymentController [INFO
> ][namespace/flink-deployment-name] Reconciliation successfully completed
>
>
> [0] https://github.com/apache/flink-kubernetes-operator
>
>
> --
> ChangZhuo Chen (陳昌倬) czchen@{czchen,debian}.org
> http://czchen.info/
> Key fingerprint = BA04 346D C2E1 FE63 C790 8793 CC65 B0CD EC27 5D5B
>
Re: flink operator sometimes cannot start jobmanager after upgrading
Posted by Yang Wang <da...@gmail.com>.
I am afraid we do not handle the scenario that the JobManager deployment is
deleted externally.
Best,
Yang
Őrhidi Mátyás <ma...@gmail.com> 于2022年5月2日周一 16:52写道:
> I filed a Jira for tracking this issue:
> https://issues.apache.org/jira/browse/FLINK-27468
>
> On Mon, May 2, 2022 at 10:31 AM Őrhidi Mátyás <ma...@gmail.com>
> wrote:
>
>> This can be reproduced simply by deleting the kubernetes deployment. The
>> operator cannot recover from this state automatically, by defining a
>> restartNonce on the deployment should recover the state.
>>
>> Regards,
>> Matyas
>>
>> On Mon, May 2, 2022 at 10:00 AM Márton Balassi <ba...@gmail.com>
>> wrote:
>>
>>> Hi ChangZhuo,
>>>
>>> Thanks for reporting this, I think I have just run into this myself too.
>>> Will try to reproduce it, but I do not fully comprehend it yet. If anyone
>>> has a way to reproduce it is more than welcome. :-)
>>>
>>> On Fri, Apr 29, 2022 at 12:16 PM ChangZhuo Chen (陳昌倬) <cz...@czchen.org>
>>> wrote:
>>>
>>>> Hi,
>>>>
>>>> We found that flink operator [0] sometimes cannot start jobmanager after
>>>> upgrading FlinkDeployment. We need to recreate FlinkDeployment to fix
>>>> the problem. Anyone has this issue?
>>>>
>>>> The following is redacted log from flink operator. After status becomes
>>>> MISSING, it keeps in MISSING status for at least 15 minutes.
>>>>
>>>>
>>>> 2022-04-29 09:41:15,141 o.a.f.c.d.a.c.ApplicationClusterDeployer
>>>> [INFO ][namespace/flink-deployment-name] Submitting application in
>>>> 'Application Mode'.
>>>> 2022-04-29 09:41:15,145 o.a.f.r.u.c.m.ProcessMemoryUtils [INFO
>>>> ][namespace/flink-deployment-name] The derived from fraction jvm overhead
>>>> memory (2.400gb (2576980416 bytes)) is greater than its max value
>>>> 1024.000mb (1073741824 bytes), max value will be used instead
>>>> 2022-04-29 09:41:15,146 o.a.f.r.u.c.m.ProcessMemoryUtils [INFO
>>>> ][namespace/flink-deployment-name] The derived from fraction jvm overhead
>>>> memory (5.200gb (5583457568 bytes)) is greater than its max value
>>>> 1024.000mb (1073741824 bytes), max value will be used instead
>>>> 2022-04-29 09:41:15,146 o.a.f.r.u.c.m.ProcessMemoryUtils [INFO
>>>> ][namespace/flink-deployment-name] The derived from fraction network memory
>>>> (5.050gb (5422396292 bytes)) is greater than its max value 4.000gb
>>>> (4294967296 bytes), max value will be used instead
>>>> 2022-04-29 09:41:15,237 o.a.f.k.u.KubernetesUtils [INFO
>>>> ][namespace/flink-deployment-name] Kubernetes deployment requires a fixed
>>>> port. Configuration high-availability.jobmanager.port will be set to 6123
>>>> 2022-04-29 09:41:15,508 o.a.f.k.KubernetesClusterDescriptor [WARN
>>>> ][namespace/flink-deployment-name] Please note that Flink client
>>>> operations(e.g. cancel, list, stop, savepoint, etc.) won't work from
>>>> outside the Kubernetes cluster since 'kubernetes.rest-service.exposed.type'
>>>> has been set to ClusterIP.
>>>> 2022-04-29 09:41:15,508 o.a.f.k.KubernetesClusterDescriptor [INFO
>>>> ][namespace/flink-deployment-name] Create flink application cluster
>>>> flink-deployment-name successfully, JobManager Web Interface:
>>>> http://flink-deployment-name.namespace:8081
>>>> 2022-04-29 09:41:15,510 o.a.f.k.o.s.FlinkService [INFO
>>>> ][namespace/flink-deployment-name] Application cluster successfully deployed
>>>> 2022-04-29 09:41:15,583 o.a.f.k.o.c.FlinkDeploymentController [INFO
>>>> ][namespace/flink-deployment-name] Reconciliation successfully completed
>>>> 2022-04-29 09:41:15,684 o.a.f.k.o.c.FlinkDeploymentController [INFO
>>>> ][namespace/flink-deployment-name] Starting reconciliation
>>>> 2022-04-29 09:41:15,686 o.a.f.k.o.o.JobObserver [INFO
>>>> ][namespace/flink-deployment-name] Observing JobManager deployment.
>>>> Previous status: DEPLOYING
>>>> 2022-04-29 09:41:15,792 o.a.f.k.o.o.JobObserver [INFO
>>>> ][namespace/flink-deployment-name] JobManager is being deployed
>>>> 2022-04-29 09:41:15,792 o.a.f.k.o.c.FlinkDeploymentController [INFO
>>>> ][namespace/flink-deployment-name] Reconciliation successfully completed
>>>> 2022-04-29 09:41:20,795 o.a.f.k.o.c.FlinkDeploymentController [INFO
>>>> ][namespace/flink-deployment-name] Starting reconciliation
>>>> 2022-04-29 09:41:20,797 o.a.f.k.o.o.JobObserver [INFO
>>>> ][namespace/flink-deployment-name] Observing JobManager deployment.
>>>> Previous status: DEPLOYING
>>>> 2022-04-29 09:41:20,896 o.a.f.k.o.o.JobObserver [INFO
>>>> ][namespace/flink-deployment-name] JobManager is being deployed
>>>> 2022-04-29 09:41:20,897 o.a.f.k.o.c.FlinkDeploymentController [INFO
>>>> ][namespace/flink-deployment-name] Reconciliation successfully completed
>>>> 2022-04-29 09:41:25,899 o.a.f.k.o.c.FlinkDeploymentController [INFO
>>>> ][namespace/flink-deployment-name] Starting reconciliation
>>>> 2022-04-29 09:41:25,901 o.a.f.k.o.o.JobObserver [INFO
>>>> ][namespace/flink-deployment-name] Observing JobManager deployment.
>>>> Previous status: DEPLOYING
>>>> 2022-04-29 09:41:25,997 o.a.f.k.o.o.JobObserver [INFO
>>>> ][namespace/flink-deployment-name] JobManager is being deployed
>>>> 2022-04-29 09:41:25,998 o.a.f.k.o.c.FlinkDeploymentController [INFO
>>>> ][namespace/flink-deployment-name] Reconciliation successfully completed
>>>> 2022-04-29 09:41:29,518 o.a.f.k.o.c.FlinkDeploymentController [INFO
>>>> ][namespace/flink-deployment-name] Starting reconciliation
>>>> 2022-04-29 09:41:29,520 o.a.f.k.o.o.JobObserver [INFO
>>>> ][namespace/flink-deployment-name] Observing JobManager deployment.
>>>> Previous status: DEPLOYING
>>>> 2022-04-29 09:41:30,631 o.a.f.k.o.o.JobObserver [INFO
>>>> ][namespace/flink-deployment-name] JobManager is being deployed
>>>> 2022-04-29 09:41:30,631 o.a.f.k.o.c.FlinkDeploymentController [INFO
>>>> ][namespace/flink-deployment-name] Reconciliation successfully completed
>>>> 2022-04-29 09:41:35,639 o.a.f.k.o.c.FlinkDeploymentController [INFO
>>>> ][namespace/flink-deployment-name] Starting reconciliation
>>>> 2022-04-29 09:41:35,640 o.a.f.k.o.o.JobObserver [INFO
>>>> ][namespace/flink-deployment-name] Observing JobManager deployment.
>>>> Previous status: DEPLOYING
>>>> 2022-04-29 09:41:35,756 o.a.f.k.o.o.JobObserver [INFO
>>>> ][namespace/flink-deployment-name] JobManager is being deployed
>>>> 2022-04-29 09:41:35,756 o.a.f.k.o.c.FlinkDeploymentController [INFO
>>>> ][namespace/flink-deployment-name] Reconciliation successfully completed
>>>> 2022-04-29 09:41:40,759 o.a.f.k.o.c.FlinkDeploymentController [INFO
>>>> ][namespace/flink-deployment-name] Starting reconciliation
>>>> 2022-04-29 09:41:40,760 o.a.f.k.o.o.JobObserver [INFO
>>>> ][namespace/flink-deployment-name] Observing JobManager deployment.
>>>> Previous status: DEPLOYING
>>>> 2022-04-29 09:41:40,864 o.a.f.k.o.o.JobObserver [INFO
>>>> ][namespace/flink-deployment-name] JobManager is being deployed
>>>> 2022-04-29 09:41:40,864 o.a.f.k.o.c.FlinkDeploymentController [INFO
>>>> ][namespace/flink-deployment-name] Reconciliation successfully completed
>>>> 2022-04-29 09:41:45,867 o.a.f.k.o.c.FlinkDeploymentController [INFO
>>>> ][namespace/flink-deployment-name] Starting reconciliation
>>>> 2022-04-29 09:41:45,868 o.a.f.k.o.o.JobObserver [INFO
>>>> ][namespace/flink-deployment-name] Observing JobManager deployment.
>>>> Previous status: DEPLOYING
>>>> 2022-04-29 09:41:45,870 o.a.f.k.o.o.JobObserver [INFO
>>>> ][namespace/flink-deployment-name] JobManager deployment port is ready,
>>>> waiting for the Flink REST API...
>>>> 2022-04-29 09:41:45,870 o.a.f.k.o.c.FlinkDeploymentController [INFO
>>>> ][namespace/flink-deployment-name] Reconciliation successfully completed
>>>> 2022-04-29 09:41:55,901 o.a.f.k.o.c.FlinkDeploymentController [INFO
>>>> ][namespace/flink-deployment-name] Starting reconciliation
>>>> 2022-04-29 09:41:55,902 o.a.f.k.o.o.JobObserver [INFO
>>>> ][namespace/flink-deployment-name] Observing JobManager deployment.
>>>> Previous status: DEPLOYED_NOT_READY
>>>> 2022-04-29 09:41:55,902 o.a.f.k.o.o.JobObserver [INFO
>>>> ][namespace/flink-deployment-name] JobManager deployment is ready
>>>> 2022-04-29 09:41:55,902 o.a.f.k.o.o.JobObserver [INFO
>>>> ][namespace/flink-deployment-name] Observing job status
>>>> 2022-04-29 09:41:56,294 o.a.f.k.o.o.JobObserver [INFO
>>>> ][namespace/flink-deployment-name] No job found on cluster yet
>>>> 2022-04-29 09:41:56,294 o.a.f.k.o.c.FlinkDeploymentController [INFO
>>>> ][namespace/flink-deployment-name] Reconciliation successfully completed
>>>> 2022-04-29 09:41:58,443 o.a.f.k.o.c.FlinkDeploymentController [INFO
>>>> ][namespace/flink-deployment-name] Starting reconciliation
>>>> 2022-04-29 09:41:58,445 o.a.f.k.o.o.JobObserver [INFO
>>>> ][namespace/flink-deployment-name] Observing job status
>>>> 2022-04-29 09:42:10,489 o.a.f.k.o.o.JobObserver
>>>> [ERROR][namespace/flink-deployment-name] Exception while listing jobs
>>>> 2022-04-29 09:42:10,489 o.a.f.k.o.o.JobObserver [INFO
>>>> ][namespace/flink-deployment-name] Observing JobManager deployment.
>>>> Previous status: READY
>>>> 2022-04-29 09:42:10,489 o.a.f.k.o.o.JobObserver [INFO
>>>> ][namespace/flink-deployment-name] JobManager deployment does not exist
>>>> 2022-04-29 09:42:10,490 o.a.f.k.o.c.FlinkDeploymentController [INFO
>>>> ][namespace/flink-deployment-name] Reconciliation successfully completed
>>>> 2022-04-29 09:42:25,521 o.a.f.k.o.c.FlinkDeploymentController [INFO
>>>> ][namespace/flink-deployment-name] Starting reconciliation
>>>> 2022-04-29 09:42:25,522 o.a.f.k.o.o.JobObserver [INFO
>>>> ][namespace/flink-deployment-name] Observing JobManager deployment.
>>>> Previous status: MISSING
>>>> 2022-04-29 09:42:25,522 o.a.f.k.o.o.JobObserver [INFO
>>>> ][namespace/flink-deployment-name] JobManager deployment does not exist
>>>> 2022-04-29 09:42:25,522 o.a.f.k.o.c.FlinkDeploymentController [INFO
>>>> ][namespace/flink-deployment-name] Reconciliation successfully completed
>>>> 2022-04-29 09:42:40,526 o.a.f.k.o.c.FlinkDeploymentController [INFO
>>>> ][namespace/flink-deployment-name] Starting reconciliation
>>>> 2022-04-29 09:42:40,527 o.a.f.k.o.o.JobObserver [INFO
>>>> ][namespace/flink-deployment-name] Observing JobManager deployment.
>>>> Previous status: MISSING
>>>> 2022-04-29 09:42:40,527 o.a.f.k.o.o.JobObserver [INFO
>>>> ][namespace/flink-deployment-name] JobManager deployment does not exist
>>>> 2022-04-29 09:42:40,527 o.a.f.k.o.c.FlinkDeploymentController [INFO
>>>> ][namespace/flink-deployment-name] Reconciliation successfully completed
>>>> ...
>>>>
>>>> 2022-04-29 10:00:55,862 o.a.f.k.o.c.FlinkDeploymentController [INFO
>>>> ][namespace/flink-deployment-name] Starting reconciliation
>>>> 2022-04-29 10:00:55,863 o.a.f.k.o.o.JobObserver [INFO
>>>> ][namespace/flink-deployment-name] Observing JobManager deployment.
>>>> Previous status: MISSING
>>>> 2022-04-29 10:00:55,863 o.a.f.k.o.o.JobObserver [INFO
>>>> ][namespace/flink-deployment-name] JobManager deployment does not exist
>>>> 2022-04-29 10:00:55,863 o.a.f.k.o.c.FlinkDeploymentController [INFO
>>>> ][namespace/flink-deployment-name] Reconciliation successfully completed
>>>>
>>>>
>>>> [0] https://github.com/apache/flink-kubernetes-operator
>>>>
>>>>
>>>> --
>>>> ChangZhuo Chen (陳昌倬) czchen@{czchen,debian}.org
>>>> http://czchen.info/
>>>> Key fingerprint = BA04 346D C2E1 FE63 C790 8793 CC65 B0CD EC27 5D5B
>>>>
>>>
Re: flink operator sometimes cannot start jobmanager after upgrading
Posted by Őrhidi Mátyás <ma...@gmail.com>.
I filed a Jira for tracking this issue:
https://issues.apache.org/jira/browse/FLINK-27468
On Mon, May 2, 2022 at 10:31 AM Őrhidi Mátyás <ma...@gmail.com>
wrote:
> This can be reproduced simply by deleting the kubernetes deployment. The
> operator cannot recover from this state automatically, by defining a
> restartNonce on the deployment should recover the state.
>
> Regards,
> Matyas
>
> On Mon, May 2, 2022 at 10:00 AM Márton Balassi <ba...@gmail.com>
> wrote:
>
>> Hi ChangZhuo,
>>
>> Thanks for reporting this, I think I have just run into this myself too.
>> Will try to reproduce it, but I do not fully comprehend it yet. If anyone
>> has a way to reproduce it is more than welcome. :-)
>>
>> On Fri, Apr 29, 2022 at 12:16 PM ChangZhuo Chen (陳昌倬) <cz...@czchen.org>
>> wrote:
>>
>>> Hi,
>>>
>>> We found that flink operator [0] sometimes cannot start jobmanager after
>>> upgrading FlinkDeployment. We need to recreate FlinkDeployment to fix
>>> the problem. Anyone has this issue?
>>>
>>> The following is redacted log from flink operator. After status becomes
>>> MISSING, it keeps in MISSING status for at least 15 minutes.
>>>
>>>
>>> 2022-04-29 09:41:15,141 o.a.f.c.d.a.c.ApplicationClusterDeployer
>>> [INFO ][namespace/flink-deployment-name] Submitting application in
>>> 'Application Mode'.
>>> 2022-04-29 09:41:15,145 o.a.f.r.u.c.m.ProcessMemoryUtils [INFO
>>> ][namespace/flink-deployment-name] The derived from fraction jvm overhead
>>> memory (2.400gb (2576980416 bytes)) is greater than its max value
>>> 1024.000mb (1073741824 bytes), max value will be used instead
>>> 2022-04-29 09:41:15,146 o.a.f.r.u.c.m.ProcessMemoryUtils [INFO
>>> ][namespace/flink-deployment-name] The derived from fraction jvm overhead
>>> memory (5.200gb (5583457568 bytes)) is greater than its max value
>>> 1024.000mb (1073741824 bytes), max value will be used instead
>>> 2022-04-29 09:41:15,146 o.a.f.r.u.c.m.ProcessMemoryUtils [INFO
>>> ][namespace/flink-deployment-name] The derived from fraction network memory
>>> (5.050gb (5422396292 bytes)) is greater than its max value 4.000gb
>>> (4294967296 bytes), max value will be used instead
>>> 2022-04-29 09:41:15,237 o.a.f.k.u.KubernetesUtils [INFO
>>> ][namespace/flink-deployment-name] Kubernetes deployment requires a fixed
>>> port. Configuration high-availability.jobmanager.port will be set to 6123
>>> 2022-04-29 09:41:15,508 o.a.f.k.KubernetesClusterDescriptor [WARN
>>> ][namespace/flink-deployment-name] Please note that Flink client
>>> operations(e.g. cancel, list, stop, savepoint, etc.) won't work from
>>> outside the Kubernetes cluster since 'kubernetes.rest-service.exposed.type'
>>> has been set to ClusterIP.
>>> 2022-04-29 09:41:15,508 o.a.f.k.KubernetesClusterDescriptor [INFO
>>> ][namespace/flink-deployment-name] Create flink application cluster
>>> flink-deployment-name successfully, JobManager Web Interface:
>>> http://flink-deployment-name.namespace:8081
>>> 2022-04-29 09:41:15,510 o.a.f.k.o.s.FlinkService [INFO
>>> ][namespace/flink-deployment-name] Application cluster successfully deployed
>>> 2022-04-29 09:41:15,583 o.a.f.k.o.c.FlinkDeploymentController [INFO
>>> ][namespace/flink-deployment-name] Reconciliation successfully completed
>>> 2022-04-29 09:41:15,684 o.a.f.k.o.c.FlinkDeploymentController [INFO
>>> ][namespace/flink-deployment-name] Starting reconciliation
>>> 2022-04-29 09:41:15,686 o.a.f.k.o.o.JobObserver [INFO
>>> ][namespace/flink-deployment-name] Observing JobManager deployment.
>>> Previous status: DEPLOYING
>>> 2022-04-29 09:41:15,792 o.a.f.k.o.o.JobObserver [INFO
>>> ][namespace/flink-deployment-name] JobManager is being deployed
>>> 2022-04-29 09:41:15,792 o.a.f.k.o.c.FlinkDeploymentController [INFO
>>> ][namespace/flink-deployment-name] Reconciliation successfully completed
>>> 2022-04-29 09:41:20,795 o.a.f.k.o.c.FlinkDeploymentController [INFO
>>> ][namespace/flink-deployment-name] Starting reconciliation
>>> 2022-04-29 09:41:20,797 o.a.f.k.o.o.JobObserver [INFO
>>> ][namespace/flink-deployment-name] Observing JobManager deployment.
>>> Previous status: DEPLOYING
>>> 2022-04-29 09:41:20,896 o.a.f.k.o.o.JobObserver [INFO
>>> ][namespace/flink-deployment-name] JobManager is being deployed
>>> 2022-04-29 09:41:20,897 o.a.f.k.o.c.FlinkDeploymentController [INFO
>>> ][namespace/flink-deployment-name] Reconciliation successfully completed
>>> 2022-04-29 09:41:25,899 o.a.f.k.o.c.FlinkDeploymentController [INFO
>>> ][namespace/flink-deployment-name] Starting reconciliation
>>> 2022-04-29 09:41:25,901 o.a.f.k.o.o.JobObserver [INFO
>>> ][namespace/flink-deployment-name] Observing JobManager deployment.
>>> Previous status: DEPLOYING
>>> 2022-04-29 09:41:25,997 o.a.f.k.o.o.JobObserver [INFO
>>> ][namespace/flink-deployment-name] JobManager is being deployed
>>> 2022-04-29 09:41:25,998 o.a.f.k.o.c.FlinkDeploymentController [INFO
>>> ][namespace/flink-deployment-name] Reconciliation successfully completed
>>> 2022-04-29 09:41:29,518 o.a.f.k.o.c.FlinkDeploymentController [INFO
>>> ][namespace/flink-deployment-name] Starting reconciliation
>>> 2022-04-29 09:41:29,520 o.a.f.k.o.o.JobObserver [INFO
>>> ][namespace/flink-deployment-name] Observing JobManager deployment.
>>> Previous status: DEPLOYING
>>> 2022-04-29 09:41:30,631 o.a.f.k.o.o.JobObserver [INFO
>>> ][namespace/flink-deployment-name] JobManager is being deployed
>>> 2022-04-29 09:41:30,631 o.a.f.k.o.c.FlinkDeploymentController [INFO
>>> ][namespace/flink-deployment-name] Reconciliation successfully completed
>>> 2022-04-29 09:41:35,639 o.a.f.k.o.c.FlinkDeploymentController [INFO
>>> ][namespace/flink-deployment-name] Starting reconciliation
>>> 2022-04-29 09:41:35,640 o.a.f.k.o.o.JobObserver [INFO
>>> ][namespace/flink-deployment-name] Observing JobManager deployment.
>>> Previous status: DEPLOYING
>>> 2022-04-29 09:41:35,756 o.a.f.k.o.o.JobObserver [INFO
>>> ][namespace/flink-deployment-name] JobManager is being deployed
>>> 2022-04-29 09:41:35,756 o.a.f.k.o.c.FlinkDeploymentController [INFO
>>> ][namespace/flink-deployment-name] Reconciliation successfully completed
>>> 2022-04-29 09:41:40,759 o.a.f.k.o.c.FlinkDeploymentController [INFO
>>> ][namespace/flink-deployment-name] Starting reconciliation
>>> 2022-04-29 09:41:40,760 o.a.f.k.o.o.JobObserver [INFO
>>> ][namespace/flink-deployment-name] Observing JobManager deployment.
>>> Previous status: DEPLOYING
>>> 2022-04-29 09:41:40,864 o.a.f.k.o.o.JobObserver [INFO
>>> ][namespace/flink-deployment-name] JobManager is being deployed
>>> 2022-04-29 09:41:40,864 o.a.f.k.o.c.FlinkDeploymentController [INFO
>>> ][namespace/flink-deployment-name] Reconciliation successfully completed
>>> 2022-04-29 09:41:45,867 o.a.f.k.o.c.FlinkDeploymentController [INFO
>>> ][namespace/flink-deployment-name] Starting reconciliation
>>> 2022-04-29 09:41:45,868 o.a.f.k.o.o.JobObserver [INFO
>>> ][namespace/flink-deployment-name] Observing JobManager deployment.
>>> Previous status: DEPLOYING
>>> 2022-04-29 09:41:45,870 o.a.f.k.o.o.JobObserver [INFO
>>> ][namespace/flink-deployment-name] JobManager deployment port is ready,
>>> waiting for the Flink REST API...
>>> 2022-04-29 09:41:45,870 o.a.f.k.o.c.FlinkDeploymentController [INFO
>>> ][namespace/flink-deployment-name] Reconciliation successfully completed
>>> 2022-04-29 09:41:55,901 o.a.f.k.o.c.FlinkDeploymentController [INFO
>>> ][namespace/flink-deployment-name] Starting reconciliation
>>> 2022-04-29 09:41:55,902 o.a.f.k.o.o.JobObserver [INFO
>>> ][namespace/flink-deployment-name] Observing JobManager deployment.
>>> Previous status: DEPLOYED_NOT_READY
>>> 2022-04-29 09:41:55,902 o.a.f.k.o.o.JobObserver [INFO
>>> ][namespace/flink-deployment-name] JobManager deployment is ready
>>> 2022-04-29 09:41:55,902 o.a.f.k.o.o.JobObserver [INFO
>>> ][namespace/flink-deployment-name] Observing job status
>>> 2022-04-29 09:41:56,294 o.a.f.k.o.o.JobObserver [INFO
>>> ][namespace/flink-deployment-name] No job found on cluster yet
>>> 2022-04-29 09:41:56,294 o.a.f.k.o.c.FlinkDeploymentController [INFO
>>> ][namespace/flink-deployment-name] Reconciliation successfully completed
>>> 2022-04-29 09:41:58,443 o.a.f.k.o.c.FlinkDeploymentController [INFO
>>> ][namespace/flink-deployment-name] Starting reconciliation
>>> 2022-04-29 09:41:58,445 o.a.f.k.o.o.JobObserver [INFO
>>> ][namespace/flink-deployment-name] Observing job status
>>> 2022-04-29 09:42:10,489 o.a.f.k.o.o.JobObserver
>>> [ERROR][namespace/flink-deployment-name] Exception while listing jobs
>>> 2022-04-29 09:42:10,489 o.a.f.k.o.o.JobObserver [INFO
>>> ][namespace/flink-deployment-name] Observing JobManager deployment.
>>> Previous status: READY
>>> 2022-04-29 09:42:10,489 o.a.f.k.o.o.JobObserver [INFO
>>> ][namespace/flink-deployment-name] JobManager deployment does not exist
>>> 2022-04-29 09:42:10,490 o.a.f.k.o.c.FlinkDeploymentController [INFO
>>> ][namespace/flink-deployment-name] Reconciliation successfully completed
>>> 2022-04-29 09:42:25,521 o.a.f.k.o.c.FlinkDeploymentController [INFO
>>> ][namespace/flink-deployment-name] Starting reconciliation
>>> 2022-04-29 09:42:25,522 o.a.f.k.o.o.JobObserver [INFO
>>> ][namespace/flink-deployment-name] Observing JobManager deployment.
>>> Previous status: MISSING
>>> 2022-04-29 09:42:25,522 o.a.f.k.o.o.JobObserver [INFO
>>> ][namespace/flink-deployment-name] JobManager deployment does not exist
>>> 2022-04-29 09:42:25,522 o.a.f.k.o.c.FlinkDeploymentController [INFO
>>> ][namespace/flink-deployment-name] Reconciliation successfully completed
>>> 2022-04-29 09:42:40,526 o.a.f.k.o.c.FlinkDeploymentController [INFO
>>> ][namespace/flink-deployment-name] Starting reconciliation
>>> 2022-04-29 09:42:40,527 o.a.f.k.o.o.JobObserver [INFO
>>> ][namespace/flink-deployment-name] Observing JobManager deployment.
>>> Previous status: MISSING
>>> 2022-04-29 09:42:40,527 o.a.f.k.o.o.JobObserver [INFO
>>> ][namespace/flink-deployment-name] JobManager deployment does not exist
>>> 2022-04-29 09:42:40,527 o.a.f.k.o.c.FlinkDeploymentController [INFO
>>> ][namespace/flink-deployment-name] Reconciliation successfully completed
>>> ...
>>>
>>> 2022-04-29 10:00:55,862 o.a.f.k.o.c.FlinkDeploymentController [INFO
>>> ][namespace/flink-deployment-name] Starting reconciliation
>>> 2022-04-29 10:00:55,863 o.a.f.k.o.o.JobObserver [INFO
>>> ][namespace/flink-deployment-name] Observing JobManager deployment.
>>> Previous status: MISSING
>>> 2022-04-29 10:00:55,863 o.a.f.k.o.o.JobObserver [INFO
>>> ][namespace/flink-deployment-name] JobManager deployment does not exist
>>> 2022-04-29 10:00:55,863 o.a.f.k.o.c.FlinkDeploymentController [INFO
>>> ][namespace/flink-deployment-name] Reconciliation successfully completed
>>>
>>>
>>> [0] https://github.com/apache/flink-kubernetes-operator
>>>
>>>
>>> --
>>> ChangZhuo Chen (陳昌倬) czchen@{czchen,debian}.org
>>> http://czchen.info/
>>> Key fingerprint = BA04 346D C2E1 FE63 C790 8793 CC65 B0CD EC27 5D5B
>>>
>>
Re: flink operator sometimes cannot start jobmanager after upgrading
Posted by Őrhidi Mátyás <ma...@gmail.com>.
This can be reproduced simply by deleting the kubernetes deployment. The
operator cannot recover from this state automatically, by defining a
restartNonce on the deployment should recover the state.
Regards,
Matyas
On Mon, May 2, 2022 at 10:00 AM Márton Balassi <ba...@gmail.com>
wrote:
> Hi ChangZhuo,
>
> Thanks for reporting this, I think I have just run into this myself too.
> Will try to reproduce it, but I do not fully comprehend it yet. If anyone
> has a way to reproduce it is more than welcome. :-)
>
> On Fri, Apr 29, 2022 at 12:16 PM ChangZhuo Chen (陳昌倬) <cz...@czchen.org>
> wrote:
>
>> Hi,
>>
>> We found that flink operator [0] sometimes cannot start jobmanager after
>> upgrading FlinkDeployment. We need to recreate FlinkDeployment to fix
>> the problem. Anyone has this issue?
>>
>> The following is redacted log from flink operator. After status becomes
>> MISSING, it keeps in MISSING status for at least 15 minutes.
>>
>>
>> 2022-04-29 09:41:15,141 o.a.f.c.d.a.c.ApplicationClusterDeployer
>> [INFO ][namespace/flink-deployment-name] Submitting application in
>> 'Application Mode'.
>> 2022-04-29 09:41:15,145 o.a.f.r.u.c.m.ProcessMemoryUtils [INFO
>> ][namespace/flink-deployment-name] The derived from fraction jvm overhead
>> memory (2.400gb (2576980416 bytes)) is greater than its max value
>> 1024.000mb (1073741824 bytes), max value will be used instead
>> 2022-04-29 09:41:15,146 o.a.f.r.u.c.m.ProcessMemoryUtils [INFO
>> ][namespace/flink-deployment-name] The derived from fraction jvm overhead
>> memory (5.200gb (5583457568 bytes)) is greater than its max value
>> 1024.000mb (1073741824 bytes), max value will be used instead
>> 2022-04-29 09:41:15,146 o.a.f.r.u.c.m.ProcessMemoryUtils [INFO
>> ][namespace/flink-deployment-name] The derived from fraction network memory
>> (5.050gb (5422396292 bytes)) is greater than its max value 4.000gb
>> (4294967296 bytes), max value will be used instead
>> 2022-04-29 09:41:15,237 o.a.f.k.u.KubernetesUtils [INFO
>> ][namespace/flink-deployment-name] Kubernetes deployment requires a fixed
>> port. Configuration high-availability.jobmanager.port will be set to 6123
>> 2022-04-29 09:41:15,508 o.a.f.k.KubernetesClusterDescriptor [WARN
>> ][namespace/flink-deployment-name] Please note that Flink client
>> operations(e.g. cancel, list, stop, savepoint, etc.) won't work from
>> outside the Kubernetes cluster since 'kubernetes.rest-service.exposed.type'
>> has been set to ClusterIP.
>> 2022-04-29 09:41:15,508 o.a.f.k.KubernetesClusterDescriptor [INFO
>> ][namespace/flink-deployment-name] Create flink application cluster
>> flink-deployment-name successfully, JobManager Web Interface:
>> http://flink-deployment-name.namespace:8081
>> 2022-04-29 09:41:15,510 o.a.f.k.o.s.FlinkService [INFO
>> ][namespace/flink-deployment-name] Application cluster successfully deployed
>> 2022-04-29 09:41:15,583 o.a.f.k.o.c.FlinkDeploymentController [INFO
>> ][namespace/flink-deployment-name] Reconciliation successfully completed
>> 2022-04-29 09:41:15,684 o.a.f.k.o.c.FlinkDeploymentController [INFO
>> ][namespace/flink-deployment-name] Starting reconciliation
>> 2022-04-29 09:41:15,686 o.a.f.k.o.o.JobObserver [INFO
>> ][namespace/flink-deployment-name] Observing JobManager deployment.
>> Previous status: DEPLOYING
>> 2022-04-29 09:41:15,792 o.a.f.k.o.o.JobObserver [INFO
>> ][namespace/flink-deployment-name] JobManager is being deployed
>> 2022-04-29 09:41:15,792 o.a.f.k.o.c.FlinkDeploymentController [INFO
>> ][namespace/flink-deployment-name] Reconciliation successfully completed
>> 2022-04-29 09:41:20,795 o.a.f.k.o.c.FlinkDeploymentController [INFO
>> ][namespace/flink-deployment-name] Starting reconciliation
>> 2022-04-29 09:41:20,797 o.a.f.k.o.o.JobObserver [INFO
>> ][namespace/flink-deployment-name] Observing JobManager deployment.
>> Previous status: DEPLOYING
>> 2022-04-29 09:41:20,896 o.a.f.k.o.o.JobObserver [INFO
>> ][namespace/flink-deployment-name] JobManager is being deployed
>> 2022-04-29 09:41:20,897 o.a.f.k.o.c.FlinkDeploymentController [INFO
>> ][namespace/flink-deployment-name] Reconciliation successfully completed
>> 2022-04-29 09:41:25,899 o.a.f.k.o.c.FlinkDeploymentController [INFO
>> ][namespace/flink-deployment-name] Starting reconciliation
>> 2022-04-29 09:41:25,901 o.a.f.k.o.o.JobObserver [INFO
>> ][namespace/flink-deployment-name] Observing JobManager deployment.
>> Previous status: DEPLOYING
>> 2022-04-29 09:41:25,997 o.a.f.k.o.o.JobObserver [INFO
>> ][namespace/flink-deployment-name] JobManager is being deployed
>> 2022-04-29 09:41:25,998 o.a.f.k.o.c.FlinkDeploymentController [INFO
>> ][namespace/flink-deployment-name] Reconciliation successfully completed
>> 2022-04-29 09:41:29,518 o.a.f.k.o.c.FlinkDeploymentController [INFO
>> ][namespace/flink-deployment-name] Starting reconciliation
>> 2022-04-29 09:41:29,520 o.a.f.k.o.o.JobObserver [INFO
>> ][namespace/flink-deployment-name] Observing JobManager deployment.
>> Previous status: DEPLOYING
>> 2022-04-29 09:41:30,631 o.a.f.k.o.o.JobObserver [INFO
>> ][namespace/flink-deployment-name] JobManager is being deployed
>> 2022-04-29 09:41:30,631 o.a.f.k.o.c.FlinkDeploymentController [INFO
>> ][namespace/flink-deployment-name] Reconciliation successfully completed
>> 2022-04-29 09:41:35,639 o.a.f.k.o.c.FlinkDeploymentController [INFO
>> ][namespace/flink-deployment-name] Starting reconciliation
>> 2022-04-29 09:41:35,640 o.a.f.k.o.o.JobObserver [INFO
>> ][namespace/flink-deployment-name] Observing JobManager deployment.
>> Previous status: DEPLOYING
>> 2022-04-29 09:41:35,756 o.a.f.k.o.o.JobObserver [INFO
>> ][namespace/flink-deployment-name] JobManager is being deployed
>> 2022-04-29 09:41:35,756 o.a.f.k.o.c.FlinkDeploymentController [INFO
>> ][namespace/flink-deployment-name] Reconciliation successfully completed
>> 2022-04-29 09:41:40,759 o.a.f.k.o.c.FlinkDeploymentController [INFO
>> ][namespace/flink-deployment-name] Starting reconciliation
>> 2022-04-29 09:41:40,760 o.a.f.k.o.o.JobObserver [INFO
>> ][namespace/flink-deployment-name] Observing JobManager deployment.
>> Previous status: DEPLOYING
>> 2022-04-29 09:41:40,864 o.a.f.k.o.o.JobObserver [INFO
>> ][namespace/flink-deployment-name] JobManager is being deployed
>> 2022-04-29 09:41:40,864 o.a.f.k.o.c.FlinkDeploymentController [INFO
>> ][namespace/flink-deployment-name] Reconciliation successfully completed
>> 2022-04-29 09:41:45,867 o.a.f.k.o.c.FlinkDeploymentController [INFO
>> ][namespace/flink-deployment-name] Starting reconciliation
>> 2022-04-29 09:41:45,868 o.a.f.k.o.o.JobObserver [INFO
>> ][namespace/flink-deployment-name] Observing JobManager deployment.
>> Previous status: DEPLOYING
>> 2022-04-29 09:41:45,870 o.a.f.k.o.o.JobObserver [INFO
>> ][namespace/flink-deployment-name] JobManager deployment port is ready,
>> waiting for the Flink REST API...
>> 2022-04-29 09:41:45,870 o.a.f.k.o.c.FlinkDeploymentController [INFO
>> ][namespace/flink-deployment-name] Reconciliation successfully completed
>> 2022-04-29 09:41:55,901 o.a.f.k.o.c.FlinkDeploymentController [INFO
>> ][namespace/flink-deployment-name] Starting reconciliation
>> 2022-04-29 09:41:55,902 o.a.f.k.o.o.JobObserver [INFO
>> ][namespace/flink-deployment-name] Observing JobManager deployment.
>> Previous status: DEPLOYED_NOT_READY
>> 2022-04-29 09:41:55,902 o.a.f.k.o.o.JobObserver [INFO
>> ][namespace/flink-deployment-name] JobManager deployment is ready
>> 2022-04-29 09:41:55,902 o.a.f.k.o.o.JobObserver [INFO
>> ][namespace/flink-deployment-name] Observing job status
>> 2022-04-29 09:41:56,294 o.a.f.k.o.o.JobObserver [INFO
>> ][namespace/flink-deployment-name] No job found on cluster yet
>> 2022-04-29 09:41:56,294 o.a.f.k.o.c.FlinkDeploymentController [INFO
>> ][namespace/flink-deployment-name] Reconciliation successfully completed
>> 2022-04-29 09:41:58,443 o.a.f.k.o.c.FlinkDeploymentController [INFO
>> ][namespace/flink-deployment-name] Starting reconciliation
>> 2022-04-29 09:41:58,445 o.a.f.k.o.o.JobObserver [INFO
>> ][namespace/flink-deployment-name] Observing job status
>> 2022-04-29 09:42:10,489 o.a.f.k.o.o.JobObserver
>> [ERROR][namespace/flink-deployment-name] Exception while listing jobs
>> 2022-04-29 09:42:10,489 o.a.f.k.o.o.JobObserver [INFO
>> ][namespace/flink-deployment-name] Observing JobManager deployment.
>> Previous status: READY
>> 2022-04-29 09:42:10,489 o.a.f.k.o.o.JobObserver [INFO
>> ][namespace/flink-deployment-name] JobManager deployment does not exist
>> 2022-04-29 09:42:10,490 o.a.f.k.o.c.FlinkDeploymentController [INFO
>> ][namespace/flink-deployment-name] Reconciliation successfully completed
>> 2022-04-29 09:42:25,521 o.a.f.k.o.c.FlinkDeploymentController [INFO
>> ][namespace/flink-deployment-name] Starting reconciliation
>> 2022-04-29 09:42:25,522 o.a.f.k.o.o.JobObserver [INFO
>> ][namespace/flink-deployment-name] Observing JobManager deployment.
>> Previous status: MISSING
>> 2022-04-29 09:42:25,522 o.a.f.k.o.o.JobObserver [INFO
>> ][namespace/flink-deployment-name] JobManager deployment does not exist
>> 2022-04-29 09:42:25,522 o.a.f.k.o.c.FlinkDeploymentController [INFO
>> ][namespace/flink-deployment-name] Reconciliation successfully completed
>> 2022-04-29 09:42:40,526 o.a.f.k.o.c.FlinkDeploymentController [INFO
>> ][namespace/flink-deployment-name] Starting reconciliation
>> 2022-04-29 09:42:40,527 o.a.f.k.o.o.JobObserver [INFO
>> ][namespace/flink-deployment-name] Observing JobManager deployment.
>> Previous status: MISSING
>> 2022-04-29 09:42:40,527 o.a.f.k.o.o.JobObserver [INFO
>> ][namespace/flink-deployment-name] JobManager deployment does not exist
>> 2022-04-29 09:42:40,527 o.a.f.k.o.c.FlinkDeploymentController [INFO
>> ][namespace/flink-deployment-name] Reconciliation successfully completed
>> ...
>>
>> 2022-04-29 10:00:55,862 o.a.f.k.o.c.FlinkDeploymentController [INFO
>> ][namespace/flink-deployment-name] Starting reconciliation
>> 2022-04-29 10:00:55,863 o.a.f.k.o.o.JobObserver [INFO
>> ][namespace/flink-deployment-name] Observing JobManager deployment.
>> Previous status: MISSING
>> 2022-04-29 10:00:55,863 o.a.f.k.o.o.JobObserver [INFO
>> ][namespace/flink-deployment-name] JobManager deployment does not exist
>> 2022-04-29 10:00:55,863 o.a.f.k.o.c.FlinkDeploymentController [INFO
>> ][namespace/flink-deployment-name] Reconciliation successfully completed
>>
>>
>> [0] https://github.com/apache/flink-kubernetes-operator
>>
>>
>> --
>> ChangZhuo Chen (陳昌倬) czchen@{czchen,debian}.org
>> http://czchen.info/
>> Key fingerprint = BA04 346D C2E1 FE63 C790 8793 CC65 B0CD EC27 5D5B
>>
>