You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@airflow.apache.org by GitBox <gi...@apache.org> on 2020/08/03 22:32:57 UTC

[GitHub] [airflow] iantbutler01 commented on issue #10122: KubernetesPodOperator specifically takes a very long time to update to failed state after task fails.

iantbutler01 commented on issue #10122:
URL: https://github.com/apache/airflow/issues/10122#issuecomment-668273631


   KubernetesPodOperator is a red herring. I believe I know the issue and it should be addressed as it will happen on every failure that attempts to send an email.
   
   s = smtplib.SMTP_SSL(SMTP_HOST, SMTP_PORT) if SMTP_SSL else smtplib.SMTP(SMTP_HOST, SMTP_PORT)
   does not have a timeout set. In this case it falls back to socket._GLOBAL_DEFAULT_TIMEOUT which in the case of socket.connect is treated as None. This means it will hang indefinitely if it is unable to make the SMPT connection.
   
   I am testing the fix on my end right now assuming all good I think a PR should be made to set a default timeout of say 60 seconds. I am happy to make and test that PR if it's wanted.


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org