Posted to issues@flink.apache.org by "wgcn (JIRA)" <ji...@apache.org> on 2019/06/04 11:08:00 UTC
[jira] [Updated] (FLINK-12728) taskmanager container can't launch on nodemanager machine because of kerberos

     [ https://issues.apache.org/jira/browse/FLINK-12728?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

wgcn updated FLINK-12728:
-------------------------
    Description: 
    A Flink job cannot restart after it has been running for a long time and a TaskManager then restarts. The ApplicationMaster (AM) log shows the AM requesting TaskManager containers over and over. The NodeManager log shows that the newly requested containers cannot download files from HDFS because of Kerberos. I configured the keytab settings

security.kerberos.login.use-ticket-cache: false
security.kerberos.login.keytab: /data/sysdir/knit/user/.flink.keytab
security.kerberos.login.principal: flink/client-docker-201-53.hadoop.lq@HADOOP.LQ2.

on the flink-client machine, and the keytab file exists.

The AM and NodeManager logs are attached (AM.log, NM.log).
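
A minimal sketch for double-checking the client-side settings quoted above. It only parses the three security.kerberos.login.* keys and checks that the keytab path points to a file. It does not diagnose the HDFS download failure on the NodeManager. The function names read_kerberos_settings and check_kerberos_settings are hypothetical helpers, not part of Flink, and the sketch assumes the settings are plain "key: value" lines as in flink-conf.yaml.

```python
import os

# The three Kerberos login keys mentioned in the report.
KEYS = (
    "security.kerberos.login.use-ticket-cache",
    "security.kerberos.login.keytab",
    "security.kerberos.login.principal",
)


def read_kerberos_settings(conf_text):
    """Extract the Kerberos login keys from flink-conf.yaml-style text."""
    settings = {}
    for raw in conf_text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comments
        key, sep, value = line.partition(":")
        if sep and key.strip() in KEYS:
            settings[key.strip()] = value.strip()
    return settings


def check_kerberos_settings(settings):
    """Return a list of problems; an empty list means the basics look fine."""
    problems = []
    keytab = settings.get("security.kerberos.login.keytab")
    principal = settings.get("security.kerberos.login.principal")
    if not keytab:
        problems.append("keytab path is not set")
    elif not os.path.isfile(keytab):
        problems.append("keytab file not found: " + keytab)
    if not principal:
        problems.append("principal is not set")
    elif "@" not in principal:
        problems.append("principal has no realm: " + principal)
    return problems
```

Running check_kerberos_settings(read_kerberos_settings(open(path).read())) on the client machine, with path set to the local flink-conf.yaml, would confirm the "keytab exists" statement independently of the YARN-side logs.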


  was:
    A Flink job cannot restart after it has been running for a long time and a TaskManager then restarts. The ApplicationMaster (AM) log shows the AM requesting TaskManager containers over and over. The NodeManager log shows that the newly requested containers cannot download files from HDFS because of Kerberos. I configured the keytab settings

security.kerberos.login.use-ticket-cache: false
security.kerberos.login.keytab: /data/sysdir/knit/user/.flink.keytab
security.kerberos.login.principal: flink/client-docker-201-53.hadoop.lq@HADOOP.LQ2.

on the flink-client machine, and the keytab file exists.

The AM and NodeManager logs are attached (AM.log, NM.log).


> taskmanager container can't launch on nodemanager machine because of kerberos
> -----------------------------------------------------------------------------
>
>                 Key: FLINK-12728
>                 URL: https://issues.apache.org/jira/browse/FLINK-12728
>             Project: Flink
>          Issue Type: Bug
>          Components: Deployment / YARN
>    Affects Versions: 1.7.2
>         Environment: linux 
> jdk8
> hadoop 2.7.2
> flink 1.7.2
>            Reporter: wgcn
>            Priority: Major
>         Attachments: AM.log, NM.log
>
>
> A Flink job cannot restart after it has been running for a long time and a TaskManager then restarts. The ApplicationMaster (AM) log shows the AM requesting TaskManager containers over and over. The NodeManager log shows that the newly requested containers cannot download files from HDFS because of Kerberos. I configured the keytab settings
> security.kerberos.login.use-ticket-cache: false
> security.kerberos.login.keytab: /data/sysdir/knit/user/.flink.keytab
> security.kerberos.login.principal: flink/client-docker-201-53.hadoop.lq@HADOOP.LQ2.
> on the flink-client machine, and the keytab file exists.
> The AM and NodeManager logs are attached (AM.log, NM.log).



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)