You are viewing a plain text version of this content. The canonical link for it is here.
Posted to gitbox@hive.apache.org by GitBox <gi...@apache.org> on 2020/02/18 20:08:04 UTC

[GitHub] [hive] prasanthj opened a new pull request #917: HIVE-22359: LLAP: when a node restarts with the exact same host/port in kubernetes it is not detected as a task failure

prasanthj opened a new pull request #917: HIVE-22359: LLAP: when a node restarts with the exact same host/port in kubernetes it is not detected as a task failure
URL: https://github.com/apache/hive/pull/917
 
 
   In kubernete environments, the hostnames and ports are same for LLAP service but IP address of pods can change. There are some assumptions in LLAP that handles hostname:port and caches connections based on that. Also AM thinks that certain host is running some task attempts but when the LLAP pod restarts all the tasks on that node gets killed or replaced with new tasks in which case LLAP will heartbeat with different task attempts which AM does not expect. 
   
   This PR fixes 2 issues
   - Includes IP address in hostId that is used for caching RPC connections
   - When AM expects some tasks to be there on some node and if does not exists then it will kill those task attempts so that it gets rescheduled.

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
users@infra.apache.org


With regards,
Apache Git Services

---------------------------------------------------------------------
To unsubscribe, e-mail: gitbox-unsubscribe@hive.apache.org
For additional commands, e-mail: gitbox-help@hive.apache.org