You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@hive.apache.org by "Wenping Zhan (Jira)" <ji...@apache.org> on 2022/04/25 02:15:00 UTC

[jira] [Commented] (HIVE-10570) HiveServer2 shut downs due to temporary ZooKeeper unavailability, causes permanent outage instead of temporary

    [ https://issues.apache.org/jira/browse/HIVE-10570?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17527279#comment-17527279 ] 

Wenping Zhan commented on HIVE-10570:
-------------------------------------

I‘m not sure whether I am facing the same issue. Because temporary network issues lead zookeeper occurs  expiring session with hiveserver2(eg:Expiring session 0x100013f4434000b, timeout of 120000ms exceeded), I found hiveserver2 znode removed by zookeeper, and I got the similar hs2 server shut down log info like above in Hive 2.3.6, finally I need to restart hiveserver2 to recovery. How to resolve the issues? 

> HiveServer2 shut downs due to temporary ZooKeeper unavailability, causes permanent outage instead of temporary
> --------------------------------------------------------------------------------------------------------------
>
>                 Key: HIVE-10570
>                 URL: https://issues.apache.org/jira/browse/HIVE-10570
>             Project: Hive
>          Issue Type: Bug
>          Components: HiveServer2
>    Affects Versions: 0.14.0
>         Environment: HDP 2.2
>            Reporter: Hari Sekhon
>            Priority: Critical
>
> HiveServer2 should not shut down when there is temporary ZooKeeper unavailability (eg. temporary network outage). This prevents retry and recovery later as HiveServer2 is no longer running and therefore cannot retry - HiveServer2 stays offline indefinitely until operator intervention to restart it, even for minor temporary problems.
> I believe this behaviour is due to recent ZooKeeper dependency addition for HiveServer2 HA.
> {code}2015-05-01 11:35:05,367 WARN  zookeeper.ClientCnxn (ClientCnxn.java:run(1102)) - Session 0x14d004cb02c001e for server null, unexpected error, closing socket
> connection and attempting reconnect
> java.net.SocketException: Network is unreachable
>         at sun.nio.ch.Net.connect0(Native Method)
>         at sun.nio.ch.Net.connect(Net.java:465)
>         at sun.nio.ch.Net.connect(Net.java:457)
>         at sun.nio.ch.SocketChannelImpl.connect(SocketChannelImpl.java:670)
>         at org.apache.zookeeper.ClientCnxnSocketNIO.registerAndConnect(ClientCnxnSocketNIO.java:277)
>         at org.apache.zookeeper.ClientCnxnSocketNIO.connect(ClientCnxnSocketNIO.java:287)
>         at org.apache.zookeeper.ClientCnxn$SendThread.startConnect(ClientCnxn.java:967)
>         at org.apache.zookeeper.ClientCnxn$SendThread.run(ClientCnxn.java:1003)
> 2015-05-01 11:35:05,629 INFO  client.ZooKeeperSaslClient (ZooKeeperSaslClient.java:run(285)) - Client will use GSSAPI as SASL mechanism.
> 2015-05-01 11:35:05,630 INFO  zookeeper.ClientCnxn (ClientCnxn.java:logStartConnect(975)) - Opening socket connection to server <custom_scrubbed>/<ip>:2181. Will attempt to SASL-authenticate using Login Context section 'HiveZooKeeperClient'
> 2015-05-01 11:35:05,630 ERROR zookeeper.ClientCnxnSocketNIO (ClientCnxnSocketNIO.java:connect(289)) - Unable to open socket to <custom_scrubbed>/<ip>:2181
> 2015-05-01 11:35:05,630 ERROR zookeeper.ClientCnxnSocketNIO (ClientCnxnSocketNIO.java:connect(289)) - Unable to open socket to <custom_scrubbed>/<ip>:2181
> 2015-05-01 11:35:05,630 WARN  zookeeper.ClientCnxn (ClientCnxn.java:run(1102)) - Session 0x14d004cb02c001e for server null, unexpected error, closing socket
> connection and attempting reconnect
> java.net.SocketException: Network is unreachable
>         at sun.nio.ch.Net.connect0(Native Method)
>         at sun.nio.ch.Net.connect(Net.java:465)
>         at sun.nio.ch.Net.connect(Net.java:457)
>         at sun.nio.ch.SocketChannelImpl.connect(SocketChannelImpl.java:670)
>         at org.apache.zookeeper.ClientCnxnSocketNIO.registerAndConnect(ClientCnxnSocketNIO.java:277)
>         at org.apache.zookeeper.ClientCnxnSocketNIO.connect(ClientCnxnSocketNIO.java:287)
>         at org.apache.zookeeper.ClientCnxn$SendThread.startConnect(ClientCnxn.java:967)
>         at org.apache.zookeeper.ClientCnxn$SendThread.run(ClientCnxn.java:1003)
> 2015-05-01 11:35:05,943 INFO  server.HiveServer2 (HiveServer2.java:stop(299)) - Shutting down HiveServer2
> 2015-05-01 11:35:05,944 INFO  thrift.ThriftCLIService (ThriftCLIService.java:stop(137)) - Thrift server has stopped
> 2015-05-01 11:35:05,944 INFO  service.AbstractService (AbstractService.java:stop(125)) - Service:ThriftBinaryCLIService is stopped.
> 2015-05-01 11:35:05,944 INFO  service.AbstractService (AbstractService.java:stop(125)) - Service:OperationManager is stopped.
> 2015-05-01 11:35:05,944 INFO  service.AbstractService (AbstractService.java:stop(125)) - Service:SessionManager is stopped.
> 2015-05-01 11:35:05,946 INFO  server.HiveServer2 (HiveStringUtils.java:run(679)) - SHUTDOWN_MSG:
> /************************************************************
> SHUTDOWN_MSG: Shutting down HiveServer2 at <fqdn>/<ip>
> ************************************************************/{code}
> Hari Sekhon
> http://www.linkedin.com/in/harisekhon



--
This message was sent by Atlassian Jira
(v8.20.7#820007)