Posted to issues@activemq.apache.org by "Catalin Alexandru Zamfir (JIRA)" <ji...@apache.org> on 2018/04/18 09:48:00 UTC

[jira] [Commented] (ARTEMIS-1751) Artemis backup servers hawtio web management doesn't come online

    [ https://issues.apache.org/jira/browse/ARTEMIS-1751?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16442197#comment-16442197 ] 

Catalin Alexandru Zamfir commented on ARTEMIS-1751:
---------------------------------------------------

Hello,

We have a live/backup1/backup2 set-up. When doing failovers (we're testing the different scenarios), the Hawtio console does not start back up with the live server. We have a healthcheck in Docker on port 8161, and when our current "live" has come back the console is not active.

 
{code:java}
db21d0feaebb        artifactory.d.g.org/external/default/production/artemis:2.5.0                "/var/lib/artemis/..."   2 hours ago         Up 32 minutes (unhealthy)                       i1-shared-artemis-r1-p
{code}
The live server for us is i1-shared-artemis-r1-p. We know this from the logs:
{code:java}
2018-04-18 09:07:52.595 dlk-p-mqs001 dockerc: i1-shared-artemis-r1-p: 2018-04-18 09:07:49,118 INFO [org.apache.activemq.artemis.core.server] AMQ221020: Started EPOLL Acceptor at 10.117.28.32:61616 for protocols [CORE,MQTT,AMQP,STOMP,HORNETQ,OPENWIRE]
2018-04-18 09:07:52.595 dlk-p-mqs001 dockerc: i1-shared-artemis-r1-p: 2018-04-18 09:07:49,063 INFO [org.apache.activemq.artemis.core.server] AMQ221007: Server is now live
2018-04-18 09:07:52.595 dlk-p-mqs001 dockerc: i1-shared-artemis-r1-p: 2018-04-18 09:07:48,448 INFO [org.apache.activemq.artemis.core.server] AMQ221071: Failing over based on quorum vote results.
{code}
But trying to contact the port (command line or browser) we cannot connect:
{code:java}
root@dlk-p-mqs001:/ # nc -zv i1-shared-artemis-r1-p.d.g.org 8161
nc: connect to i1-shared-artemis-r1-p.d.g.org port 8161 (tcp) failed: Connection refused
root@dlk-p-mqs001:/ # date
Wed Apr 18 09:44:04 UTC 2018
{code}
Healthcheck (in Compose format):
{code:java}
healthcheck:
  test: [CMD, nc, -zv, i1-shared-artemis-r1-p.d.g.org, '8161']
  interval: 90s
  timeout: 10s
  retries: 3
  start_period: 60s

{code}
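As a rough illustration, the TCP-level check that nc performs could be sketched in Python as below. The function names `is_port_open` and `wait_for_port` are hypothetical (they are not part of Artemis, Hawtio, or Docker), and the default values only loosely mirror the healthcheck settings above; Docker's own retry semantics may differ from this simple loop.

```python
import socket
import time


def is_port_open(host: str, port: int, timeout: float = 10.0) -> bool:
    """Return True if a TCP connection to host:port succeeds within `timeout` seconds."""
    try:
        # create_connection raises OSError (refused, timed out, DNS failure) on error.
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False


def wait_for_port(host: str, port: int, retries: int = 3,
                  interval: float = 90.0, timeout: float = 10.0) -> bool:
    """Probe host:port up to `retries` times, sleeping `interval` seconds between attempts.

    Assumption: this is a simplified stand-in for a container healthcheck,
    not a reimplementation of Docker's behaviour.
    """
    for attempt in range(retries):
        if is_port_open(host, port, timeout):
            return True
        if attempt < retries - 1:
            time.sleep(interval)
    return False
```

Calling `wait_for_port("i1-shared-artemis-r1-p.d.g.org", 8161)` after a failover would report whether anything is listening on the console port at all. Note that a successful TCP connect only shows the port is bound; it does not confirm that the Hawtio console itself is serving pages.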

> Artemis backup servers hawtio web management doesn't come online
> ----------------------------------------------------------------
>
>                 Key: ARTEMIS-1751
>                 URL: https://issues.apache.org/jira/browse/ARTEMIS-1751
>             Project: ActiveMQ Artemis
>          Issue Type: Bug
>         Environment: Artemis 2.4.0
>            Reporter: Ovidiu Stroe
>            Priority: Major
>             Fix For: 2.5.0
>
>
> After a backup server assumes the master role and then goes back to slave, the hawtio interface is no longer available.
> Steps to reproduce:
> 1. Start live node
> 2. Start backup node
> 3. Open hawtio console on live - note it works
> 4. Open hawtio console on backup - note it works
> 5. Stop live node
> 6. Note backup goes live, hawtio console works
> 7. Start live again
> 8. Check backup hawtio - it's completely down


