Posted to issues@activemq.apache.org by "Sebastian Lövdahl (Jira)" <ji...@apache.org> on 2020/11/04 06:31:00 UTC

[jira] [Commented] (ARTEMIS-2892) Replication stopped working after many failovers

    [ https://issues.apache.org/jira/browse/ARTEMIS-2892?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17225888#comment-17225888 ] 

Sebastian Lövdahl commented on ARTEMIS-2892:
--------------------------------------------

We just saw this in production as well. We're running 2.10.1 at the moment. I don't know much about the internals here, but is there any chance that https://issues.apache.org/jira/browse/ARTEMIS-2868, which was fixed in 2.15.0, could have solved this?

> Replication stopped working after many failovers
> ------------------------------------------------
>
>                 Key: ARTEMIS-2892
>                 URL: https://issues.apache.org/jira/browse/ARTEMIS-2892
>             Project: ActiveMQ Artemis
>          Issue Type: Bug
>          Components: Broker
>    Affects Versions: 2.11.0
>            Reporter: Chris Oman
>            Priority: Major
>              Labels: replication
>         Attachments: artemis-1017.log, artemis-1019.log
>
>
> We have 2 servers that we have been using to test the failover capabilities. We have encountered a situation where replication has stopped completely, but the master is otherwise working properly.
> When the backup attempts to connect, the master logs the following:
> {code:java}
> 2020-09-01 11:23:31,177 WARN  [org.apache.activemq.artemis.core.server] AMQ222013: Error when trying to start replication: ActiveMQInternalErrorException[errorType=INTERNAL_ERROR message=AMQ229006: journals are not JournalImpl. You can''t set a replicator!]
>         at org.apache.activemq.artemis.core.persistence.impl.journal.JournalStorageManager.startReplication(JournalStorageManager.java:628) [artemis-server-2.11.0.jar:2.11.0]
>         at org.apache.activemq.artemis.core.server.impl.SharedNothingLiveActivation$2.run(SharedNothingLiveActivation.java:178) [artemis-server-2.11.0.jar:2.11.0]
>         at java.lang.Thread.run(Thread.java:748) [rt.jar:1.8.0_201]
> {code}
> How can this happen? Up until this point, replication was working properly through all of the failovers (10+).
> I have attached the log files from both systems.


