You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@qpid.apache.org by "Ted Ross (JIRA)" <ji...@apache.org> on 2016/06/08 12:35:21 UTC

[jira] [Commented] (DISPATCH-368) Router in bad state in two inter-connected routers

    [ https://issues.apache.org/jira/browse/DISPATCH-368?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15320460#comment-15320460 ] 

Ted Ross commented on DISPATCH-368:
-----------------------------------

The fix for DISPATCH-364 should also fix this issue.

> Router in bad state in two inter-connected routers
> --------------------------------------------------
>
>                 Key: DISPATCH-368
>                 URL: https://issues.apache.org/jira/browse/DISPATCH-368
>             Project: Qpid Dispatch
>          Issue Type: Bug
>          Components: Routing Engine
>    Affects Versions: 0.6.0
>         Environment: Debian 8.3, Apache Qpid Proton 0.12.2 for drivers and dependencies, Hardware: 2 CPUs, 15 GB RAM, 60 GB HDD on 2 separate machines
>            Reporter: Eric Leu
>
> The setup of two inter-connected routers is the same as in DISPATCH-358. Let the routers be A and B.  Senders connect to A and receivers connect to B. While senders are sending messages to A, restart router B every 10 sec.  Senders check tracker status to make sure messages are accepted by the receivers.  After running for some time, router A is in bad state.   No messages sent are accepted.   After that point, I keep router B up without restarting.  The problem does not go away.  Restart senders and receivers and does not help.  I also notice the error in the log:
> ============================
> 2016-06-07 14:13:39.287550 -0700 AGENT (debug) Add entity: RouterNodeEntity (address=amqp:/_topo/0/Router.A.1, cost=None, id=Router.A.1, instance=1465333922, linkState=[], 
> nextHop=None, routerLink=None, type=org.apache.qpid.dispatch.router.node, validOrigins=None)
> 2016-06-07 14:13:39.294891 -0700 ROUTER (error) Control message error: opcode=HELLO body=
> {'seen': ['Router.A.0'], 'area': '0', 'id': 'Router.A.1', 'instance': 1465333922L}
> Traceback (most recent call last):
>   File "/usr/local/lib/qpid-dispatch/python/qpid_dispatch_internal/router/engine.py", line 137, 
> in handleControlMessage
>     self.hello_protocol.handle_hello(msg, now, link_id, cost)
>   File "/usr/local/lib/qpid-dispatch/python/qpid_dispatch_internal/router/hello.py", line 55, 
> in handle_hello
>     self.node_tracker.neighbor_refresh(msg.id, msg.instance, link_id, cost, now)
>   File "/usr/local/lib/qpid-dispatch/python/qpid_dispatch_internal/router/node.py", line 207, 
> in neighbor_refresh
>     if node.set_link_id(link_id):
>   File "/usr/local/lib/qpid-dispatch/python/qpid_dispatch_internal/router/node.py", line 410, 
> in set_link_id
>     self.adapter.set_link(self.maskbit, link_id)
> OverflowError: signed integer is greater than maximum



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@qpid.apache.org
For additional commands, e-mail: dev-help@qpid.apache.org