You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@cloudstack.apache.org by "sadhu suresh (JIRA)" <ji...@apache.org> on 2014/07/18 20:22:05 UTC
[jira] [Updated] (CLOUDSTACK-7131) RVR: router's reduandant state
shown as unknown(CheckRouterCommand is failing)
[ https://issues.apache.org/jira/browse/CLOUDSTACK-7131?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
sadhu suresh updated CLOUDSTACK-7131:
-------------------------------------
Attachment: management-server.rar
management log
> RVR: router's reduandant state shown as unknown(CheckRouterCommand is failing)
> ------------------------------------------------------------------------------
>
> Key: CLOUDSTACK-7131
> URL: https://issues.apache.org/jira/browse/CLOUDSTACK-7131
> Project: CloudStack
> Issue Type: Bug
> Security Level: Public(Anyone can view this level - this is the default.)
> Components: Management Server
> Affects Versions: 4.5.0
> Reporter: sadhu suresh
> Attachments: management-server.rar
>
>
> 1. advance zone with vmware cluster
> 2. create network offering with RVR enabled
> 3.create a network with above network offering
> 4.deploy a vm using above network
> 5.once redundant router created successfully(master and backup routers)
> 6.reboot the master router
> 7. check the redundant state of the router.
> actual result:
> when we reboot the master ,backup becomes master and rebooted master becomes backup as expected but after sometime(next day whenicheck the logs i am continousily seeing the checkrouter command is failing and both routers redundant state set as unknown)
> Name r-6-VM
> ID 83fe0027-d541-4795-9d0e-7306a068a9f4
> State Running
> Version 4.4.0
> Requires Upgrade No
> Network ID 63e9f4c8-da5c-42ca-93ea-318f56ab8e79
> Public IP Address 10.147.49.185
> Guest IP Address 10.1.1.138
> Link Local IP Address 10.147.41.20
> Host 10.147.40.9
> Compute offering System Offering For Software Router
> Network Domain cs2cloud.internal
> Domain ROOT
> Account admin
> Created 15 Jul 2014 21:21:40
> Redundant Router Yes
> Redundant state UNKNOWN
> VPC ID
> root@r-6-VM:~# cat /ramdisk/rrouter/keepalived.log
> To backup called
> Disable public ip 0
> Password server is not running
> Stopping DNS forwarder and DHCP server: dnsmasq(not running) ... (warning).
> cache internal:
> current active connections: 0
> connections created: 0 failed: 0
> connections updated: 0 failed: 0
> connections destroyed: 0 failed: 0
> cache external:
> current active connections: 0
> connections created: 0 failed: 0
> connections updated: 0 failed: 0
> connections destroyed: 0 failed: 0
> traffic processed:
> 0 Bytes 0 Pckts
> multicast traffic (active device=eth0):
> 16 Bytes sent 0 Bytes recv
> 2 Pckts sent 0 Pckts recv
> 0 Error send 0 Error recv
> message tracking:
> 0 Malformed msgs 0 Lost msgs
> Conntrackd switch to backup done
> Switch conntrackd mode backup 0
> Status: BACKUP
> root@r-7-VM:~# cat /ramdisk/rrouter/keepalived.log
> To backup called
> Disable public ip 0
> Password server is not running
> Stopping DNS forwarder and DHCP server: dnsmasq.
> cache internal:
> current active connections: 0
> connections created: 0 failed: 0
> connections updated: 0 failed: 0
> connections destroyed: 0 failed: 0
> cache external:
> current active connections: 0
> connections created: 0 failed: 0
> connections updated: 0 failed: 0
> connections destroyed: 0 failed: 0
> traffic processed:
> 0 Bytes 0 Pckts
> multicast traffic (active device=eth0):
> 16 Bytes sent 24 Bytes recv
> 1 Pckts sent 2 Pckts recv
> 0 Error send 0 Error recv
> message tracking:
> 0 Malformed msgs 0 Lost msgs
> Conntrackd switch to backup done
> Switch conntrackd mode backup 0
> Status: BACKUP
> To master called
> Password server is not running
> Removed cloud-passwd-srvr iptables rules
> Added cloud-passwd-srvr iptables rules
> 10.1.1.117/24 10.1.1.1/24
> Restarting DNS forwarder and DHCP server: dnsmasq.
> Enable public ip returned 0
> Conntrackd switch to primary done
> Switch conntrackd mode primary returned 0
> ARPING 10.147.49.185 from 10.147.49.185 eth2
> Sent 1 probes (1 broadcast(s))
> Received 0 response(s)
> ARPING 10.147.49.185 from 10.147.49.185 eth2
> Sent 1 probes (1 broadcast(s))
> Received 0 response(s)
> Status: MASTER
> root@r-7-VM:~# ls
> clearUsageRules.sh func.sh hv-kvp-daemon_3.1_amd64.deb monitorServices.py reconfigLB.sh redundant_router
> root@r-7-VM:~# cd /ramdisk/rrouter/
> root@r-7-VM:/ramdisk/rrouter# ls
> arping_gateways.sh check_bumpup.sh disable_pubip.sh fault.sh keepalived.log keepalived.ts2 primary-backup.sh
> backup.sh check_heartbeat.sh enable_pubip.sh heartbeat.sh keepalived.ts master.sh services.sh
> root@r-7-VM:/ramdisk/rrouter#
> content of log:
> **********
> Done with process of VM state report. host: 1
> 2014-07-19 04:55:53,680 ERROR [c.c.u.s.SshHelper] (DirectAgent-467:ctx-4099f7a4 10.147.40.9, job-88, cmd: CheckRouterCommand) SSH execution of command /opt/cloud/bin/checkrouter.sh null has an error status code in return. result output:
> 2014-07-19 04:55:53,686 DEBUG [c.c.h.v.r.VmwareResource] (DirectAgent-467:ctx-4099f7a4 10.147.40.9, job-88, cmd: CheckRouterCommand) checkrouter.sh execution result: false
> 2014-07-19 04:55:53,688 DEBUG [c.c.a.m.DirectAgentAttache] (DirectAgent-467:ctx-4099f7a4) Seq 1-3851140631355196160: Response Received:
> 2014-07-19 04:55:53,692 ERROR [c.c.u.s.SshHelper] (DirectAgent-385:ctx-ebbeb14f 10.147.40.9, job-82, cmd: CheckRouterCommand) SSH execution of command /opt/cloud/bin/checkrouter.sh null has an error status code in return. result output:
> 2014-07-19 04:55:53,702 DEBUG [c.c.h.v.r.VmwareResource] (DirectAgent-385:ctx-ebbeb14f 10.147.40.9, job-82, cmd: CheckRouterCommand) checkrouter.sh execution result: false
> 2014-07-19 04:55:53,706 DEBUG [c.c.a.m.DirectAgentAttache] (DirectAgent-385:ctx-ebbeb14f) Seq 1-3851140631355196172: Response Received:
> 2014-07-19 04:55:53,703 DEBUG [c.c.a.t.Request] (DirectAgent-467:ctx-4099f7a4) Seq 1-3851140631355196160: Processing: { Ans: , MgmtId: 7175246184473, via: 1, Ver: v1, Flags: 10, [{"com.cloud.agent.api.CheckRouterAnswer":{"isBumped":false,"result":false,"details":"","wait":0}}] }
> 2014-07-19 04:55:53,716 DEBUG [c.c.a.t.Request] (DirectAgent-385:ctx-ebbeb14f) Seq 1-3851140631355196172: Processing: { Ans: , MgmtId: 7175246184473, via: 1, Ver: v1, Flags: 10, [{"com.cloud.agent.api.CheckRouterAnswer":{"isBumped":false,"result":false,"details":"","wait":0}}] }
> 2014-07-19 04:55:53,719 DEBUG [c.c.a.m.AgentAttache] (DirectAgent-385:ctx-ebbeb14f) Seq 1-3851140631355196172: Unable to find listener.
> 2014-07-19 04:55:53,723 DEBUG [c.c.a.m.AgentAttache] (DirectAgent-467:ctx-4099f7a4) Seq 1-3851140631355196160: Unable to find listener.
> 2014-07-19 04:55:53,731 DEBUG [c.c.h.v.r.VmwareResource] (DirectAgent-91:ctx-4f8e6182 10.147.40.9, job-82, cmd: CheckRouterCommand) Use router's private IP for SSH control. IP : 10.147.41.20
> 2014-07-19 04:55:53,746 DEBUG [c.c.h.v.r.VmwareResource] (DirectAgent-91:ctx-4f8e6182 10.147.40.9, job-82, cmd: CheckRouterCommand) Run command on VR: 10.147.41.20, script: checkrouter.sh with args: null
> 2014-07-19 04:55:53,751 DEBUG [c.c.h.v.r.VmwareResource] (DirectAgent-176:ctx-95fd6537 10.147.40.9, job-83, cmd: CheckRouterCommand) Use router's private IP for SSH control. IP : 10.147.41.30
> 2014-07-19 04:55:53,759 DEBUG [c.c.h.v.r.VmwareResource] (DirectAgent-176:ctx-95fd6537 10.147.40.9, job-83, cmd: CheckRouterCommand) Run command on VR: 10.147.41.30, script: checkrouter.sh with args: null
--
This message was sent by Atlassian JIRA
(v6.2#6252)