XenServer stay in Disabled state after rebooting, and after a while it got disconnected from pool.
Try to enable host by CLI, it reports "Pool Master is unreachable"
[root@test-xs ~]# xe host-enable host=test-xs
Seen from /var/log/xensource.log like below:
- Mar 16 11:28:48 test-xs xapi: [debug||0 |bringing up management interface D:b81ee3b0d9e3|xapi] Management IP address is: 192.168.172.2
- Mar 16 11:28:48 test-xs xapi: [error||0 |bringing up management interface D:b81ee3b0d9e3|master_connection] Caught Master_connection.Goto_handler
- Mar 16 11:28:48 test-xs xapi: [debug||11 ||dummytaskhelper] task dom0 networking update D:0980145a5128 created by task D:d04c84090a7b
- Mar 16 11:28:48 test-xs xapi: [debug||0 |bringing up management interface D:b81ee3b0d9e3|master_connection] Connection to master died. I will continue to retry indefinitely (supressing future logging of this message).
- Mar 16 11:28:48 test-xs xapi: [debug||11 |dom0 networking update D:0980145a5128|xapi_mgmt_iface] Checking to see if hostname or management IP has changed
- Mar 16 11:28:48 test-xs xapi: [error||0 |bringing up management interface D:b81ee3b0d9e3|master_connection] Connection to master died. I will continue to retry indefinitely (supressing future logging of this message).
Verify eth1 is working by shutdown interface eth0, ping is available and host is still online:
[root@test-xs ~]# ip link set eth0 down
For Active/Passive bond:
[root@test-xs ~]# ovs-appctl bond/show
for LACP bond:
[root@test-xs ~]# ovs-appctl lacp/show
Verify eth0 is working by shutdown interface eth1, ping is available and host is still online:
[root@test-xs ~]# ip link set eth0 up
[root@test-xs ~]# ip link set eth1 down
For Active/Passive bond:
[root@test-xs ~]# ovs-appctl bond/show
For LACP bond:
[root@test-xs ~]# ovs-appctl lacp/show
Check ls -lh /var/xapi/state.db has over
35MBBy using tcpdump that captures traffic between slave and master hosts, we can see there are many
TCP ZeroWindow indicating master sends packet too fast:
17:29:31.300871 0.003255 192.168.172.2 192.168.172.1 TCP [TCP ZeroWindow] 48742 > https [ACK] Seq=677 Ack=8135441 Win=0 Len=0 TSV=2591394380 TSER=195336920617:29:31.301006 0.000135 192.168.172.2 192.168.172.1 TCP [TCP Window Update] 48742 > https [ACK] Seq=677 Ack=8135441 Win=85760 Len=0 TSV=2591394381 TSER=1953369206