First Encounter of HA Failover

By admin, April 29, 2015 5:12 pm

Found one of the hosts performed HA Failover on its own at midnight today and all its associated VMs were restarted on the other hosts automatically. The exact cause of host restart is still under investigation as there is no hardware failure found at all.

12:17 Host is Not responding to vCenter Ping, so it triggered HA Failover. In fact, vCenter took about 50 seconds to realize the actual host failure.

12:24 It took 7 Minutes for the host to re-join vCenter cluster. In fact the host & ESX restart only took about 5 minutes counting from the boot screen.

12:28 Everything went back to green, the whole process took about 10 minutes. For individual VM, there was 5-6 minutes of downtime unfortunately.

Cluster Level Event Log

cluster

Host Level Event Log

host