HA basics
Requirements;
- shared storage
- Common networks
- Ideally similar (or identical) hardware for each host
A good way to check that all hosts have access to the same networks and datastores is to use the ‘Maps’ feature. Select your cluster then deselect every option except ‘Host to Network’ or ‘Host to Datastore;
As you can see in this diagram the ’15 VLAN’ portgroup is not presented to every host (it’s slightly removed from the circle) and at least one VM in the cluster has a network assigned (in the top right) which isn’t available in this cluster at all.
Clusters consist of up to 32 hosts. The first five hosts in a cluster will be primaries, the rest secondaries. You can’t set a host to primary or secondary using the VI client, but you can using the AAM CLI (not supported, see how in this Yellow bricks article). One of the primaries will be the ‘active primary’ which collates resource information and places VMs after a failover event.
Heartbeat options and dependencies
Heartbeats are used to determine whether a host is still operational
Heartbeats use the service console networks by default, or the management network for ESXi hosts.
They’re sent every second by default. Can be amended using das.failuredetectioninterval
Primaries send heartbeats to both other primaries and secondaries, secondaries only send to primaries.
After no heartbeats have been received for 13 seconds the host will ping its isolation address.
HA operates even when vCentre is down (the AAM agent talks directly from host to host), although vCentre is required when first enabling HA on a cluster.
Diagnosing issues with heartbeats – see VMware KB1010991