3.3 – High Availability and Disaster Recovery Flashcards
- Some servers are active, others are on standby- If an active server fails, the passive server takes its place
Load balancing
UPS
- Uninterruptible Power Supply - Short-term backup power- Blackouts, brownouts, surges
- Load Balancing / Fail Over (LBFO)- Aggregate bandwidth, redundant paths- Becomes more important in the virtual world
Multiple network adapters - Looks like a single adapter - Integrate with switches
NICs talk to each other- Usually multicast instead of broadcast - Fails over when a NIC doesn’t respond
NIC teaming
Measure of how well a computer network can respond to connectivity and performance demands that are placed upon it
Network Availability
- Provide multiple power outlets - Usually in a rack
- Often include monitoring and control - Manage power capacity- Enable or disable individual outlets
Power distribution units (PDUs)
no hardware
no data
no people
cold site
exact replica
duplicate
stocked with hardware
hot site
just enough to get going
in between cold and hot
hardware but no data
warm site
no separate facility
online management
data needs to move but not hardware
cloud site
Two devices are installed and configured - Only one operates at a time
If one device fails, the other takes over- Constant communication between the pair
Configuration and real-time session information is constantly synchronized
Active-passive
- You bought two devices- Use both at the same time
- More complex to design and operate- Data can flow in many different directions - A challenge to manage the flows- Monitoring and controlling data requires
a very good understanding of the underlying infrastructure
Active-active
- Your computer is configured with a single
default gateway- We need a way to provide availability if the
default gateway fails
FHRP (First Hop Redundancy Protocol)
- The default router isn’t real- Devices use a virtual IP for the default gateway- If a router disappears, another one takes its place - Data continues to flow
VRRP (Virtual Router Redundancy Protocol)
- Get up and running quickly- Get back to a particular service level
Recovery time objective (RTO)
Mean time to repair (MTTR)
- Time required to fix the issue