Chapter 9 - Resilience and Physical Security Flashcards
Continuity of Operations
Continuity of Operations means ensuring that operations will continue even if issues ranging from single system failures to wide-scale natural disasters occur.
Geographic Dispersion
Geographic Dispersion of systems ensures that a single disaster, attack, or failure cannot disable or destroy them.
Load Balancing
Load Balancing makes multiple systems or services appear to be a single resource, allowing both redundancy and increased ability to handle loads by distributing them to more than one system.
Clustering
Clustering describes groups of computers connected together to perform the same task. Some examples include providing a front-end for a web application or grouping together worker nodes of a supercomputer.
Platform Diversity
Platform Diversity is a means of building resilience into an infrastructure. Using different vendors, cryptographic solutions, platforms, and controls can make it more difficult for a single attack or failure to have system- or organization-wide impacts.
Redundant Array of Inexpensive Disks (RAID)
RAID is a common solution to storage resiliency that uses multiple disks with data either striped (spread across disks) or mirrored (completely duplicated), and technology to ensure that data is not corrupted or lost (parity). RAID ensures that an array can handle one or more disk failures without losing data.
RAID-0
RAID-0 offers striping, spreading data across all drives in the array. This allows for better I/O performance. This is NOT fault tolerant. All data is lost if a single drive is lost.
RAID-1
RAID-1 offers mirroring, duplicating all data to another drive or drives in the array. This uses twice the amount of storage to ensure that data is still available if a drive fails. This solution also offers high read speeds.
RAID-5
RAID-5 offers both striping and parity. Data is striped across the drives with one of the drives being used for parity (checksum) of the data. This solution can tolerate only a single drive failure at a time. Drive failures can be rebuilt as long as only a single drive fails. Data reads are fast while data writes are slightly slower.
RAID-10
RAID-10 (requires at least four drives) offers both mirroring and striping. This means that data is perfectly duplicated across drives and a drive is also used for parity.
Replication
Replication focuses on using either synchronous or asynchronous methods to copy live data to another location or device. Replication is always occurring as changes are made. Replication helps with multisite, multi-system designs, ensuring that changes are carried over to all systems or clusters that are part of an architecture.
Journaling
Journaling creates a log of changes that can be reapplied if an issue occurs. Journaling is commonly used for databases and similar technologies that combine frequent changes with an ability to restore to a point in time. Journals should be backed up somewhere else as they are simply maintained on the source system.
Snapshot
Snapshots capture the full state of a system or device at the time the backup is completed. Snapshots can be useful to clone systems, to go back in time to a point before a patch or upgrade was installed, or to restore a system state to a point before some other event occurred. Snapshots are common for virtual machines.
Off-Site Storage
Off-Site Storage for backup media, either at a self-owned site or at a location provided by a third-party service provider, is a common form of geographic dispersion/diversity.
Scalability
Scalability is a common design element and a useful response control for many systems in modern environments, where services are designed to scale across many servers instead of requiring a larger server to handle more workload.