3,4 Flashcards
Cyber Resilience
ability to deliver outcomes despite adverse events
Redundancy
having additional systems or processes for continued functionality
High Availability
aims to keep services continuously available by minimizing downtime achieved through load balancing, clustering, redundancy and multi cloud strategies
Uptime
the time a system remains online
Give nines
refers to 99.999 uptime only allowing 5 min of downtime per year
six nines just 31 seconds of downtime per year
Load balancing
distributes workload across resources
Clustering
uses multiple computers, storage devices and network connections as a single systemRe
Redundancy
involves duplicating critical components to increase system reliability
Raid
redundant array of independent disks
combines physical storage devices into a single storage device recognized by the OS
RAID 0
striping
RAID 1
mirroring
RAID 5
striping with parity
RAID 6
similar to 5 but with double parity
RAID 10
combines 1 and 0 mirroring and striping
RAID Resilience Categoires
failure resistant - resists hardware malfunctions through redundancy (raid 1)
fault tolerant - allows continued operation and quick data rebuild in case of failure (raid 1 5 6 and 10)
disaster tolerant- safeguards against catetropic events by maintaining data in indepedent zones raid 1 and 10
Capacity Planning
Critical strategic planning effort for organizations
■ Ensures an organization is prepared to meet future demands in a cost-effective
manner
Aspects of Capacity Planning
People
technology
infrastructure
processes
Surge
sudden small increases in voltage beyond standard level
spike
short lived voltage increases lighting, tripped breakers etc
Sags
brief decrease in voltage enough to cause system shutdown
Undervoltage event
brownout
prolonged reduction leading to system shutdown
Power loss event
blackout
line conditioner
stabilize voltage supply and filter out fluctiations
mitigate surges, sags and undervoltage events
unsuitable for significant undervoltage events or complete power failures
UPS
Uninterruptible Power Supplies (UPS)
● Provide emergency power during power source failures
● Offer line conditioning functions
● Include battery backup to maintain power during short-duration failures
● Typically supply 15 to 60 minutes of power during a complete power failure
Generator
Convert mechanical energy into electrical energy for use in an externalcircuit through the process of electromagnetic induction
● Backup generators supply power during power grid outages
● Smaller generators for limited applications (e.g., emergency lighting)
Types of generators
Portable gas-engine generators
○ Permanently installed generators
○ Battery-inverter generator
PDC
Power distribution center
Central hub for power reception and distribution
● Includes circuit protection, monitoring, and load balancing
● Integrates with UPS and backup generators for seamless transitions during power event
RPO
ecovery Point Objective (RPO)
○ Ensures that the backup plan will maintain the amount of data required to keep any data loss under the organization’s RPO
threshold
Snapshots
Records only changes since the previous snapshot, reducing storage requirement
Point-in-time copies capturing a consistent state
Replication
real time or near real time replication of data in case of loss of system failure
Replication
real time or near real time data copying to maintain data continuity
Journaling
Maintaining a detailed record of data changes over time
COOP
Ensures an organization’s ability to recover from disruptive events or disasters
■ Requires detailed planning and forethought
BC Plan
business continuity planning
Plans and processes for responding to disruptive events
● Addresses a wide range of threats and disruptive incidents
● Involves preventative actions and recovery steps
● Can cover both technical and non-technical disruptions
DRP
disaster recovery plan
Focuses on plans and processes for disaster response
● Subset of the BC Plan
● Focuses on faster recovery after disasters
● Addresses specific events like hurricanes, fires, or flood
Redundant Site
Backup location or facility that can take over essential functions and operations
in case the primary site experiences a failure or disruptio
Hot site
Up and running continuously, enabling a quick switchover
● Requires duplicating all infrastructure and data
● Expensive, but provides instant availabilit
Warm site
Not fully equipped, but fundamentals in place
● Can be up and running within a few days
● Cheaper than hot sites but with a slight dela
Cold Site
Fewer facilities than warm sites
● May be just an empty building, ready in 1-2 months
● Cost-effective but adds more recovery time
Mobile Site
Can be hot, warm, or cold
● Utilizes portable units like trailers or tents
● Offers flexibility and quick deployment (e.g., military DJC2)
Resilience Testing
Assess system’s ability to withstand and adapt to disruptive events
■ Ensures the system can recover from unforeseen incidents
■ Conducted through tabletop exercises, failover tests, simulations, and parallel
processing
■ Helps prepare for events like power loss, natural disasters, ransomware attacks,
and data breache
Recovery Testing
Evaluates the system’s capacity to restore normal operation after a disruptive
even
Tabletop Exercise
Scenario-based discussion among key stakeholders
■ Assess and improve an organization’s preparedness and response
■ No deployment of actual resources
■ Identifies gaps and seams in response plans
■ Promotes team-building among stakeholders
Failover Tests
ontrolled experiment for transitioning from primary to backup components
■ Ensures uninterrupted functionality during disasters
■ Requires more resources and time
■ Validates the effectiveness of disaster recovery plans
■ Can identify and rectify issues in the failover process
Simulations
Computer-generated representation of a real-world scenario
■ Allows for hands-on response actions in a virtual environment
■ Assesses incident responders and system administrators in real-time
■ Helps evaluate reactions and staff performance
■ Provides feedback for learning and improvement
Parallel Processing
Replicates data and system processes onto a secondary system
■ Runs primary and secondary systems concurrently
■ Tests reliability and stability of the secondary setup
■ Ensures no disruption to day-to-day operations
■ Assesses the system’s ability to handle multiple failure scenarios simultaneousl