Continuity And Recovery 10 Flashcards
What is business impact analysis, BIA
A preparation step in BCP development that identifies present organizational risks and determines the impact to ongoing, business critical operations and processes of risks actually occur.
Contain vulnerability assessments and evaluations to determine risks and their impact. Should include all phases of the business to ensure a strong business continuation strategy
describe a business continuity plan, BCPs
A policy that defines how an organization will maintain normal day to day business operations in the event of disruption or crisis
Should involve the identification of critical systems and components to ensure that such assets are protected. Also ensures the survival of the organization by preserving key documents, establishing decision making authority, communicating with stakeholders and maintaining financial function. Should address infrastructure issues or fault tolerant systems.
Should be reviewed and tested regularly. The authorized executive should personally sign the plan.
What is maximum tolerable downtime, MTD
The longest period of time that a business outage may occur without causing irrecoverable business failure
Can be a range of minutes to hours for critical functions, 24 hours for urgent functions, 7 days for normal functions, etc
Limits the amount of recovery time to resume operations.
What is recovery point objective, RPO
The point in time, relative to a disaster, where the data recovery process begins.
Often when the last successful backup is performed before a disruptive event occurs
What is recovery time objectives, RTO
The length of time within which normal business operations and activities can be restored following a disturbance.
Includes the necessary recovery time to return to the RPO and reinstate the system and resume processing from its current status. RTO must be achieved before the MTD.
What is mean time to recovery, MTTR
The average time taken for a business to recover from an incident or failure and is an offset of the RTO. If exceeds the given RTO, then the business operations need to switch to the alternate site
Describe a continuity of operations plan and what it includes
The component of the BCP that provides best practices to mitigate risks, and best measures to recover from the impact of an incident.
Effective plans include :
Auditing resources, staff and operational management
Auditing storage facilities, data centers, os, and software and applications
Auditing networks like LAN and WAN including remote access and authentication systems
Analyzing comprehensive risk and vulnerability
Creating data backups, recovery methods, and emergency response procedures
Establishing a process on how to manage operations during a disaster
Name various alternate sites used to restore system functions
A hot site is a fully configured alternate network that can be online quickly after a disaster
A warm site is a location that is dormant or performs non-critical functions under normal conditions but can be rapidly converted to a key operations site if needed
A cold site is a predetermined alternate location where a network can be rebuilt after a disaster.
What is an IT contingency plan
A component of the BCP that specifies alternative IT contingency procedures that can be switched to when an organization is faced with an attack or disruption of service
Can include operating out of an alternate site, using alternate equipment or relocating the main system
Effectiveness depends on key personnel understanding the components of the plan and when and how it should be initiated.
Reviewing the checklist to assure all the aspects are in place
Providing adequate training to exercise the plan
What is a succession plan
Ensures that all key business personnel have designated backups who can perform critical functions when needed.
Describe the business continuity testing methods
Paper testing methods: review plan contents, analyzing the solution, using checklists
Performing walkthroughs: focus on each BCP phase
Parallel testing: test that the systems perform at any alternate offsite facility without taking the main system offline. Simulations effectively test the validity and compliance of the BCP
cutover: this test mimics an actual business disruption by shutting down the original site to test transfer and migration procedures to the alternate site and test operations in an emergency
What is disaster recovery plan, DRP
A plan that prepares an organization to react appropriately in the worst case scenario and provide the means to recover from such a disaster
Safety of personnel is most important concern
Can include:
A list and contact info of individuals responsible for recovery
Inventory of hardware and software
A record of important business and customer info required to continue business
A record of procedural manuals and other critical info such as BCPS and IT contingency plans
Specifications for alternate sites
What is fault tolerance
The ability of a network or system to withstand a foreseeable component failure and continue to provide an acceptable level of service.
Often employ duplication or redundancy of resources to maintain functionality if one component is damaged or fails.
Describe the two redundancy measures
MTTF mean time to failure
The rating that predicts the length of time that a device is expected to be operational
Used to evaluate the reliability of devices or components that are not repaired
MTBF mean time between failures
The rating on devices that predicts the expected time between failures. Based on the MTTF and or MTBF of a system. Must plan for redundancy measures
Describe disks redundancy measures
The redundant array of independent disks, RAID standards for fault tolerant configurations on multiple disk systems. If a disk fails data can be recovered from the remaining disks. RAID can be implemented through os software but is more efficient deployed through hardware based. RAID 0, 1, 5. RAID 0 does not reduce the threat of gloss due to disk failures