Disaster Recovery and Resilience Flashcards

1
Q

Q: What is Disaster Recovery (DR)?

A

A: A set of processes, tools, and strategies to ensure the recovery of IT systems, applications, and data after a disruptive event.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Q: What does the resilience pillar of the AWS Well-Architected Framework emphasize?

A

A: Best practices for designing systems that can recover from failures and continue to function with minimal impact.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Q: How do AWS Regions and AZs support resilience?

A

A: By providing geographically isolated regions and multiple AZs within each region for high availability and fault tolerance.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Q: What is Recovery Time Objective (RTO)?

A

A: The maximum acceptable downtime after a disaster before systems and applications must be restored.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Q: What is Recovery Point Objective (RPO)?

A

A: The maximum acceptable amount of data loss measured in time, indicating how far back data should be restored.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Q: What are the main DR strategies in AWS?

A
  • Backup and Restore
  • Pilot Light
  • Warm Standby
  • Multi-Site/Active-Active
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Q: What is the Backup and Restore strategy in DR?

A

A: A cost-effective method where data is backed up to AWS and restored during a disaster.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Q: What is the Pilot Light strategy?

A

A: A small version of the application is always running in AWS, which can scale up quickly in a disaster.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Q: What is the Warm Standby strategy?

A

A: A scaled-down version of the application runs continuously and can scale up to full capacity in a disaster.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Q: What is the Multi-Site/Active-Active strategy?

A

A: Fully functional systems running in multiple regions, providing low RTO and RPO.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Q: What is AWS Elastic Disaster Recovery?

A

A: A service that replicates applications to AWS for rapid recovery during a disaster, minimizing downtime and data loss.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Q: What is S3 Cross-Region Replication?

A

A: A feature that automatically replicates S3 objects to a different AWS region for resilience and disaster recovery.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Q: What is AWS Backup?

A

A: A managed service for automating backup and recovery of AWS resources, including EC2, RDS, DynamoDB, and more.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Q: How does RDS Multi-AZ improve resilience?

A

A: By automatically replicating data to a standby instance in another AZ, ensuring high availability and durability.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Q: What is Aurora Global Database?

A

A: A feature that provides cross-region replication with low latency for disaster recovery and global applications.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Q: What are DynamoDB Global Tables?

A

A: A multi-region, fully managed solution for replicating data across regions, providing high availability and resilience.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Q: How does CloudFormation support disaster recovery?

A

A: By automating the deployment of infrastructure, enabling quick recovery of resources in another region.

18
Q

Q: How does Route 53 support disaster recovery?

A

A: By directing traffic to healthy resources using failover routing policies during outages.

19
Q

Q: How does Auto Scaling enhance resilience?

A

A: By automatically adding or removing instances to handle load changes or replace unhealthy instances.

20
Q

Q: What role does ELB play in resilience?

A

A: Distributes traffic across healthy instances in multiple AZs, ensuring availability during instance failures.

21
Q

Q: How does data encryption support disaster recovery?

A

A: By protecting data at rest and in transit using AWS KMS or customer-provided keys to ensure secure backups.

22
Q

Q: What is AWS Global Accelerator, and how does it help resilience?

A

A: A service that routes traffic to the nearest healthy endpoint globally, ensuring low-latency access and automatic failover.

23
Q

Q: Why is testing DR plans important?

A

A: To ensure systems can recover as expected and meet RTO and RPO objectives during an actual disaster.

24
Q

Q: How does Chaos Engineering enhance resilience?

A

A: By intentionally introducing failures into systems to test their ability to recover and improve fault tolerance.

25
Q

Q: What is AWS Fault Injection Simulator?

A

A: A service for running controlled fault experiments to improve application resilience and performance.

26
Q

Q: How does a multi-region architecture support DR?

A

A: By hosting resources in multiple AWS regions, ensuring availability if one region experiences a failure.

27
Q

Q: How does S3 Glacier support disaster recovery?

A

A: By providing low-cost archival storage for long-term backups and data recovery.

28
Q

Q: How does AWS help with DR compliance?

A

A: By offering compliance certifications like SOC, HIPAA, and ISO, and providing tools for audit logging and reporting.

29
Q

Q: What is AWS Config’s role in DR?

A

A: Tracks resource configurations and ensures compliance with DR policies, aiding in recovery verification.

30
Q

Q: What are AWS services for incident response in DR?

A

A: AWS Systems Manager, CloudTrail for auditing, and IAM for access control during a disaster.

31
Q

Q: How does AWS support backup retention policies?

A

A: AWS Backup allows setting lifecycle policies for data retention and automated archival to S3 Glacier.

32
Q

Q: Which AWS tools help with load testing for resilience?

A

A: Amazon CloudWatch, AWS Auto Scaling, and third-party tools like Locust or JMeter.

33
Q

Q: Which services enable real-time monitoring for DR?

A

A: Amazon CloudWatch, AWS X-Ray, and AWS Health Dashboard.

34
Q

Q: What is Business Continuity Planning (BCP)?

A

A: A broader strategy to ensure critical business functions continue during and after a disaster.

35
Q

Q: How is cross-account disaster recovery implemented in AWS?

A

A: By replicating backups or using AWS Organizations to centralize DR planning across multiple accounts.

36
Q

Q: How does Snowball Edge support disaster recovery?

A

A: By enabling data migration and offline storage for environments with limited connectivity.

37
Q

Q: How does AWS Systems Manager help in DR?

A

A: By automating recovery actions, patch management, and configuration updates across resources.

38
Q

Q: What services enable event-driven automation for DR?

A

A: AWS Lambda, Amazon EventBridge, and CloudWatch Events.

39
Q

Q: How can DR costs be optimized in AWS?

A

A: Use on-demand resources only during recovery, leverage S3 Glacier for archival storage, and scale down warm standby environments.

40
Q

Q: What are DR drills, and why are they important?

A

A: Simulated disaster recovery scenarios to test readiness and improve recovery processes.