AWS Reliability Pillar (March 2018) Flashcards

Question 1

Q

Reliability Pillar description

Answer

A

The ability of a system to recover from disruptions, dynamical scale and mitigate disruptions

Question 2

Q

Service Availability Definition

Answer

A

Percentage of time an application is operating normally

Availability = Normal Operations Time / Total Time

Question 3

Q

Availability with hard dependencies

Answer

A

Many systems are dependent on other systems, where a downstream interruption interrupts the upstream system

Question 4

Q

Calculating availability with redundant components

Answer

A

When system uses independent, redundant components (ie AZ’s) the theoretical availability rate is 100% - (product of component failure rates)

Question 5

Q

Calculating dependency availability

Answer

A

Estimate by determining MTBF and MTTR (mean time to recover)

Availability = MTBF / (MTBF + MTTR)

Question 6

Q

Describe costs of high availability

Answer

A

innovation suffers because of need to move slowly

more testing and validation

software and services are more expensive

Question 7

Q

First Step of planning network topology -

Planning IP addressing

Answer

A

Allow IP address space for > 1 VPC per region

Consider cross-account connections (connecting multiple VPCs in the organization)

Within a VPC allow space for multiple subnets that span multiple AZ’s

leave unused CIDR block space in a VPC

Question 8

Q

Second step of planning network topology -

Ensure resiliency of connectivity

Answer

A

How will you provide resiliency from failures

how will you handle misconfigurations that cause outages\

how will you handle unexpected increases in traffic

how will you handle DoS attacks

Question 9

Q

Where is connectivity to a VPC governed?

Answer

A

In route table entries

These all function through the route table:

internet gateay
NAT gateway
virtual private gateway
VPC peering

Question 10

Q

Key services for Network Topology

Answer

A

VPC

also:
Direct Connect
EC2 - run VPN appliances
Route 53 - DNS integrated with ELB helps defend from DoS
ELB - balances across AZ’s, Layer 7 routing, integrates with WAF and auto-scaling
AWS Shield - automatic protection against DDoS
AWS Shield Advanced - protects ELB, CloudFront and Route53 Zones

Question 11

Q

Questions to ask when planning for reliability (how many nines do you really need)?

Answer

A

Note that 5 9’s is typically too expensive to be feasible but possible.

What problems are you trying to solve

what specific aspects of the app require specific levels of availability

what amount of cumulative downtime dan this workload realistically accumulate in a year

In essence, what’s the real impact of the system being unavailable?

Question 12

Q

What’s the goal of decomposing an application into its parts and evaluating reliability requirements for each?

Answer

A

To find the ones that truly require high reliability - to minimize the expense in making things HA that don’t need it

Question 13

Q

Define “Data Plane” and “Control Plane”

Answer

A

Data Plane delivers real time service
ie EC2 instances, RDS databases,

Control Plane configures the environment
ie launching new instances, add/change table metadata

Question 14

Q

Do data planes typically have higher availability requirements than control planes?

Question 15

Q

5 most common ways to improve availability

Answer

A

fault isolation zones

redundant components

micro-service architecture

recovery oriented computing

distributed systems best practices

AWS Reliability Pillar (March 2018) Flashcards

(15 cards)