Architecting to scale Flashcards

Question

How long is the default cooldown period?

Answer 1

300 seconds

Answer 2

Dynamic scaling

Answer 3

Sanity check to see if adding the resource was enough to absorb the load.

Answer 4

1) Target tracking policy- initiates scaling to try and track as closely as possible to a given metric 2) Step scaling policy- Based on a metric it adjusts capacity to a given defined threshold 3) Scheduled scaling policy- Initiated scaling events based on a pre-defined time, day or date

Answer 5

I want my ECS (container) hosts to stay at or below 70% CPU utilization

Answer 6

I want to increase my EC2 spot fleet by 20% everytime I add another 10,000 connections on my ELB

Answer 7

Every Moday at 08:00 I want to increase the read capacity units of my DynmoDB table to 20,000

Answer 8

Shard is the base throughput unit of an Amazon Kinesis data stream. One shard provides a capacity of 1MB/sec data input and 2MB/sec data output.

Answer 9

1) partition key 2) sequence (order of the shard in a sequence) 3) data

Answer 10

1) Throughput- Read capacity units and write capacity units | 2) Size- Max size is 400KB but it can scale as you can store as many as you like

Answer 11

A physical space where DynamoDB is stored

Answer 12

A unique identifier for each record sometimes called a hash key

Answer 13

An optional key that defines storage order on the partition

Answer 14

DynamoDB adds additional partitions to scale out

Answer 15

You work out how many partitions you need by capacity (how many RCU and WRU you have provisioned!) and the size. Then take the MAX of the largest dimension and round up to get the total number of partitions

Answer 16

(total RCU/3000) + (total WCU/1000)

Answer 17

Total size in GB/10GB

Answer 18

Splits equally across partitions

Answer 19

The data is divided down the middle and creates another partition based on the partition key hash. This will keep happening to scale out.

Answer 20

When read and writes are concentrated in the same partition. For example, if you used a date as a partition key and store lots of different data under the same date, when querying this data it will be accessing the same partition over and over...

Answer 21

Choose a different variable for a partition key. e.g. one that is not date and is by sensor type for example and use date as a sort key

Answer 22

It will not scale down, there are some work around like sending dummy requests at reducing frequency or reducing the max capacity to equal the min capacity

Answer 23

Like a copy of the table

Answer 24

Using a 'on-demand' setting for DynamoDB, costs more! but is useful when you are not sure if an app will be super popular

Answer 25

An in-memory cache that sits in front of your table

Answer 26

When you require the fastest possible reads from a database, such as live auctions or securities trading or read intense scenarios where you want to offload the reads from DynamoDB Repeated reads against a large set of dynamoDB data

Answer 27

1) write intense applications that don't have many reads | 2) Applications where you use client caching methods

Answer 28

Static and dynamic content

Answer 29

Delievered using HTTP cookies forwarded from your origin

Answer 30

HTTP and HTTPS

Answer 31

S3, EC2, ELB or another webserver

Answer 32

You can use behaviours to configure serving up origin content based on URL paths. This will route user to different content origins based on a URL path e.g. wp-content/* static content wp-admin/ directs to ELB...

Answer 33

A way of invalidating a CloudFront cache

Answer 34

1) simply delete the file from the origin and wait for the TTL (time to live) to expire 2) Use the AWS console to request invalidation for all content or a specific path such as /images/* 3) Use the CloudFront API to submit an invalidation request 4) Use a 3rd party tool to perform a CloudFront invalidation e.g. cloudberry, ylastic....

Answer 35

Yes A domain without a www. or subdomain in front

Answer 36

Yes, you can whitelist (show) or blacklist (block) content based on location

Answer 37

Simple Notification Service. Scalable hosted a queuing service. Is integrated with KMS for encryptedmessaging

Answer 38

Transient. 4 days default, max 14 days

Answer 39

256KB or 2GB using the SDK

Answer 40

Allows the creation of a loosely coupled architecture

Answer 41

Standard- No assurances that a message will enter and leave the queue based on the order they arrived FIFO- Will maintain the order of the queue

Answer 42

There is a risk that order will be lost for the process

Answer 43

If a message fails it will hold all the other messages behind it- causes delay or latency

Answer 44

A implementation of ApacheMQ. A message broker. Usually used to replace on-prem message broker.

Answer 45

Where a lambda function call sets of multiple lambda calls in parallel

Answer 46

An opensource framework for building a serverless app on AWS

Answer 47

1) create your YAML file 2) convert this to a CloudFormation 3) Creates AWS infrastructure

Answer 48

1) uses YAML for templates 2) Purpose built to help make developing serverless apps as efficient as possible 3) Generates CloudFormation scripts

Answer 49

1) uses YAML for templates 2) Purpose built to help make developing and deploying serverless apps 3) Generats Cloud formation scripts 4) Supports many other cloud providers such as Azure...

Answer 50

Designed to link a variety of AWS and 3rd party apps | e.g. integrate ZenDesk with your application

Answer 51

Creates a distributed asynchronous system workflows. It support sequential as well as parallel workflows. Activity worker and a Decider worker

Answer 52

Best suited for human-enabled workflows like order fulfilment or procedure requests

Answer 53

Step function

Answer 54

A way to manage workflows. An orchestration platform. You define you app as a state machine. Each object can assume a different state throughout a process. Creates tasks, sequential steps, parallel steps etc...

Answer 55

A management tool for creating and executing batch orientated tasks using EC2 instances

Answer 56

1) Create a compute environment 2) Create a job queue with the priority assigned to a compute environment 3) Create a job description, script or JSON, env vars, IAM roles e.t.c. 4) Schedule the job

Answer 57

out of the box coordination of an AWS service component use case- order processing flows

Answer 58

When you need to support external processes or specialised execution logic use case- loan application process with manual review steps

Answer 59

Messaging queue store and forward patterns use case- image resize process

Answer 60

Scheduled or re-occurring tasks that do not require heavy logic use case- Rotate logs daily on firewall appliance

Answer 61

Designed for big data processing and analysis. It is comprised of a hadoop framework. It is a collection of services to process large data sets. "The Zoo"

Answer 62

A tool used for distributed processing

Answer 63

A Hadoop distributed file system. A persistent data store

Answer 64

A tool to ensure resources are coordinated in a hadoop framework

Answer 65

Hadoop workflow framework

Answer 66

A hadoop scripting framework

Answer 67

A SQL interface into a hadoop landascape

Answer 68

A machine learning component in the hadoop landscape

Answer 69

A columnar database for storing hadoop data

Answer 70

A log collection system for a hadoop landscape

Answer 71

Facilitates input of data from other data stores into a hadoop landscape

Answer 72

A tool used to manage and monitor a hadoop landscape

Answer 73

When an application/software is developed from scrap

Answer 74

When an application/software is developed or built from an existing program?

Answer 75

1) size of the table 2) Number of RCUs 3) Number of WRUs

Answer 76

Reduce the cooldown time to allow scaling to be more dramatic and responsive

Answer 77

Kinesis Firehose

Answer 78

1) They can enable real-time reporting and analysis of streamed data 2) They can accept data as soon as it has been produced with out the need for batching

Answer 79

Dynamic based on a metric like connections or CPU. If using scheduled you would be scaling even when there is no spike.

Answer 80

More atomic functional units

Answer 81

A method of reading data from a shard

Answer 82

Use one table per application period. If all time series data in table the last partition would get all the read and write actions General DynamoDB best practice is to keep the number of tables to a minimum.

Architecting to scale Flashcards

(109 cards)