6. Databases Flashcards

Question 1

Q

What is RDS and its 6 engines?

Answer

A

Relational Database Service. Incorporates SQL Server, ORACLE, MySQL, PostgreSQL, MariaDB and Amazon Aurora

Question 2

Q

What is and is not RDS good for?

Answer

A

Good for Online Transaction Processing, e.g. payments
Not good for Online Analytical Processing (OLTS) -> use Redshift instead

Question 3

Q

What would you use for RDS disaster recovery?
Does it apply to all engines?
How does it work?

Answer

A

Multi-AZ. Multi-AZs are created by default.
All except Amazon Aurora, which does not need it because it is distributed by definition.
The primary is automatically replicated to Standby. The DB connection string is a URL address. Because Amazon handles DNS failover automatically, it will detect the downtime and automatically switch to the replica.

Question 4

Q

What would you use for RDS High Availability?
How does it work?
What do you also have to do to use it?

Answer

A

Amazon RDS Read Replicas
Up to 5 replicas can be created from the AWS console. Can be across different Regions and AZs
Enable automatic backups

Question 5

Q

What is Amazon Aurora?
What are its important qualities?

Answer

A

Amazon-proprietory RELATIONAL database, compatible with MySQL and PostgreSQL
- a mix of high performance, scalability and availability
- 6 copies of data: minimum 3 AZs, 2 copies each
- Starts with 10GB, automatically scaled in 10GB increments up to 128TB

Question 6

Q

What are Amazon Aurora’s Replica types?

Answer

A

Aurora Replica: up to 15 read replicas ALSO: automated failover, self-healing (2 write/3 read)
MySQL Replica: up to 5 read replicas, slower replication time, cross-Region, manual failover. ALSO: supports user-defined replication delay and different data schemas between primary and secondary
PostgreSQL Replica: up to 5 read replicas

Question 7

Q

What about Amazon Aurora’s backups and snapshots

Answer

A

Backups are automated, and always enabled; snapshots can be shared with other AWS accounts.

Question 8

Q

What would you do if you needed Aurora, but you have intermittent, infrequent or unpredictable workloads?

Answer

A

Use Amazon Aurora Serverless - on-demand, auto-scaling configuration. DB cluster automatically starts up, shuts down and scales.

Question 9

Q

What are the main qualities of DynamoDB?

Answer

A

NoSQL database; supports Document and Key-Value models
Stored on SSD, spreads across 3 geographically distinct data centres
Can be eventually consistent (~1sec) OR strongly consistent

Question 10

Q

When would you use
1. Aurora?
2. DynamoDB?
3. DAX?

Answer

A

You need a relational database compatible with MySQL or PostgreSQL
You need documents on the KV database for mobile, web, gaming, IoT
Improve the read performance of DynamoDB

Question 11

Q

When would you use
1. Aurora?
2. DynamoDB?

Answer

A

You need a relational database compatible with MySQL or PostgreSQL
You need document on KV database for mobile, web, gaming, IoT

Question 12

Q

What is DAX?

Answer

A

DynamoDB Accelerator. Fully managed highly available in-mem cache.
10x performance improvements - request time in microseconds
Application only needs to connect to DAX
Pay-per-request pricing; BUT you pay more than with the provisioned capacity

Question 13

Q

What is DAX?
What are its qualities?

Answer

A

DynamoDB Accelerator. Fully managed highly available in-mem cache.
10x performance improvements - request time in microseconds
Application only needs to connect to DAX
Pay-per request pricing; BUT you pay more than with the provisioned capacity

Question 14

Q

How do you deliver ACID in DynamoDB?
Why would you need it?

Answer

A

Use DynamoDB Transactions
Financial transactions; fulfilling orders; multiplayer game engines; distributed processes;

Question 15

Q

What are DynamoDB Transactions options?

Answer

A

Read: Eventual Consistency, Strong Consistency and Transactional
Write: Standard and Transactional

Question 16

Q

What is the concerns/trade-offs of DynamoDB Transactional Consistency?

Answer

A

Transaction size is 100 items or 4MB of data
Twice the cost, because DynamoDb needs to do “prepare” and “commit” for every item

Question 17

Q

How do you secure DynamoDB?

Answer

A

Encryption at REST using KMS
Site-to-site VPN
Direct Connect
IAM Policies and Roles
Fine-grained access
Integrates with CloudWatch and CloudTail

Question 18

Q

How do you ensure the durability of DynamoDB data?

Answer

A

DynamoDB backups: on-demand (same region), consistent with seconds and retained until deleted
Point-in-time Recovery: protects against accidental writes and deletes; 5 minutes in the past, restore to any point in the last 25 days (incremental backups); needs to be manually enabled

Question 19

Q

How do we log DynamoDB data changes?

Answer

A

DynamoDB Streams:
- FIFO, time ordered
- streams broken into shards
- retained for 24 hours

Question 20

Q

How do we do multi-Region replication on DynamoDB?
What do we need it for?

Answer

A

Global Tables:
- based on DynamoDB Streams
- Can be enabled from ASW console: Table -> Global Tables -> Create Replica
- any Region
- Replication latency under 1 sec
If we have globally distributed applications; DR or HA

Question 21

Q

How to run managed MongoDB cluster in AWS?
How to run managed Apache Cassandra in AWS?
How to migrate the above?

Answer

A

Use Amazon DocumentDB
Use Amazon Keyspaces
Use AWS Database Migration Service

Question 22

Q

How to implement a Graph database in AWS and why

Answer

A

Use Amazon Neptune for:
- identity graphs: social graphs, targeting, personalization, analytics
- knowledge graph applications: add topical data to product catalogues
- detect fraud patterns
- security graphs: visual infrastructure to plan, predict and mitigate risk

Question 23

Q

What is time-series data?
Why would you need it?
How to store it on Amazon?

Answer

A

Data points logged over a period of time.
Need to store large amounts of data for analysis. Examples:
- temperature sensors
- web traffic analytics
- DevOps application monitoring
Amazon Timestreams
- trillions of events per day
- 1,000x faster and 1/10th of const of relational databases

Question 24

Q

Why would you need Ledger Database?
How would you implement it in AWS?

Answer

A

1:
- store financial transactions
- reconcile supply chain systems
- maintain claims history
- centralise digital records
why: has cryptographically verifiable transaction log BUT: owned by ONE authority (I assume, not distributed)

Amazon Quantum Ledger Database (QLDB)