01 - Databases in the Real World Flashcards

Question 1

Q

List

Design Considerations

3

Answer

A

Performance
High Availability
Backup and Recovery

Question 2

Q

List

Performance features required for the workload

4

Answer

A

Required latency
IOPS
Read/write throughput
Concurrency

Question 3

Q

Define

latency

Answer

A

how quickly users need a response
amount of time needed to complete an activity

Question 4

Q

Define

IOPS

Answer

A

Input/output operations per second
How often users are reading and writing data to the database
(e.g. 10 reads per second)

Question 5

Q

Define

concurrency

Answer

A

How many users are active at the same time
How many active users are accessing the same data at the same time?

Question 6

Q

How to improve latency and read/write throughput?

Answer

A

provision more IOPS when configuring the database

Question 7

Q

Differentiate

IOPS vs Throughput

Answer

A

Throughput - measurement of bits or bytes per second that can be processed by a storage device
IOPS - number of read/write operations per second.

Both IOPS and throughput can be used together to describe performance.

Question 8

Q

Define

High Availability

Answer

A

At any point where you try to access the database, it always gives the data needed (non-error response)

Question 9

Q

List

High Availability features required for the workload

3

Answer

A

Read replicas
Clustering
Geo-distributed deployments

Question 10

Q

Define

Read replicas

Answer

A

Create read-only copies
* Updates made to the source database are asynchronously copied to read replicas
* provides scalability
* can be promoted to a standalone database instance

Question 11

Q

Define

Clustering

Answer

A

Some nodes, that you can write and process
for write- and process-heavy workloads.
1 cluster = 1+ compute nodes replicated across multiple Availability Zones
to gain increased read scalability and failover protection.

Question 12

Q

Geo-distributed deployments

Answer

A

for databases across diff places
* If you deploy data in US, you are bound by US laws
* Ex. if theres a threat to natl security, they can look into data servers in the US
* You can read/write from other places, or pwedeng read lang ganun

Question 13

Q

Explain

Backup and Recovery

Answer

A

When disaster happens, need to make sure ur data is still accessible & u don’t lose data
Have multiple copies of data, so that when one copy is ruined, you still have other copies available

Question 14

Q

Define

RPO

Answer

A

returning point objective
getting data when the disaster happens → no data loss

Question 15

Q

Define

RTO

Answer

A

Returning Time Objective
* time it takes to go back to RPO
* Best RTO: as soon as possible

Question 16

Q

List

Workload requirements

3

Answer

Study These Flashcards

A

Data Storage
Data volume, velocity, and variety
Data Usage

Question 17

Q

Data Storage Types

4

Answer

Study These Flashcards

A

File system
Object store
Relational database
Nonrelational database

Question 18

Q

Differentiate

Data volume, velocity, and variety

Answer

Study These Flashcards

A

Data volume: size of individual items being written into workload AND total size of all items within workload.

Data velocity: how fast writes and reads are
* can cause data bottlenecks if your system is not properly tuned.

Data variety: indicator of the type of database or databases you may need for your workload.
* Before: structured data in relational databases; semistructured in nonrelational databases; unstructured data in file system.
* lines are more blurred now