01 - Databases in the Real World Flashcards

1
Q

List

Design Considerations

3

A
  1. Performance
  2. High Availability
  3. Backup and Recovery
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

List

Performance features required for the workload

4

A
  1. Required latency
  2. IOPS
  3. Read/write throughput
  4. Concurrency
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Define

latency

A
  • how quickly users need a response
  • amount of time needed to complete an activity
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Define

IOPS

A

Input/output operations per second
How often users are reading and writing data to the database
(e.g. 10 reads per second)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Define

concurrency

A
  • How many users are active at the same time
  • How many active users are accessing the same data at the same time?
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

How to improve latency and read/write throughput?

A

provision more IOPS when configuring the database

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Differentiate

IOPS vs Throughput

A
  • Throughput - measurement of bits or bytes per second that can be processed by a storage device
  • IOPS - number of read/write operations per second.

Both IOPS and throughput can be used together to describe performance.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Define

High Availability

A

At any point where you try to access the database, it always gives the data needed (non-error response)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

List

High Availability features required for the workload

3

A
  1. Read replicas
  2. Clustering
  3. Geo-distributed deployments
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Define

Read replicas

A

Create read-only copies
* Updates made to the source database are asynchronously copied to read replicas
* provides scalability
* can be promoted to a standalone database instance

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Define

Clustering

A
  • Some nodes, that you can write and process
  • for write- and process-heavy workloads.
  • 1 cluster = 1+ compute nodes replicated across multiple Availability Zones
  • to gain increased read scalability and failover protection.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Geo-distributed deployments

A

for databases across diff places
* If you deploy data in US, you are bound by US laws
* Ex. if theres a threat to natl security, they can look into data servers in the US
* You can read/write from other places, or pwedeng read lang ganun

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Explain

Backup and Recovery

A
  • When disaster happens, need to make sure ur data is still accessible & u don’t lose data
  • Have multiple copies of data, so that when one copy is ruined, you still have other copies available
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Define

RPO

A

returning point objective
getting data when the disaster happens → no data loss

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Define

RTO

A

Returning Time Objective
* time it takes to go back to RPO
* Best RTO: as soon as possible

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

List

Workload requirements

3

A
  1. Data Storage
  2. Data volume, velocity, and variety
  3. Data Usage
17
Q

Data Storage Types

4

A
  1. File system
  2. Object store
  3. Relational database
  4. Nonrelational database
18
Q

Differentiate

Data volume, velocity, and variety

A

Data volume: size of individual items being written into workload AND total size of all items within workload.

Data velocity: how fast writes and reads are
* can cause data bottlenecks if your system is not properly tuned.

Data variety: indicator of the type of database or databases you may need for your workload.
* Before: structured data in relational databases; semistructured in nonrelational databases; unstructured data in file system.
* lines are more blurred now

19
Q

How will the data in your workload be used?

5

A

SQL data organization
* OLTP or OLAP
* DSS
* Data warehouse

NoSQL access patterns
* IoT
* Session state

20
Q

T/F

In an ideal scenario, the application server communicates with the database server over a public network.

A

F
private network

21
Q
A