Data Stores Flashcards

(42 cards)

1
Q

Concepts

What are the three types of data persistence?

A

Persistent, Transient, Ephemeral

2
Q

Concepts

What is a persistent data store?

A

Durable data that sticks around after reboots, restarts, power cycles, etc.

ex. Glacier, RDS

3
Q

Concepts

What is a transient data store?

A

Temporary data that is stored and passed along to another process or storage
ex. SQS, SNS

4
Q

Concepts

What is an ephemeral data store?

A

A temporary store whose data is lost when the instance or process stops.
ex. EC2 Instance store, memcached

5
Q

Concepts

Explain IOPS

A

Input/Output Operations Per Second: a measure of how many read and write operations a device can perform each second

6
Q

Concepts

What is throughput?

A

A measure of how much data can be moved per unit of time (e.g. MB/s)
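
IOPS and throughput are linked by the size of each I/O operation: throughput is roughly IOPS multiplied by the I/O size. A minimal sketch of that arithmetic follows; the function name and the sample numbers are illustrative assumptions, not figures for any real device.

```python
def throughput_mib_per_s(iops: int, io_size_kib: int) -> float:
    """Approximate throughput in MiB/s for a given IOPS rate and I/O size."""
    return iops * io_size_kib / 1024


# Hypothetical device: 3,000 IOPS at 16 KiB per operation.
print(throughput_mib_per_s(3000, 16))   # 46.875
# Same IOPS with larger 256 KiB operations moves far more data.
print(throughput_mib_per_s(3000, 256))  # 750.0
```

The same IOPS figure can therefore mean very different throughput depending on how large each operation is.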

7
Q

Concepts

What are the two types of consistency models in data storage?

A

ACID and BASE

8
Q

Concepts

What is the ACID consistency model?

A

Atomic - All or nothing
Consistent - Transactions leave the data in a valid state
Isolated - Transactions don’t interfere with each other
Durable - Transactions stick around and won’t disappear.
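
The "all or nothing" property can be seen with Python's built-in `sqlite3` module. This is a small sketch under assumptions: the `accounts` table, the `transfer` helper, and the balances are invented for illustration. The `CHECK` constraint makes the second update fail, and the whole transaction, including the first update, is rolled back.

```python
import sqlite3


def transfer(conn: sqlite3.Connection, src: str, dst: str, amount: int) -> bool:
    """Move `amount` between accounts atomically; return True on commit."""
    try:
        with conn:  # commits on success, rolls back if an exception escapes
            conn.execute(
                "UPDATE accounts SET balance = balance + ? WHERE name = ?",
                (amount, dst),
            )
            conn.execute(
                "UPDATE accounts SET balance = balance - ? WHERE name = ?",
                (amount, src),
            )
        return True
    except sqlite3.IntegrityError:
        return False


conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE accounts ("
    "name TEXT PRIMARY KEY, balance INTEGER CHECK (balance >= 0))"
)
conn.executemany("INSERT INTO accounts VALUES (?, ?)", [("alice", 100), ("bob", 0)])
conn.commit()

print(transfer(conn, "alice", "bob", 30))   # True
print(transfer(conn, "alice", "bob", 500))  # False: bob's credit is rolled back too
print(dict(conn.execute("SELECT name, balance FROM accounts")))
```

After the failed transfer the balances are still 70 and 30, so no half-finished transaction is ever visible.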

9
Q

Concepts

What is the BASE consistency model?

A

Basically available - Favors availability, even if the data returned is stale
Soft-state - Might not be consistent across all stores
Eventual consistency - Will achieve consistency eventually
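
The three BASE properties can be pictured with a toy in-memory model. Everything here (the class, its methods, the replica count) is a hypothetical illustration rather than how any real data store is implemented: a write is acknowledged after reaching one replica, reads always answer, and a later sync brings the replicas into agreement.

```python
from collections import deque


class EventuallyConsistentStore:
    """Toy model: a write hits one replica at once and the others later."""

    def __init__(self, replica_count: int) -> None:
        self.replicas = [{} for _ in range(replica_count)]
        self.pending = deque()  # (replica_index, key, value) not yet applied

    def write(self, key: str, value: str) -> None:
        # Acknowledge after updating replica 0 only, so writes stay available.
        self.replicas[0][key] = value
        for index in range(1, len(self.replicas)):
            self.pending.append((index, key, value))

    def read(self, key: str, replica: int):
        # A read always answers, even if this replica is stale.
        return self.replicas[replica].get(key)

    def sync(self) -> None:
        # Drain the replication backlog; afterwards all replicas agree.
        while self.pending:
            index, key, value = self.pending.popleft()
            self.replicas[index][key] = value


store = EventuallyConsistentStore(replica_count=3)
store.write("color", "blue")
print(store.read("color", replica=0))  # blue
print(store.read("color", replica=2))  # None (stale: soft state)
store.sync()
print(store.read("color", replica=2))  # blue (eventually consistent)
```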

10
Q

Concepts

What is the benefit of ACID over BASE?

A

Data is always consistent

11
Q

Concepts

What is the benefit of BASE?

A

It scales much better than ACID

12
Q

S3

What kind of Store is S3?

A

Object Store

13
Q

S3

What is the maximum size of an object that can be stored in S3?

A

5TB

14
Q

S3

What is the maximum PUT size when uploading to S3?

A

5GB

15
Q

S3

When is it recommended to use multi-part uploads?

A

When the file size is larger than 100MB
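
A rough sketch of how the 100MB recommendation might drive an upload decision. The `plan_upload` helper and the 16 MiB default part size are assumptions made for illustration; the 5 MiB minimum part size and the 10,000-part limit are S3's documented multipart constraints.

```python
MIB = 1024 * 1024
MIN_PART_SIZE = 5 * MIB          # S3 minimum for every part except the last
MAX_PARTS = 10_000               # S3 maximum number of parts per upload
MULTIPART_THRESHOLD = 100 * MIB  # recommendation from the card above


def ceil_div(a: int, b: int) -> int:
    """Integer division rounded up."""
    return -(-a // b)


def plan_upload(size_bytes: int, part_size: int = 16 * MIB) -> dict:
    """Return a hypothetical upload plan for an object of `size_bytes`."""
    if size_bytes <= MULTIPART_THRESHOLD:
        return {"multipart": False, "parts": 1, "part_size": size_bytes}
    # Grow the part size if the default would exceed the part-count limit.
    part_size = max(part_size, MIN_PART_SIZE, ceil_div(size_bytes, MAX_PARTS))
    return {
        "multipart": True,
        "parts": ceil_div(size_bytes, part_size),
        "part_size": part_size,
    }


print(plan_upload(50 * MIB)["multipart"])   # False
print(plan_upload(1024 * MIB)["parts"])     # 64
print(plan_upload(1024 * 1024 * MIB)["parts"] <= MAX_PARTS)  # True
```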

16
Q

S3

The S3 “path” is not a file path, but what?

ex: s3://mybucket/finance/april/16/invoice_45675.pdf

A

A “key”

This key uniquely identifies the record in the file store. The record be
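
To make the bucket/key split concrete, here is a small sketch using only the standard library. The `split_s3_uri` helper and the extra sample keys are hypothetical; the point is that everything after the bucket name is one flat key, and "folders" are just shared prefixes.

```python
from urllib.parse import urlparse


def split_s3_uri(uri: str) -> tuple[str, str]:
    """Split an s3:// URI into (bucket, key)."""
    parsed = urlparse(uri)
    if parsed.scheme != "s3" or not parsed.netloc:
        raise ValueError(f"not an S3 URI: {uri!r}")
    # Drop only the single slash that separates the bucket from the key.
    return parsed.netloc, parsed.path[1:]


bucket, key = split_s3_uri("s3://mybucket/finance/april/16/invoice_45675.pdf")
print(bucket)  # mybucket
print(key)     # finance/april/16/invoice_45675.pdf

# Listing a "folder" is a string-prefix match on keys, not a directory walk.
keys = [key, "finance/may/02/invoice_45901.pdf", "hr/handbook.pdf"]
print([k for k in keys if k.startswith("finance/april/")])
```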

17
Q

S3

What security measures does S3 provide?

A

Resource-based (object ACL, bucket policy)
User-based (IAM policies)
Optional multi-factor auth before delete

18
Q

S3

True or False: You can enable versioning in S3?

A

True

S3 will create a new version of the file with each write

19
Q

S3

What is the benefit of S3 versioning?

A

Enables “rollback,” and can be integrated with lifecycle management capabilities

20
Q

True or False: Versioning is not compatible with S3 lifecycle capabilities

A

False

Versioning IS compatible with lifecycle capabilities

21
Q

S3

What is the downside to versioning?

A

Old versions count as billable size until permanently deleted

Ensure you have a lifecycle management policy to control costs
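
The versioning cards above (new version on each write, rollback, old versions still billed) can be sketched with a toy in-memory model. This is not the S3 API: the class, its methods, and the integer version ids are all hypothetical.

```python
class VersionedBucket:
    """Toy in-memory model of a versioned bucket (not the S3 API)."""

    def __init__(self) -> None:
        self.versions: dict[str, list[bytes]] = {}

    def put(self, key: str, body: bytes) -> int:
        # Every write appends a new version instead of overwriting.
        self.versions.setdefault(key, []).append(body)
        return len(self.versions[key]) - 1  # hypothetical version id

    def get(self, key: str, version: int = -1) -> bytes:
        # Default is the newest version.
        return self.versions[key][version]

    def rollback(self, key: str, version: int) -> int:
        # Rolling back writes an old version again as the newest one.
        return self.put(key, self.get(key, version))

    def billable_bytes(self) -> int:
        # Old versions still count toward storage until they are deleted.
        return sum(len(b) for bodies in self.versions.values() for b in bodies)

    def expire_noncurrent(self) -> None:
        # Roughly what a lifecycle rule for previous versions would do.
        for key, bodies in self.versions.items():
            self.versions[key] = bodies[-1:]


bucket = VersionedBucket()
bucket.put("report.txt", b"draft")
bucket.put("report.txt", b"final copy")
bucket.rollback("report.txt", 0)
print(bucket.get("report.txt"))  # b'draft'
print(bucket.billable_bytes())   # 20
bucket.expire_noncurrent()
print(bucket.billable_bytes())   # 5
```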

22
Q

S3

True or False: S3 supports cross-region replication?

A

True

23
Q

S3

What are the benefits of cross-region replication?

A

Security Requirements
Compliance Requirements
Latency

24
Q

S3

Why should you consider latency for cross-region replication?

A

Customers accessing data closer to them results in faster response times

25
# S3 Tier: S3 Standard
Frequently accessed data with redundancy
26
# S3 Tier: Standard-IA
Long-lived, infrequently accessed data
27
# S3 Tier: One Zone-IA
Long-lived, infrequently accessed, non-critical data
28
# S3 Tier: Reduced Redundancy
Frequently accessed, non-critical data
29
# S3 Tier: Intelligent Tiering
Long-lived data with changing or unknown access patterns
30
# S3 Tier: Glacier
Long-term data archiving | Retrieval times ranging from minutes to hours
31
# S3 Tier: Glacier Deep Archive
Long-term data archiving | Retrieval times within 12 hours
32
# S3 What is intelligent tiering?
Automatically moves data between tiers based on usage | To reduce storage costs
33
# S3 Why should someone consider intelligent tiering when it's more expensive?
Depending on the use case, the extra cost can be made up in reduced storage costs
34
# S3 True or False: Intelligent tiering will automatically archive unused data?
False: You have to configure archival policies
35
# S3 What are the 3 benefits of S3 lifecycle management?
Optimize storage costs | Adhere to data retention policies | Keep S3 buckets well maintained
36
# S3 In what two ways can lifecycle management transition your files?
Transition between storage classes | Transition to archive
37
# S3 What three ways can files be marked as lifecycle managed?
Prefixes | Tags | Current version vs previous version
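
The lifecycle cards above can be tied together in one rule. The sketch below builds a rule as a plain dictionary in what is, to the best of my knowledge, the shape boto3's `put_bucket_lifecycle_configuration` accepts; check the current documentation before relying on it. The helper name, the rule ID, the `finance/` prefix, and the day counts are assumptions chosen for illustration.

```python
def build_lifecycle_rule(prefix: str, days_to_ia: int, days_to_glacier: int,
                         noncurrent_days: int) -> dict:
    """Build one lifecycle rule scoped to a key prefix."""
    return {
        "ID": f"archive-{prefix.strip('/')}",  # hypothetical rule name
        "Filter": {"Prefix": prefix},           # which objects the rule manages
        "Status": "Enabled",
        # Transition between storage classes, then to archive.
        "Transitions": [
            {"Days": days_to_ia, "StorageClass": "STANDARD_IA"},
            {"Days": days_to_glacier, "StorageClass": "GLACIER"},
        ],
        # Previous versions are handled separately from the current version.
        "NoncurrentVersionExpiration": {"NoncurrentDays": noncurrent_days},
    }


config = {"Rules": [build_lifecycle_rule("finance/", 30, 90, 60)]}
print(config["Rules"][0]["ID"])  # archive-finance
# With boto3 installed and credentials configured, this dict would be passed as
# LifecycleConfiguration to put_bucket_lifecycle_configuration (not run here).
```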
38
# S3 What services are used with data lakes? | S3 Analytics
Athena, Redshift Spectrum, QuickSight
39
# S3 What service is used with streaming data? | S3 Analytics
Kinesis Firehose
40
# S3 What services support AI/ML? | S3 Analytics
Rekognition, Lex, MXNet
41
# S3 What service supports Storage class analysis? | S3 Analytics
S3 Management Analytics
42