Data Stores Flashcards

1
Q

Concepts

What are the three types of data persistence?

A

Persistent, Transient, Ephemeral

2
Q

Concepts

What is a persistent data store?

A

Durable data that sticks around after reboots, restarts, power cycles, etc.

Glacier, RDS

3
Q

Concepts

What is a transient data store?

A

Temporary data that is held briefly and then passed along to another process or to persistent storage
ex. SQS, SNS

4
Q

Concepts

What is an ephemeral data store?

A

A temporary store whose data is lost when the instance or process is stopped.
ex. EC2 Instance store, memcached

5
Q

Concepts

Explain IOPS

A

Input/output operations per second: a measure of how many read and write operations a device can perform each second

6
Q

Concepts

What is throughput?

A

A measure of how much data can be moved per unit of time (for example, MB/s)
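
IOPS and throughput are linked by the size of each I/O operation: throughput is roughly IOPS multiplied by I/O size. The sketch below works through that arithmetic. The function name and the figures are illustrative assumptions, not AWS limits.

```python
def throughput_mib_per_s(iops: int, io_size_kib: int) -> float:
    """Approximate throughput in MiB/s for a given IOPS and I/O size in KiB."""
    return iops * io_size_kib / 1024


# Same IOPS, different I/O sizes, very different throughput.
small_io = throughput_mib_per_s(iops=3000, io_size_kib=16)   # 46.875 MiB/s
large_io = throughput_mib_per_s(iops=3000, io_size_kib=256)  # 750.0 MiB/s
print(small_io, large_io)
```

A workload of many small reads tends to be limited by IOPS, while a workload of large sequential reads tends to be limited by throughput.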

7
Q

Concepts

What are the two types of consistency models in data storage?

A

ACID and BASE

8
Q

Concepts

What is the ACID consistency model?

A

Atomic - All or nothing
Consistent - Transactions are valid
Isolated - Transactions don’t interfere with each other
Durable - Transactions stick around and won’t disappear.
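
To make "all or nothing" concrete, here is a minimal sketch using Python's built-in sqlite3 module. The accounts table, the names, and the transfer helper are hypothetical and exist only for illustration. A transfer that would overdraw an account fails a CHECK constraint, and the whole transaction is rolled back, including the credit that had already been applied.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE accounts ("
    "name TEXT PRIMARY KEY, "
    "balance INTEGER CHECK (balance >= 0))"
)
conn.execute("INSERT INTO accounts VALUES ('alice', 100), ('bob', 0)")
conn.commit()


def transfer(conn, src, dst, amount):
    """Move `amount` from src to dst; return False if the transfer is rejected."""
    try:
        with conn:  # commits on success, rolls back if an exception is raised
            conn.execute(
                "UPDATE accounts SET balance = balance + ? WHERE name = ?",
                (amount, dst),
            )
            conn.execute(
                "UPDATE accounts SET balance = balance - ? WHERE name = ?",
                (amount, src),
            )
        return True
    except sqlite3.IntegrityError:
        return False


# Alice only has 100, so debiting 150 violates the CHECK constraint.
ok = transfer(conn, "alice", "bob", 150)
balances = dict(conn.execute("SELECT name, balance FROM accounts"))
print(ok, balances)  # False {'alice': 100, 'bob': 0}
```

Bob's credit is undone along with Alice's failed debit, so no partial transfer is ever visible.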

9
Q

Concepts

What is the BASE consistency model?

A

Basically available - Favors availability, even if the data returned is stale
Soft-state - Might not be consistent across all stores
Eventual consistency - Will achieve consistency eventually
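
The three BASE properties can be seen in a toy model. The sketch below is a simplified simulation, not how any particular AWS service is implemented. The Store class and its methods are hypothetical. Writes land on a primary copy and reach the replica only when replicate() runs, so a replica read always answers but may be stale.

```python
class Store:
    """Toy primary/replica pair where replication is applied lazily."""

    def __init__(self):
        self.primary = {}
        self.replica = {}
        self.pending = []  # writes not yet copied to the replica

    def write(self, key, value):
        self.primary[key] = value
        self.pending.append((key, value))

    def read_from_replica(self, key):
        # Basically available: always answers, possibly with stale data.
        return self.replica.get(key)

    def replicate(self):
        # Eventual consistency: the replica catches up when this runs.
        for key, value in self.pending:
            self.replica[key] = value
        self.pending.clear()


store = Store()
store.write("price", 10)
store.replicate()
store.write("price", 12)

stale = store.read_from_replica("price")  # 10: replica has not caught up (soft state)
store.replicate()
fresh = store.read_from_replica("price")  # 12: copies have converged
print(stale, fresh)
```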

10
Q

Concepts

What is the benefit of ACID over BASE?

A

Data is always consistent

11
Q

Concepts

What is the benefit of BASE?

A

It scales much better than ACID

12
Q

S3

What kind of Store is S3?

A

Object Store

13
Q

S3

What is the maximum size of an object that can be stored in S3?

A

5TB

14
Q

S3

What is the maximum PUT size when uploading to S3?

A

5GB

15
Q

S3

When is it recommended to use multi-part uploads?

A

When the file size is larger than 100MB
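
A multi-part upload splits one large file into byte ranges that are uploaded as separate parts. The sketch below only plans those ranges and does not call S3. The plan_parts helper and the 100 MiB part size are illustrative assumptions.

```python
MIB = 1024 * 1024


def plan_parts(file_size: int, part_size: int = 100 * MIB):
    """Return (part_number, start_byte, end_byte_inclusive) for each part."""
    parts = []
    start = 0
    part_number = 1
    while start < file_size:
        end = min(start + part_size, file_size) - 1
        parts.append((part_number, start, end))
        start = end + 1
        part_number += 1
    return parts


# A 250 MiB file becomes two full 100 MiB parts plus one 50 MiB part.
parts = plan_parts(250 * MIB)
for part in parts:
    print(part)
```

Because each part is independent, parts can be uploaded in parallel and a failed part can be retried without resending the whole file.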

16
Q

S3

The S3 “path” is not a file path, but what?

ex: s3://mybucket/finance/april/16/invoice_45675.pdf

A

A “key”

This key uniquely identifies the object within the bucket. The record be
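
To show that the "path" is really a bucket name plus a flat key, the sketch below splits the example URI with Python's standard urllib.parse. The split_s3_uri helper is a hypothetical name used for illustration.

```python
from urllib.parse import urlparse


def split_s3_uri(uri: str):
    """Split an s3:// URI into (bucket, key)."""
    parsed = urlparse(uri)
    if parsed.scheme != "s3" or not parsed.netloc:
        raise ValueError(f"not an S3 URI: {uri!r}")
    # Everything after the bucket is one opaque key; the slashes are just
    # characters in the key, not directories.
    return parsed.netloc, parsed.path.lstrip("/")


bucket, key = split_s3_uri("s3://mybucket/finance/april/16/invoice_45675.pdf")
print(bucket)  # mybucket
print(key)     # finance/april/16/invoice_45675.pdf
```

The folder-like structure comes from treating the leading portion of the key, such as finance/april/, as a prefix.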

17
Q

S3

What security measures does S3 provide?

A

Resource-based (object ACL, bucket policy)
User-based (IAM policies)
Optional multi-factor authentication before delete

18
Q

S3

True or False: You can enable versioning in S3?

A

True

S3 will create a new version of the object with each write

19
Q

S3

What is the benefit of S3 versioning?

A

Enables “rollback,” and can be integrated with lifecycle management capabilities

20
Q

True or False: Versioning is not compatible with S3 lifecycle capabilities

A

False

Versioning IS compatible with lifecycle capabilities

21
Q

S3

What is the downside to versioning?

A

Old versions count as billable size until permanently deleted

Ensure you have a lifecycle management policy to control costs

22
Q

S3

True or False: S3 supports cross-region replication?

A

True

23
Q

S3

What are the benefits of cross-region replication?

A

Security Requirements
Compliance Requirements
Latency

24
Q

S3

Why should you consider latency for cross-region replication?

A

Customers accessing data closer to them results in faster response times

25
Q

S3

Tier: S3 Standard

A

Frequently accessed data with redundancy

26
Q

S3

Tier: Standard-IA

A

Long-lived, infrequently accessed data

27
Q

S3

Tier: One Zone-IA

A

Long-lived, infrequently accessed, non-critical data

28
Q

S3

Tier: Reduced Redundancy

A

Frequently accessed, non-critical data

29
Q

S3

Tier: Intelligent Tiering

A

Long-Lived data with changing or unknown access patterns

30
Q

S3

Tier: Glacier

A

Long-Term data archiving

Retrieval times ranging from minutes to hours

31
Q

S3

Tier: Glacier Deep Archive

A

Long-term data archiving

Retrieval times within 12 hours

32
Q

S3

What is intelligent tiering?

A

Automatic moving of data between tiers based on usage

to reduce storage costs

33
Q

S3

Why should someone consider intelligent tiering when it’s more expensive?

A

Depending on the use case, you will make it up in reduced storage costs

34
Q

S3

True or False: Intelligent tiering will automatically archive unused data?

A

False: You have to configure archival policies

35
Q

S3

What are the 3 benefits of S3 lifecycle management?

A

Optimize storage costs
Adhere to data retention policies
Keep S3 buckets well maintained

36
Q

S3

In what two ways can lifecycle management transition your files?

A

Transition between storage classes
Transition to archive

37
Q

S3

What three ways can files be marked as lifecycle managed?

A

Prefixes
Tags
Current version vs previous version
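
The cards above on lifecycle transitions and scoping can be tied together in one rule. The sketch below builds a lifecycle configuration as a plain Python dictionary, in the shape that boto3's put_bucket_lifecycle_configuration is understood to accept. The rule ID, prefix, tag, and day counts are hypothetical values chosen for illustration, and nothing here calls AWS.

```python
import json

lifecycle_configuration = {
    "Rules": [
        {
            "ID": "archive-old-invoices",  # hypothetical rule name
            "Status": "Enabled",
            # Scope the rule by prefix and tag.
            "Filter": {
                "And": {
                    "Prefix": "finance/",
                    "Tags": [{"Key": "retention", "Value": "standard"}],
                }
            },
            # Current versions: move to cheaper classes, then to archive.
            "Transitions": [
                {"Days": 30, "StorageClass": "STANDARD_IA"},
                {"Days": 90, "StorageClass": "GLACIER"},
            ],
            # Previous versions: expire them so they stop counting as billable size.
            "NoncurrentVersionExpiration": {"NoncurrentDays": 365},
        }
    ]
}

print(json.dumps(lifecycle_configuration, indent=2))
```

Handling previous versions separately from current ones is what keeps versioning costs under control.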

38
Q

S3

What services are used with data lakes?

S3 Analytics

A

Athena, Redshift Spectrum, QuickSight

39
Q

S3

What service is used with streaming data?

S3 Analytics

A

Kinesis Firehose

40
Q

S3

What services support AI/ML?

S3 Analytics

A

Rekognition, Lex, MXNet

41
Q

S3

What service supports Storage class analysis?

S3 Analytics

A

S3 Management Analytics
