Section 12 & 14: Amazon S3 Introduction & Advanced S3 Flashcards

1
Q

What is Amazon S3?

A

Amazon S3 is a simple storage service that allows people to store objects (files) in buckets (directories).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are S3 Buckets?

A

Buckets are highly scalable object (file) storage.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What do S3 buckets consist of?

A

Buckets have a globally unique name, are defined at the region level, file sizes must be between 0 bytes -5 TB, and offer unlimited storage.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

How many AWS buckets per account by default?

A

100 buckets per account by default.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Is an S3 Bucket suitable for installing an OS or DB?

A

No, an S3 bucket is not suitable for installing an OS or DB. Block-based storage is better for this.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What do S3 objects consist of?

A

S3 objects consist of metadata, tags for security / lifecycle, Version ID (if versioning is enabled), and a key value (key = filename, value = data) and sub resources (access control lists and torrents).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What should you do if uploading for than 5 GB to S3?

A

You should use multi-part upload

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is an S3 Object key composed of?

A

Prefic + object name. Example: s3://my-bucket/my_folder1/another_folder/my_file.txt

s3://my-bucket/prefix/object_name

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

At what level is S3 Versioning enabled?

A

S3 Versioning is enabled at the bucket level

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What happens if you upload the same file to S3 twice?

A

The file will be assigned a unique version every time

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Can you disable versioning on an S3 bucket once its enabled?

A

No. Versioning cannot be disabled, it can only be suspended. Suspending versioning does not delete previous versions.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What if you have files that exist prior to versioning being enabled?

A

All existing files prior to the enabling of versioning will receive a version of “null”

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What does S3 Versioning do?

A

S3 Versioning stores all versions of an object (file) including deleted versions.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What happens to the permissions on a newly uploaded version? What about the permissions on old versions?

A

For newly uploaded versions, permissions are automatically reset to private.
For old versions, the existing permissions are retained.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What happens when you delete an object (file) with versioning enabled?

A

Deleting a file with versioning enabled will simply set a delete marker (it won’t actually be deleted). You can restore this version by deleting the delete marker. However, deleting an individual, specific version will actually delete the file version.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What are the four (4) methods of encrypting objects in S3?

A

1) SSE-S3 (Server-Side-Encryption)
2) SSE-KMS (Server-Side-Encryption, AWS Key-Management-System)
3) SSE-C (Server-Side-Encryption, Customer)
4) Client Side Encryption

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

How does SSE-S3 encrypt S3 objects? Main use case?

A

SSE-S3 encrypts S3 objects using keys handled & managed by AWS. Its main use case is encrypting user data on S3.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

How does SSE-KMS encrypt S3 objects? Main advantages?

A

SSE-KMS encrypts S3 objects by leveraging AWS’s Key Management Service (KMS) to manage the encryption keys. KMS provides a customer managed key and an audit trail of who uses the key, and when they used it.

SSE-KMS’s main advantage is user control & audit trail, giving users full control over encryption key rotation policies.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

When should you use SSE-C?

A

SSE-C should be used when you want to manage your own encryption keys, outside of AWS. AWS S3 does not store the encryption key provided in every HTTPS request. SSE-C can only be done via the AWS CLI.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

What is Client Side Encryption?

A

Client Side Encryption is when the encryption occurs on the client side, rather than the server side. As a result, AWS will not know your encryption key.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

How does Client Side Encryption work?

A

Clients must encrypt data themselves before sending it to S3, and decrypt data themselves when retrieving from S3. Customers fully manage the keys and encryption cycle themselves.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

What are the three Encryption Types?

A

1) In transit (uses SSL / TLS)
2) At Rest (server-side)
3) At Rest (Client Side)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

What are the two main types of S3 Security?

A

1) User-based
2) Resource-based

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

Regarding S3 security, by default all buckets are ____

A

By default, all S3 buckets are private.

25
Q

Explain User-based S3 security: How does it work?

A

S3 User-based security is achieved through IAM policies that decide which API calls should be allowed for a specific user. An IAM principal can access an S3 object if their IAM permissions allow it OR the resource policy allows it. Additionally, there must be no explicit DENY

26
Q

Explain Resource-based S3 security: How does it work?

A

Resource based security works by instituting S3 Bucket Policies that apply at the bucket level

27
Q

What is an S3 Bucket Policy?

A

An S3 bucket policy is a bucket-wide rule that is set from the S3 console.

28
Q

What is a good S3 setting to prevent company data leaks from S3?

A

Block Public Access setting. This prevents access outside of your VPC

29
Q

What are the three (3) main uses of an S3 Bucket Policy?

A

1) To Grant public access to a bucket
2) To force objects to be encrypted at upload to the bucket
3) To grant access to another AWS account (cross account)

30
Q

What is the strongly recommended requirement to include for deleting an S3 bucket or bucket objects?

A

MFA (multi-factor-authentication) should be required for deleting buckets and objects in S3

31
Q

What is a Pre-Signed URL?

A

A Pre-Signed URL is a URL that is only valid for a limited time. Users given pre-signed URLs inherit permissions of the person who generated the pre-signed URL.

32
Q

Can S3 host websites?

A

Yes. S3 can host static websites and have them accessible on the public web.

33
Q

How does S3 interact with CORS?

A

If a client does a CORS request on our S3 bucket, the correct CORS headers must be enabled & defined on the cross origin bucket. This is a popular exam question

34
Q

What happens to the permissions in a replicated S3 bucket?

A

Nothing. Replicated buckets maintain the permissions of their original.

35
Q

What happens to the objects in a replicated bucket?

A

A replicated bucket cannot replicate the objects in the original bucket.

36
Q

What are the three (3) use cases for S3 Cross Region Replication (CRR)?

A

1) Compliance
2) Lower latency access
3) Replication across accounts

37
Q

What are the two (2) main use cases for S3 Same Region Replication (SRR)?

A

1) Log aggregation
2) Live replication from production / test accounts

38
Q

Is there S3 replication chaining?

A

No, there is no replication chaining in S3. Bucket A can not replicate to B, which then replicates to C. Instead, bucket A has to replicate to B, and then replicate itself to C.

39
Q

Explain S3 notifications, and one (1) good use case.

A

S3 notifications allow you to receive notifications (self-created or pre-created) when certain events happen in your bucket. Use case: send notification to a destination and trigger an action (like triggering a Lambda)

40
Q

What are the four (4) S3 Storage classes?

A

1) S3 Standard (general purpose)
2) S3 Infrequent Access (IA)
3) S3 One-Zone IA
4) S3 Intelligent Tiering

41
Q

What is the most expensive S3 Storage Class? Why?

A

S3 Standard Purpose, because it has storage across multiple devices, highest durability and availability (99.99%), and is highly resistant to AZ failures.

42
Q

What are two (2) main use cases of S3 Standard Purpose Storage?

A

1) Big Data analytics
2) Content Distribution

43
Q

When should you use S3 Infrequent Access (IA) storage and what is the major use case?

A

You should use S3 Infrequent Access (IA) when data is less frequently accessed, but requires rapid access when it is needed.
Use Case: backups

44
Q

When should you use S3 One Zone IA storage? Major use case?

A

You should use S3 One Zone for lower-cost, infrequently accessed data that doesn’t require multiple AZ resilience.
Use case: data that can recreated & cached.

45
Q

When should you use S3 intelligent Tiering?

A

Mostly when you want automatic cost optimization. Automatically move you to the most cost effect tier without performance impact (switches between S3 standard and IA)

46
Q

What is Amazon Glacier?

A

Amazon Glacier is a low cost object storage meant for backups in S3. Data is retained for the long term (decades). Glacier is for Archives.

47
Q

What on-premise technology is Glacier an alternative for?

A

Glacier is an alternative to on-premises magnetic tape storage.

48
Q

How do you move files from Glacier to S3?

A

You need to restore the files from Glacier before moving them to S3.

49
Q

What are the three (3) Amazon Glacier retrieval options?

A

1) Expedited (1-5 min)
2) Standard (3-5 hrs)
3) Bulk (5-12 hrs)

50
Q

What is the minimum storage duration for items in Glacier?

A

90 days

51
Q

What is the lowest cost Glacier Storage class?

A

Glacier Deep Archive is the lowest cost storage class, for long term storage with minimum storage duration of 180 days.

52
Q

What is the WORM model? Why is it useful?

A

WORM = Write once, read many model.
WORM is useful for meeting regulatory requirements as it prevents objects from being modified for a certain period, or indefinitely. One example is legal retention periods.

53
Q

What is S3 Lifecycle Management?

A

S3 Lifecycle management is the automation of moving objects between storage tiers. For example, if an object hasn’t been used for 30 days, automatically put it to deeper, cheaper storage.

54
Q

How does Amazon determine S3 Pricing?

A

Via Storage by the GB, Number of requests, the tier of Storage Management, Data transfer amounts, and how fast Data transfer is needed.

55
Q

Which is better, S3 Standard Storage or S3 Intelligent Tiering Storage?

A

Intelligent Tiering is better, and should be used for almost all cases (unless working with a massive amount of objects)

56
Q

How can you increase S3 Transfer Acceleration?

A

Increase S3 transfer speed b transferring files to an AWS edge location. This will forward data to your S3 bucket in the target region. This strategy is compatible with multi-part upload.

57
Q

When is it recommended to use Multi-Part Upload to an S3 bucket?

A

It is recommended to use Multi-part Upload for files over 100 MB, require for files over 5 GB. Uploads should be parallelized (speeds up transfers)

58
Q

What are S3 Byte-Range fetches / Parallelized downloads?

A

Parallelized downloads are created by specifying byte ranges for uploads. This limits failures only to specific byte ranges.