Simple Storage Service (S3) Flashcards

1
Q

What is S3?

A

It is object storage (manages data as objects rather than in file systems or data blocks) that is highly scalable and simple to use.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is the maximum amount of storage allowed in S3?

A

The storage is unlimited.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is the maximum and minimum size of a file in S3?

A

S3 objects can be between 0 bytes to a maximum of 5 terabytes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Where are files stored in S3?

A

In buckets (which is essentially like a folder)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Can you use the same bucket name in two different accounts?

A

No, bucket names must be globally unique across all of AWS.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is the format of an S3 url?

A

https://bucket-name.s3.region.amazonaws.com/key-name
https://acloudguru.s3.us-east-1.amazonaws.com/Ralphie.jpg

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is the key of an S3 object?

A

It is the name of the object.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is the value of an S3 object?

A

It is the data itself, which is made up of a sequence of bytes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is the Version ID of an S3 object?

A

It is important for storing multiple versions of the object (when versioning is enabled).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is the Metadata of an S3 object?

A

Data about the data you are storing (e.g. content-type, last-modified, etc.)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

How does AWS protect your data from loss in S3?

A

It is stored across multiple devices and facilities.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is the percentage of availability in S3?

A

It is 99.95% to 99.99%, depending on the S3 tier

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is the percentage of durability of S3?

A

It is designed for 99.999999999% (11-9’s)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is Standard S3 tier?

A

The default version of S3, where it is stored in at least 3 AZs.

It guarantees 99.99% availability and 99.999999999% durability.

It is designed for frequently accessed data and is suitable for most workloads, like websites, content distribution, mobile and gaming applications and big data analytics.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is Lifecycle Management in S3?

A

You define rules to automatically transition objects to a cheaper storage tier or delete objects that are no longer required after a set period of time.

It can be used in conjunction with versioning where it can be applied to current versions or previous versions of objects.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What are the ways you can secure your data in S3?

A
  1. Server-side encryption (default encryption on a bucket to encrypt all new objects when they are stored in the bucket)
  2. Access Control Lists (ACLs) which defines which AWS accounts or groups are granted access and the type of access (and can be attached to individual objects in a bucket)
  3. Bucket Policies (JSON polices applied to a bucket as a whole that are specific to what actions are allowed or denied)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

What type of data consistency can you expect from S3?

A

Strong read-after-write consistency

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

If you successfully upload a file to S3, what is the HTTP status code that is returned?

A

200

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

What is the difference between Access Control Lists and Bucket Policies in S3?

A

Access Control Lists work on an individual object level and bucket polices are bucket-wide and are applied on the entire bucket.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

When you create a bucket in S3, is it by default private or public?

A

It is by default private.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

What can you use to make an individual object in S3 public?

A

Object ACLs

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

What can you use to make an entire bucket public in S3?

A

Bucket policy

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

What type of website can be hosted in S3.

A

Only static content sites

24
Q

What are the five advantages of versioning in S3?

A
  1. All versions of an object are stored (even deleted versions)
  2. Can be a great backup tool
  3. Once enabled, it cannot be disabled
  4. It can be integrated into the lifecycle management rules
  5. It supports MFA
25
Q

What storage class of S3 would you use for static websites?

A

Standard S3

26
Q

What is S3 Standard Infrequently-Accessed (S3 Standard-IA) tier?

A

It is designed for rapid access of data that is accessed less frequently.

You pay to access the data (storage and retrieval fees).

It is a great use case for long-term storage for critical data, backups and as a data store for disaster recovery files.

27
Q

What is S3 Standard One-Zone Infrequently Accessed (IA) tier?

A

Stores data redundantly within a single AZ and costs 20% less than S3 Standard-IA.

This is great for long-lived, infrequently accessed, non-critical data.

28
Q

What is S3 Intelligent Tiering tier?

A

It automatically moves your data to the most cost-effective tier based on how frequently you access each object.

29
Q

What are the S3 storage tiers?

A
  1. S3 Standard
  2. S3 Standard Infrequently Accessed (S3 Standard-IA)
  3. S3 Standard One-Zone Infrequently Accessed (S3 Standard One Zone IA)
  4. S3 Intelligent Tiering
30
Q

What are the three S3 Glacier tiers?

A
31
Q

What is S3 Glacier?

A

It is a cheap storage tier that is used for archiving infrequently accessed data.

32
Q

What are the three options for S3 Glacier?

A
  1. Glacier Instant Retrieval (when you need to be able to instantly access the data)
  2. Glacier Flexible Retrieval (when you do not require instant access of your data, may take minutes up to 12 hours, for backup or disaster recovery use cases)
  3. Glacier Deep Archive (when you retain data sets for 7-10 years for regulatory compliance requirements, where retrieval time is 12 hours and bulk retrieval is 48 hours)
33
Q

What is the highest cost tier for S3?

A

S3 Standard

34
Q

If you want to optimize your costs for S3 for unknown access patterns, what storage tier would you recommend?

A

S3 Intelligent Tiering

35
Q

What storage tiers in S3 charge a retrieval fee?

A
  1. S3 Infrequently Accessed
  2. S3 One-Zone Infrequently Accessed
  3. Glacier Deep Archive
  4. Glacier Flexible Retrieval
  5. Glacier Instant Retrieval
36
Q

How long can you expect to wait to retrieve files from S3 Glacier Deep Archive?

A

12 hours (up to 48 hours)

37
Q

How long can you expect to wait to retrieve files from S3 Glacier Flexible Retrieval?

A

Minutes up to 12 hours

38
Q

What is S3 Object Lock?

A

You use this to store objects in a WORM model (write-once, read many).

It can be used to prevent objects from being deleted or modified for a set period of time or indefinitely.

It can be applied to individual objects or applied across a bucket.

It is used to meet regulatory requirements that require WORM storage.

39
Q

What are the two modes of S3 Object Lock?

A
  1. Governance mode (where users cannot overwrite or delete an object version or alter its setting unless they have special permission)
  2. Compliance mode (no one cal overwrite or delete an object for a set period of time)
40
Q

What is a Legal Hold in S3?

A

it is a feature of S3 Object Lock that allows you to put a hold on deleting or changing a version of an object until the hold is removed (no retention period).

41
Q

What permission do you need to add and remove a Legal Hold from an object in S3?

A

s3:PutObjectLegalHold

42
Q

What is S3 Glacier Vault Lock?

A

It allows you to easily deploy and enforce compliance controls for individual S3 Glacier vaults with a vault lock policy, where you can specify controls such as WORM. Once locked, the policy can no longer be edited.

A way of applying a WORM model to Glacier.

43
Q

If you have a scenario where you need to apply a WORM (write-once, read-many) model to Glacier storage, what feature of S3 would you recommend?

A

S3 Glacier Vault Lock

44
Q

What are the three types of encryption in S3?

A
  1. Encryption in transit (SSL/TLS, HTTPS)
  2. Encryption at rest (server-side encryption)
  3. Encryption at rest (client-side encryption)
45
Q

What are the types of server-side encryption in S3?

A
  1. SSE-S3 (S3 managed keys, using AES 256-bit encryption
  2. SSE-KMS (KMS managed keys)
  3. SSE-C (customer provided keys)
46
Q

What are the two ways of enforcing server-side encryption in S3?

A
  1. Console (select the encryption policy on your bucket)
  2. Bucket Policy (e.g. create a policy that denies any S3 put request that doesn’t include x-amz-server-side-encryption request header)
47
Q

What is an S3 prefix?

A

The folders inside an S3 bucket (e.g. mybucketname/folder1/subfolder1/myfile.jpeg, the prefix would be /folder1/subfolder1)

48
Q

How does S3 prefixes improve performance?

A

Because S3 allows 3,500 PUT/COPY/POST/DELETE and 5,500 GET/HEAD requests per second, per prefix, the more prefixes you have the higher performance you will get.

Effectively spreading your reads across multiple prefixes.

49
Q

How does KMS encryption limit performance in S3?

A

KMS has built-in limits that are region specific, but roughly 5,500, 10,000 or 30,000 requests per second and with each request to put or get a file, you must encrypt or decrypt respectively.

50
Q

If you are asked a scenario where you need to apply encryption in S3, what would you recommend?

A

SS3-S3 managed encryption because it won’t run into KMS limits.

51
Q

When are multipart uploads recommended in S3?

A

For files over 100 MB, but required for files over 5 GB.

52
Q

How does multipart uploads work in S3?

A

It parallelizes uploads to increase efficiency

53
Q

If you are asked a question about how to get better upload performance of large files in S3, what would you recommend?

A

Recommend multipart uploads

54
Q

How does S3 byte-range fetches work?

A

It parallelizes downloads by specifying byte ranges

55
Q

If you are asked a question about how to get download performance of large files in S3, what would you recommend?

A

S3 byte range fetches

56
Q

What is S3 replication?

A

It’s a way of replicating objects from one bucket to another.

57
Q

What are some limitations with S3 replication?

A
  1. Versioning must be enabled on both the source and destination in able to work.
  2. Only newly added files will be replicated, not existing files prior to enabling replication.
  3. Delete markers are replicated by default.