Amazon S3 Flashcards

Question 1

Q

What are objects in S3?

Answer

A

Objects in S3 are file-like entities that contain data. They represent data and not infrastructure, which is what S3 buckets are.
Objects are stored in buckets.

Question 2

Q

What is S3 (Simple Storage Service)?

Answer

A

S3 is an object-based storage service that is kind of unlimited and serverless, meaning the underlying infrastructure is managed by AWS. The S3 Console provides an interface to upload or access data.

Question 3

Q

What does an S3 object consist of?

Answer

A

An S3 object may consist of:

Key: name of the object
Value: data stored
Version ID: version of the object (if versioning is enabled)
Metadata: additional information

Question 4

Q

What are Etags in S3?

Answer

A

Etags are entity tags used for detecting whether a change to a file has been made without downloading. They can also be used for checking data integrity and are typically represented by an MD5 hash.

Question 5

Q

When are Etags returned in S3?

Answer

A

Etags are returned on:

PUT: upload, including multipart or copy
GET: download, list (ETag is not included in the response body when listing objects)
HEAD: fetching metadata without downloading the file

Question 6

Q

How can Etags be used in combination with conditional requests?

Answer

A

Etags can be used for caching with If-None-Match and for synchronization with If-Match.

Question 7

Q

What is the purpose of checksums in S3?

Answer

A

Checksums are used to ensure the data hasn’t become corrupted in transit.

Question 8

Q

What are S3 Object prefixes?

Answer

A

S3 Object prefixes are part of the object key name. They help organize, group, and filter data.

Question 9

Q

What are S3 buckets?

Answer

A

Buckets hold objects or folders (which are not true folders) that store objects. Each bucket must have a unique name, is region-specific, and represents infrastructure.

Question 10

Q

What are the key rules for S3 bucket naming?

Answer

A

Length: 3-63 characters long
Characters: lowercase letters, numbers, dots (.), and hyphens (-)
Start and End: must begin and end with a letter or number
No adjacent periods
Cannot be formatted as IP addresses
No uppercase letters, underscores, spaces, or special characters like “@” or “$”

Question 11

Q

What are the S3 bucket restrictions and limitations?

Answer

A

Up to 100 buckets, 1000 after a service request
Buckets must be empty before deletion
No max bucket size or limit to the number of objects
Files must be between 0 and 5 TBs (multipart upload recommended for files over 100MB)
Specific limits for S3 on AWS Outposts

Question 12

Q

What are the two types of S3 buckets?

Hallmarks in comparison.

Answer

A

General Purpose: flat hierarchy, all storage classes except S3 Express One Zone, recommended for most use cases, no prefix limits, 100 per account
Directory: folder hierarchy, only S3 Express One Zone storage type, recommended for single-digit millisecond performance on PUT and GET, no prefix limits, 10 per account

Question 13

Q

What are the characteristics of S3 general purpose bucket folders?

Answer

A

Do not have true folders
Creating a folder creates a zero-byte object ending in a forward slash (e.g., myfolder/)
Files in a folder have names appended with the folder prefix

Question 14

Q

What is unique about S3 folders?

Answer

A

They are S3 objects and not independent entities
Do not include permissions or metadata
Can’t be empty or full
Aren’t moved; objects with the same prefix are renamed when moved

Question 15

Q

What happens to the prefix when moving an S3 object to another folder?

Answer

A

Only the prefix is changed; the object itself is not physically moved.

Question 16

Q

What is metadata in S3?

Answer

A

Metadata provides information about other data but not the content itself. It is useful for categorizing, organizing data, and providing context about data.

Question 17

Q

When can we attach metadata to S3 Objects?

Answer

A

Metadata can be attached to S3 Objects at any time.

Question 18

Q

What are the types of metadata in S3?

Answer

A

Metadata can be either system-defined or user-defined.

Question 19

Q

Who sets system-defined metadata in S3?

Answer

A

System-defined metadata is set by Amazon (with some exceptions).

Question 20

Q

How must user-defined metadata be formatted in S3?

Answer

A

User-defined metadata must begin with “x-amz-meta-“. When using the AWS CLI, it should be set as “key=value”, which will automatically be appended with the “x-amz-meta-“ prefix.

Question 21

Q

What does WORM stand for and what does it mean?

Answer

A

WORM stands for Write Once Read Many, meaning the data is immutable and cannot be modified or deleted.

Question 22

Q

What is Object Lock in S3?

Answer

A

Object Lock prevents deletions of objects in a bucket. It can only be enabled on bucket creation and is useful for data integrity and regulatory compliance.

Question 23

Q

What are the two types of retention in Object Lock?

Answer

A

Retention period: fixed time
Legal hold: until removed

Question 24

Q

What are the two request styles in S3?

Answer

A

Virtual hosted-style requests: the bucket name is a subdomain on the host.
Path-style requests: the bucket name is in the request path.

Question 25

Q

What will happen to path-style requests in S3?

Answer

A

Path-style requests will be discontinued, and some features work only with virtual hosted-style requests.

Question 26

Q

What is S3 Standard storage class?

Answer

A

S3 Standard is the default storage class, designed for general-purpose storage for frequently accessed data.

Question 27

Q

What are the key features of S3 Standard?
How durable it is etc.

Answer

A

High Durability: 11 9’s of durability (99.999999999%)
High Availability: 4 9’s of availability (99.99%)
Data Redundancy: Data stored in 3 or more AZs
Retrieval Time: within milliseconds
High Throughput: optimized for frequently accessed and/or real-time access data
Scalability: easily scales to storage size and number of requests
No retrieval fee
No minimum storage duration charge

Question 28

Q

What is S3 Reduced Redundancy Storage (RRS)?

Answer

A

S3 Reduced Redundancy Storage is a legacy storage class for non-critical reproducible data with lower redundancy than S3 Standard. It provides no cost-benefit and is no longer cost-effective but still available for legacy customers.

Question 29

Q

What are the S3 storage classes sorted by price from highest to lowest?

Answer

A

S3 Standard
S3 Intelligent Tiering
S3 Express One-Zone
S3 Standard-IA (Infrequent Access)
S3 One-Zone-IA
S3 Glacier Instant Retrieval
S3 Glacier Flexible Retrieval
S3 Glacier Deep Archive

Question 30

Q

What is unique about S3 Intelligent Tiering?

Answer

A

S3 Intelligent Tiering uses AI to determine the storage class and has an extra fee for analysis.

Question 31

Q

What is S3 Glacier Instant Retrieval?

Answer

A

S3 Glacier Instant Retrieval is designed for long-term cold storage with instant retrieval.

Question 32

Q

What are the retrieval options for S3 Glacier Flexible Retrieval?

Answer

A

Standard Retrieval: Typically takes 3-5 hours
Expedited Retrieval: Typically takes 1-5 minutes
Bulk Retrieval: Typically takes 5-12 hours

Question 33

Q

What is S3 Express One Zone designed for?

Answer

A

S3 Express One Zone is made for consistent single-digit millisecond data access, best suited for frequently accessed data and latency-sensitive applications.

Question 34

Q

What are the key features of S3 Express One Zone?

Answer

A

The lowest latency available

Access speed up to 10x faster than Standard

Request costs 50% lower than Standard

Data is stored in a single AZ chosen by the user

Data is stored in an S3 Directory Bucket

Question 35

Q

How are request costs structured for S3 Express One Zone?

Answer

A

For request sizes up to 512 KB, there is a flat per request charge. For portions of requests greater than 512 KB, there are additional per GB charges for PUT and GET operations.

Question 36

Q

What is S3 One Zone-IA designed for?

Answer

A

S3 One Zone-IA is designed for less frequently accessed data with reduced availability.

Question 37

Q

What are the key features of S3 One Zone-IA?
How durable it is etc?

Answer

A

High Durability: 11 9’s of durability (99.999999999%)
Lower Availability: 99.5% because it is in one AZ
Cost-Effective Storage: costs 20% less than Standard-IA (which is 50% cheaper than Standard)
Data Redundancy: Risk of data loss due to storing in only one AZ
Retrieval Time: within milliseconds
Use Cases: Secondary backup copies of on-premise data, for recreating data in case of AZ failure. Not frequently accessed, non-mission-critical data.
Pricing:
- Storage per GB
- Per Request
- Minimum storage duration charge (30 days)
- Retrieval fee

Question 38

Q

What is S3 Glacier Instant Retrieval designed for?

Answer

A

S3 Glacier Instant Retrieval is designed for rarely accessed data that needs instant access.

Question 39

Q

What are the key features of S3 Glacier Instant Retrieval?
How durable it is etc?

Answer

A

High Durability: 11 9’s of durability (99.999999999%) like Standard
High Availability: 3 9’s of availability (99.9%) like Standard-IA
Cost-Effective Storage: 68% lower cost than Standard-IA if data is long-lived and accessed once per quarter
Retrieval Time: within milliseconds
Use Cases: Rarely accessed data that needs instant access, such as image hosting, online file-sharing apps, medical imaging and health records, news media assets, satellite and aerial imaging.
Pricing:
- Storage per GB
- Per Request
- Minimum storage duration charge (90 days)
- Retrieval fee

Question 40

Q

What is S3 Glacier Flexible Retrieval?

Answer

A

S3 Glacier Flexible Retrieval combines S3 and Glacier into a single set of APIs, providing faster retrieval than Vault-based S3 Glacier.

Question 41

Q

What are the retrieval tiers for S3 Glacier Flexible Retrieval?

Answer

A

Expedited: 1-5 minutes, urgent requests, limited to 250 MB archive size

Standard: 3-5 hours, default option, no archive limit

Bulk: 5-12 hours, no archive limit

Question 42

Q

What are the additional costs associated with S3 Glacier Flexible Retrieval?

Answer

A

Separate costs from the cost of storage, including per GB and per request charges. Archived objects are expanded by an additional 40KBs (32KBs for index and metadata, 8KBs for object name).

Question 43

Q

What is S3 Glacier Deep Archive?

Answer

A

S3 Glacier Deep Archive combines Amazon S3 and Amazon S3 Glacier into a single set of APIs, offering more cost-effective storage than Glacier Flexible Retrieval but with higher retrieval costs.

Question 44

Q

What are the retrieval tiers for S3 Glacier Deep Archive?

Answer

A

Standard: within 12 hours

Bulk: within 48 hours, no archive limit, suitable for very large amounts of data (petabytes)

No Expediated Tier

Question 45

Q

What are the additional costs associated with S3 Glacier Deep Archive?

Answer

A

Separate costs from the cost of storage, including per GB and per request charges. Archived objects are expanded by an additional 40KBs (32KBs for index and metadata, 8KBs for object name).

Question 46

Q

What is S3 Intelligent-Tiering?

Answer

A

S3 Intelligent-Tiering automatically moves objects into different storage tiers to reduce storage costs, with separate costs for object monitoring and automation.

Question 47

Q

What are the tiers in S3 Intelligent-Tiering?

Answer

A

Frequent Access (automatic): Default tier

Infrequent Access (automatic): If the object is not accessed after 30 days

Archive (Glacier) Instant Access (automatic): If the object is not accessed after 90 days

Archive Access (optional): If the object is not accessed after 90 days and after activating this option

Deep Archive Access (optional): If the object is not accessed after 180 days and after activating this option

Question 48

Q

What are the additional costs associated with S3 Intelligent-Tiering?

Answer

A

Additional costs for analyzing objects for at least 30 days.

Question 49

Q

What is Amazon S3 Block Public Access?

Answer

A

Amazon S3 Block Public Access is a safety feature enabled by default that blocks all public access to an S3 bucket.

Question 50

Q

What does the setting “Block public access to buckets and objects granted through new access control lists (ACLs)” do?

Answer

A

It blocks public access permissions applied to newly added buckets or objects and prevents the creation of new public access ACLs for existing buckets and objects. It does not change existing permissions that allow public access to S3 resources using ACLs.

Question 51

Q

What does the setting “Block public access to buckets and objects granted through any access control lists (ACLs)” do?

Answer

A

S3 will ignore all ACLs that grant public access to buckets and objects.

Question 52

Q

What does the setting “Block public access to buckets and objects granted through new public bucket or access point policies” do?

Answer

A

S3 will block new bucket and access point policies that grant public access to buckets and objects. This setting doesn’t change any existing policies that allow public access to S3 resources.

Question 53

Q

What does the setting “Block public and cross-account access to buckets and objects through any public bucket or access point policies” do?

Answer

A

S3 will ignore public and cross-account access for buckets or access points with policies that grant public access to buckets and objects.

If buckets or access point have policies that grant public access then those policies will be ignored.

Question 54

Q

What are the four options available for Amazon S3 Block Public Access?

Answer

A

New Access Control Lists
Any Access Control Lists
New Bucket Policies or Access Points
Any Bucket Policies or Access Points

Question 55

Q

What is an Access Control List (ACL) in Amazon S3?

Answer

A

An ACL grants basic read/write permissions to other AWS accounts. It is a legacy method not recommended for current use.

Question 56

Q

What limitations do ACLs have in Amazon S3?

Answer

A

You can grant permissions only to other AWS accounts.
You cannot grant permissions to users in your account.
You cannot grant conditional permissions.
You cannot explicitly deny permissions.

Question 57

Q

What are more robust alternatives to ACLs for providing cross-account access in Amazon S3?

Answer

A

More robust alternatives include bucket policies and access points.

Question 58

Q

What is the primary difference between Bucket Policies and IAM Policies?

Answer

A

Bucket Policies are more convenient for granting access to specific S3 buckets and their objects, while IAM Policies provide access to many AWS services and can cover multiple buckets in one policy.

They differ also in size:
Bucket Policy up to 20KB
IAM Policy: User - 2KB, Group - 5KB, Role - 10KB.

Question 59

Q

What is the purpose of Access Grants in relation to S3?

Answer

A

To map identities in a directory service (e.g., IAM Identity Center) to access datasets in S3.

Question 60

Q

What does the Access Analyzer for S3 do?

Answer

A

It alerts when an S3 bucket is exposed to the Internet or to other AWS accounts.

Question 61

Q

How can you ensure Internetwork Traffic Privacy across different networks?

Answer

A

Through AWS PrivateLink (VPC Interface Endpoints) or VPC Gateway Endpoints.

Question 62

Q

What is S3 CORS?

Answer

A

It can be set for S3 bucket with static website hosting so other origins can perform HTTP requests from this website.

Question 63

Q

What are encryption types and their key points?
When data is encrypted

Answer

A

Encryption types:
- In Transit - data is encrypted by the sender and decrypted by the receiver
- At Rest :
1. Client-Side Encryption (CSE) - data is encrypted by the client with a key and sent to a server. The server cannot decrypt the data because it doesn’t have the key.
2. Server-Side Encryption (SSE) - data is encrypted by the server and decrypted when requested by the client.

Question 64

Q

What are the two main types of encryption for AWS S3?

Answer

A

Client-Side Encryption (CSE) and Server-Side Encryption (SSE).

Answer 65

A

Data is encrypted by the client with a key before being sent to the server, and the server cannot decrypt the data.

Answer 66

A

Data is encrypted by the server and decrypted when requested by the client.

Answer 67

A

SSE-S3 - key is managed by Amazon S3 using AES-GCM/AES256 algorithm
SSE-KMS - you are managing the key using AWS Key Management Service (KMS)
SSE-C - customer provides and manages the key
DSSE-KMS - key is managed by the AWS KMS, data is encrypted twice with two different keys.

Answer 68

A

A short-lived bucket-level key used to decrease costs and improve performance, applicable to SSE-S3 and SSE-KMS.

Answer 69

A

Yes, they can be applied at the bucket level for all new objects or at the object level for specific objects.

Answer 70

A

The client encrypts data before uploading it to S3, and the server does not handle decryption.

Answer 71

A

Strongly Consistent: Data is always consistent but may have a slight delay.
Eventually Consistent: Data might not be consistent immediately but will become consistent over time.

Answer 72

A

Cross-Region Replication (CRR)
Same-Region Replication (SRR)
Bi-Directional Replication (BDR)
S3 Batch Replication

Answer 73

A

Versioned
Unversioned
Versioning Suspended

Answer 74

A

Versions are stored in the same object key address.
Must be explicitly enabled; it is disabled by default.
Can only be suspended, not disabled.
Integrates with S3 Lifecycle rules.
MFA Delete provides extra protection.

Answer 75

A

A feature that speeds up file transfers to S3 by using CloudFront’s edge locations.
Users upload to a distinct endpoint that routes through edge locations to the Amazon Global Network.
https://s3-accelerate.amazonaws.com and https://s3-accelerate.dualstack.amazonaws.com support both IPv4 and IPv6.
only virtual-hosted style requests are supported
buckets cannot contain periods and must be DNS-compliant

Answer 76

A

Provide temporary access to upload or download objects.
Presigned URLs are commonly used to grant access to private objects.
Presigned URLs are generated via AWS CLI or AWS SDK.

Answer 77

A

Simplify managing data access at scale for shared datasets.

Answer 78

A

Distinct permissions via an Access Points Policy
Distinct network controls
Distinct block public access feature

Answer 79

A

Multi-Region Access Point is a global endpoint to route requests to multiple buckets residing in different regions.
Multi-Region Access Point will return data from the regional bucket with the lowest latency.

Answer 80

A

Transform the output of S3 object requests to present data differently without modifying the objects in the bucket.
Can perform transformations on HEAD, GET, and LIST operations.

Answer 81

A

Mount an S3 bucket to a Linux local file system.
Supports basic file-system operations like reading files up to 5 TB, listing, creating new files.

Answer 82

A

Modify existing files, delete directories, support symbolic links or file locking.

Answer 83

A

Through Archive Storage Classes (e.g., Glacier Deep Archive) or Archive Access Tiers (e.g., Intelligent-Tiering).

Answer 84

A

S3 Standard
S3 Standard-IA
S3 Express One Zone
S3 One Zone-IA
Reduced Redundancy Storage (RRS)
S3 Glacier Instant Retrieval

Answer 85

A

Allows bucket owners to offset costs to the requester who accesses the data.
Requester pays for data transfer and request costs, while the bucket owner pays for everything else.

Answer 86

A

Include x-amz-requester-payer header or parameter in the request.

Answer 87

A

A 403 (Forbidden Request) HTTP Error code

Answer 88

A

x-amz-request-payer not included
Request authentication fails (something is wrong with IAM role or IAM policy)
The request is anonymous
The request is a SOAP request (which is not allowed)

Answer 89

A

Alternatives to AWS Services that work with Amazon S3.

Answer 90

A

Performing large-scale batch operations on Amazon S3 objects, involving billions of objects and exabytes of data.

Answer 91

A

Copies each object listed in the manifest to the specified destination bucket.

Answer 92

A

Run a Lambda function against each object in the batch.

Answer 93

A

Replaces the Amazon S3 object tags of each object in the batch.

Answer 94

A

Replaces the ACLs for each object in the batch.

Answer 95

A

Sends a restore request to S3 Glacier for each object.

Answer 96

A

Prevents overwriting or deleting objects for a fixed amount of time.

Answer 97

A

Prevents overwriting or deleting objects until the legal hold is removed.

Answer 98

A

Provide lists of objects in an S3 bucket or supply an S3 Inventory report manifest.json.

Answer 99

A

Daily (within 48 hours) or Weekly (first report within 48 hours, future reports every Sunday).

Answer 100

A

Allows S3 Buckets to notify other AWS Services about S3 events and helps with application integration.

Answer 101

A

Analyzes storage access patterns of objects within a bucket and recommends moving objects between Standard and Standard-IA classes.

Answer 102

A

A storage analysis tool for S3 buckets across an AWS Organization, providing an interactive dashboard updated daily.

Answer 103

A

Allows hosting and serving a static website from an S3 bucket with a website endpoint. HTTPS is not supported directly; Amazon CloudFront is required for HTTPS.

Answer 104

A

A feature that allows uploading a single object in a set of parts, recommended for files larger than 100MB.

Answer 105

A

Improved throughput, reuploading only missing parts in case of network failure, no expiry time for uploading parts, and the ability to upload while creating a file.

Answer 106

A

Retrieval of a specific range of bytes from an object using the Range header in GetObject API requests.

Answer 107

A

Allows multiple byte ranges to be requested simultaneously for parallel processing.

Answer 108

A

Byte range fetching retrieves specific portions of an object’s data regardless of format, while S3 Select allows querying object content with SQL statements.