S3 Flashcards
What is the max object size that can be uploaded?
5TB (5000GB)
How can you upload an object larger than 5GB?
Must use ‘multi-part upload’ for greater than 5GB
Under what conditions can an IAM principal access an S3 object?
- The user IAM permissions ALLOW it OR the resource policy ALLOWS it
- AND there is no explicit DENY
How could you allow an EC2 instance to access an S3 bucket?
Create an IAM Role, assign the correct permissions, and assign the role to the EC2 instance.
How would you allow another user to access a bucket you have created?
Create a bucket policy for the S3 bucket that allows cross-account.
What are the two types of S3 replication?
CRR (cross region replication) and SRR (same region replication)
What are the requirements for replication?
- Must enable Versioning in source and destination buckets
- Must give proper IAM permissions to S3
What are the use cases for S3 replication (for both types)?
CRR - compliance, lower latency access, replication across accounts
SRR - log aggregation, live replication between production and test accounts
What is a limitation of replication?
Only new objects are replicated
What is S3 Batch Replication?
S3 Batch Replication - replicates existing objects and objects that failed replication
What are delete markers, and how do they (optionally) tie in to replication?
With versioning turned on, a delete marker is a marker placed on an object that has been deleted - this allows a roll-back of the delete using the delete marker version.
For replication, delete markers (if enabled) are replicated from source to target. Deletions with a version ID (i.e., a permanent delete) are not replicated (to avoid malicious deletes).
List the S3 Storage Classes
- Standard - General Purpose
- Standard-Infrequent Access (IA)
- One Zone-Infrequent Access
- Glacier Instant Retrieval
- Glacier Flexible Retrieval
- Glacier Deep Archive
- Intelligent Tiering
What is meant by Durability?
- How many times an object is going to be lost by S3
- Same Durability for all storage classes
99.999999999% (11 9s) durability.
What is meant by Availability?
- How available a service is (i.e., uptime vs downtime)
- Varies depending on storage class
S3 standard is 99.99% available (53 minute downtime per year)
Describe S3 Standard - General Purpose, and give a use case.
- 99.99% availability
- Used for frequently accessed data
- Low latency and high throughput
- Sustain 2 concurrent facility failures
Big data analytics, mobile & gaming applications, content distribution
Describe S3 Standard & One Zone - Infrequent Access, and give a use case.
- Used for less frequently accessed data, but rapid access when needed
- Lower cost than Standard
Standard-IA:
99.9% available
Disaster recovery, backups
One Zone-IA
Single AZ - data lost when AZ is destroyed
99.5% available
Storing secondary backup copies of on-premise data, or data you can recreate
Describe S3 Glacier Storage Classes in general
Low cost object storage meant for archiving/backup
Pricing: Price for storage + object retrieval cost
Describe S3 Glacier Storage Class - Instant Retrieval
Millisecond retrieval, great for data accessed once a quarter
Minimum storage duration of 90 days
Describe S3 Glacier Storage Class - Flexible Retrieval
Expedited (1 to 5 minutes), Standard (3 to 5 hours), Bulk (5 to 12 hours) - bulk is free
Minimum storage duration of 90 days
Describe S3 Glacier Storage Class - Deep Archive
Standard (12 hours), Bulk (48 hours)
Minimum storage duration of 180 days
Describe S3 Intelligent Tiering
- Small monthly monitoring and auto-tiering fee
- Moves objects automatically between access tiers based on usage
- No retrieval charges in Intelligent Tiering
What lifecycle rule would you use to move an object to another storage class after a certain period of time?
Transition Action