S3 Flashcards
Amazon S3
Is an object storage service offering industry-leading scalability, data availability, security, and performance.
-Unlimited
-Many websites use Amazon S3 as a backbone
S3 Use cases
• Backup and storage • Disaster Recovery
• Archive • Hybrid Cloud storage
• Application hosting • Media hosting
• Data lakes & big data analytics
• Software delivery • Static website
S3 Buckets
Allows people to store objects (files) in “buckets” (directories)
• Buckets must have a globally unique name (across all regions all accounts)
• Buckets are defined at the region level
S3 Objects
• Objects (files) have a Key
• Max Object Size is 5TB
• If uploading more than 5GB, must use “multi-part upload”
• The key is composed of prefix + object name s3://mybucket/
my_folder1/another_folder/my_file.txt (THIS)
S3 Security
• User based = IAM policies for users and for services
• Resource Based = Bucket Policies + Public Access (allows cross account) – Object Access Control List (ACL) – Bucket Access Control List (ACL)
• Encryption: encrypt objects in Amazon S3 using encryption keys
S3 Bucket Policies
• JSON based policies
Use S3 bucket for policy to:
• Grant public access to the bucket
• Force objects to be encrypted at upload
• Grant access to another account (Cross
Account
Bucket settings for Block Public Access
• These settings were created to prevent company data leaks
S3 Websites
• S3 can host static websites and have them accessible on the www
S3 - Versioning
• It is enabled at the bucket level
• Protect against unintended deletes (ability to restore a version)
• Easy roll back to previous version
S3 Access Logs
• For audit purposes
• Any request made to S3, from any account, authorized or denied, will be logged into another S3 bucket
• Very helpful to come down to the root cause of an issue, or audit usage, view suspicious patterns, etc…
S3 Replication (CRR & SRR)
• Must enable versioning in source and destination
• Buckets can be in different accounts
• Copying is asynchronous
• Must give proper IAM permissions to S3
• Cross-Region Replication (CRR) - Use cases: compliance, lower latency access, replication across accounts
• Same-Region Replication (SRR) – Use cases: log aggregation, live replication
S3 Storage Classes
• Amazon S3 Standard - General Purpose
• Amazon S3 Standard-Infrequent Access (IA)
• Amazon S3 One Zone-Infrequent Access
• Amazon S3 Glacier Instant Retrieval
• Amazon S3 Glacier Flexible Retrieval
• Amazon S3 Glacier Deep Archive
• Amazon S3 Intelligent Tiering
S3 Durability and Availability
• Durability = High durability (99.999999999%, 11 9’s) of objects across multiple AZ
• Availability = Measures how readily available a service is, varies depending on storage class
S3 Standard – General Purpose
• 99.99% Availability
• Used for frequently accessed data
• Low latency and high throughput
• Sustain 2 concurrent facility failures (3 AZs)
• Use Cases: Big Data analytics, mobile & gaming applications,
content distribution…
S3 Storage Classes – Infrequent Access
• For data that is less frequently accessed, but requires rapid access when needed
• Lower cost than S3 Standard
• S3 Standard-IA = 99.9% Availability & Use cases: Disaster Recovery, backups
• S3 One Zone-IA = In a single AZ, 99.5% Availability, Use Cases: Storing secondary backup copies of on-premise data, or data you can recreate