S3 Flashcards
S3
Simple Storage Service
What is S3
provides developers and IT Teams with secure, durable, highly-scalable object storage.
Object based storage
where you can store files, pictures, pdfs etc
Block Based Storage
EC2 - where you install operating system, databases or applications
File size that can be stored in S3
0 - 5 TB
S3 storage
unlimited
S3 Files are stored in
Buckets
S3 naming
universal namespace - it has be unique globally;
Sample of an S3 name
https://s3-eu-west-1.amazonaws.com/acloudguru
When you upload a file to S3, this will be returned when the upload is successful
HTTP 200
Data consistency model for S3
- Read after write consistency for PUTS of new objects
2. Eventual Consistency for overwrite PUTS and DELETES (can take sometime to propagate)
Read after Write consistency
Uploading a file and able to read/access it right away (millisecond after)
Eventual consistency
if we’re updating/deleting a file, we might get 2 files -old and new but eventually you’ll be able to get the new file (after a min) - delay is due to the fact that S3 is spread across multiple AZs
S3 is object based; objects consists of the following
- Key
- Value
- Version ID
- Metadata
- Subresources
Key
name of the object
Value
simply the data and is made up of sequence of bytes; it’s the data inside the files (hello cloud gurus)
Metadata
tags (owned by sales marketing etc)
Access Control List
putting individual permissions on a file
Version ID
important for versioning
S3 durability for all storage classes
99.99999999 (11x9s)
S3 Tiered Storage
storage classes
LIfecycle Management
archiving files; moving files from one storage tier to another based on how old the file is
Versioning
multiple versions of the file
Securing your data in S3 using:
- Access Control Lists
2. Bucket Policies
Access Control Lists
goes down to the individual file level
Bucket Policies
locking down the bucket itself at the bucket level
S3 has the following features
- Tiered Storage Available
- Lifecycle Management
- Versioning
- Encryption
- MFA Delete
- Securing data using ACL and BP
S3 Storage Classes
- S3 Standard
- S3 Intelligent Tiering
- S3 Standard-IA
- S3 One Zone-IA
- S3 Glacier
- S3 Glacier Deep Archive
S3 Standard Availability Availability SLA Availability Zones Min capacity charge/object Min storage duration charge Retrieval Fee First byte latency
99.99%
99.9%
>=3
NA
NA
NA
milliseconds
S3 Standard is stored
redundantly across multiple devices in multiple facilities and is designed to sustain the loss of 2 facilities concurrently
S3 Standard - IA Availability Availability SLA Availability Zones Min capacity charge/object Min storage duration charge Retrieval Fee First byte latency
99.9% 99% >=3 128kb 30 days Per GB retrieved milliseconds
S3 - IA is for
Use case
- data that is accessed less frequently but requires rapid access when needed
- lower fee than S3 but you are charged a retrieval fee
ideally suited for long-term file storage, older sync and share storage, and other aging data
S3 One Zone - IA Availability Availability SLA Availability Zones Min capacity charge/object Min storage duration charge Retrieval Fee First byte latency
99.5% 99% 1 128kb 30 days Per GB retrieved milliseconds
S3 One Zone - IA is for
Use case
- were you want a lower cost option for infrequently accessed data but do not require the multiple AZ data resilience
backup copies, disaster recovery copies or other easily re-creatable data.
S3 Intelligent Tiering Availability Availability SLA Availability Zones Min capacity charge/object Min storage duration charge Retrieval Fee First byte latency
99.9% 99% >=3 NA 30 days NA milliseconds
S3 - Intelligent Tiering
Use case
Designed to optimize costs by automatically moving data to the most cost-effective access tier, without performance impact or operational overhead
Unknown access patterns;an also be used to store new data sets where, shortly after upload, access is frequent, but decreases as the data set ages. Then you can move the data set to S3 One Zone-IA or archive it to S3 Glacier.
S3 Glacier Availability Availability SLA Availability Zones Min capacity charge/object Min storage duration charge Retrieval Fee First byte latency
99.99%
99.9%
>=3
40 KB
90 days
per GB retrieved
select minutes or hours
S3 Glacier
Retrieval times
Use case
- secure, durable and low-cost storage class for data archiving
- retrieval times configurable from minutes to hours
media asset workflows, healthcare information archiving, regulatory and compliance archiving, scientific data storage, digital preservation, magnetic tape replacement