S3 Flashcards
What is S3?
S3 provides developers and IT teams with secure, durable, highly scalable object storage. It comes with a simple web services interface to store and retrieve any amount of data from anywhere on the web.
It is like a hard disk on the cloud
S3 allows storage of what type of objects?
Files from 1 byte to 5TB in size. It is not suitable to install an OS on S3.
What is the storage limit for each account?
Unlimited
What are buckets?
Akin to a folder on a file system
S3 is a universal namespace, meaning…
Names must be unique globally
S3 provides read after write consistency for PUTS of new objects meaning…
Objects are available to read immediately upon storage in the cloud
S3 provides eventual consistency for overwrite PUTS and DELETES meaning…
Modifications to stored objects are eventually propagated in the cloud
S3 objects are simple key, value stores consisting of:
- Key - the name of the object
- Value - the data consisting of a sequence of bytes
- Version ID
- Metadata - data about the data being stored
- Subresources
- Access Control Lists
What availability does Amazon guarantee?
99.99%
S3 features include:
99.999999999% durability (11 X 9's) Tiered storage Lifecycle management Versioning Encryption Access Control Lists and Bucket policies
S3 - IA storage tier is good for
Data that is accessed less frequently, but requires rapid access when needed. Lower fee than S3, but you are charged a retrieval fee.
Provides SSL support
Reduced Redundancy Storage tier
Designed to provide 99.99% durability and 99.99% availability of objects over a given year.
Provides SSL support
Glacier
Very cheap, but used for archival only. It takes 3.-5 hours to restore from Glacier. It costs as little 1 cent per GB/month
Provides SSL support
What does Glacier charge for?
Storage
Requests
Data Transfers
Lifecycle management transition points include
Transition to the Standard - Infrequent Access Storage Class (128kb and 30 days after the creation date)
Archive to the Glacier Storage Class (30 days after IA, if relevant)
Permanently Delete
To what object versions can lifecycle management be applied?
current and previous versions
What is a CDN?
A content delivery network (CDN) is a system of distributed servers (network) that deliver webpages and other web content to a user based on the geographic locations of the user, the origin of the webpage and a content delivery server.
Edge Location
This is the location where content will be cached. This is separate to an AWS Region/AZ
Origin
This is the origin of all the files that the CDN distribute. This can be either an S3 bucket, an EC2 instance, an Elastic Load Balancer or Route53
Distribution
This is the name given the CDN which consists of a collection of Edge locations
What methods can be used to set up access control to buckets?
Bucket Policies
Access Control Lists
True or False. All newly created buckets are PUBLIC.
FALSE. All newly created buckets are PRIVATE.
What log auditing do buckets feature?
S3 buckets can be configured to log access requests. These logs can be done to another bucket or AWS account
What are the two types of encryption for S3 buckets?
In Transit (SSL/TLS) At Rest (Server and client side)
List the server side encryption methods
S3 managed keys (SSE-S3)
AWS Key management (SSE-KMS)
Customer Provided Keys (SSE-C)
What is storage gateway?
A service that connects an on-premise software appliance with cloud-based storage to provide seamless and secure integration between an organizations on-premise IT environment and AWS storage infrastructure.
What are the three types of storage gateways?
Gateway Stored Volumes
Gateway Cached Volumes
Gateway Virtual Tape Library
What architecture does gateway stored volumes employ?
Entire data set stored locally.
Gateway backs data up asynchronously to S3.
Good solution for durable and inexpensive backups that you can recover locally or from EC2
What architecture does gateway cached volumes employ?
Most frequently accessed data is stored locally
Entire data set is stored in S3.
Must have internet connectivity to access all of your data.
Describe Virtual Tape Library (VTL)
Mimics physical tape library. Uses an industry standard iSCSI interface which provides online access to virtual tapes for popular apps like NetBackup, BackupExec, Veam, etc.
List the two types of Export/Import
Import/Export Disk
Import/Export Snowball
What is Import/Export Disk
Allows bypass of the internet by using portable devices for transport which is used on Amazon’s internal high-speed network.
Supports import to EBS, S3, Glacier and export from S3
What is snowball used for?
Transfer of petabyte-scale data storage.
supports import/export to S3 only
What is transfer acceleration?
Method used to accelerate uploads to S3 through the use of URLs to upload to a nearby edge location which will transfer the data to S3