Storage Flashcards
EFS Performance Modes
Max I/O
General Purpose
Which AWS services allow you to share a file system across multiple EC2 instances?
Amazon EFS
Amazon FSx for Windows File Server
Amazon FSx for Lustre
How long does it take to access data in
a. S3 Glacier
b. S3 Glacier Deep Archive?
A. A few minutes to hours, depending on the retrieval option
B. Within 12 hours
Amazon S3 One Zone-Infrequent Access (S3 One Zone-IA)
Name typical use cases
S3 One Zone-IA stores data in a single AZ and costs 20% less than S3 Standard-IA
good choice for storing secondary backup copies of on-premises data or easily re-creatable data
Amazon S3 Intelligent-Tiering (S3 Intelligent-Tiering)
use when data access patterns change, without performance impact or operational overhead
S3 Intelligent-Tiering monitors access patterns and then moves objects that have not been accessed in 30 consecutive days to the Infrequent Access tier.
Once you have activated one or both of the Archive Access tiers, S3 Intelligent-Tiering will move objects that haven’t been accessed for 90 consecutive days to the Archive Access tier and then after 180 consecutive days of no access to the Deep Archive Access tier.
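The tier movement described on this card can be sketched as a small function. This is only an illustration of the 30/90/180-day thresholds stated above; the function name, parameter names, and tier labels are hypothetical and are not part of any AWS API.

```python
def intelligent_tiering_tier(
    days_since_last_access: int,
    archive: bool = False,
    deep_archive: bool = False,
) -> str:
    """Return the access tier an object would sit in, per the card's thresholds.

    `archive` and `deep_archive` model whether the optional Archive Access
    and Deep Archive Access tiers have been activated (hypothetical flags).
    """
    if days_since_last_access < 0:
        raise ValueError("days_since_last_access must be non-negative")
    # Check the coldest tier first so the longest idle period wins.
    if deep_archive and days_since_last_access >= 180:
        return "Deep Archive Access"
    if archive and days_since_last_access >= 90:
        return "Archive Access"
    if days_since_last_access >= 30:
        return "Infrequent Access"
    return "Frequent Access"
```

With neither archive tier activated, an object idle for 120 days stays in the Infrequent Access tier; with `archive=True` the same object would be in Archive Access.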
List the S3 storage classes in order from hottest (most frequently accessed) to coldest (rarely accessed), which is also roughly from highest to lowest storage cost.
- S3 Standard
- S3 Intelligent-Tiering
- S3 Standard-Infrequent Access (S3 Standard-IA)
- S3 One Zone-Infrequent Access (S3 One Zone-IA)
- S3 Glacier
- S3 Glacier Deep Archive
AWS Storage Gateway - Tape Gateway
AWS Storage Gateway - Tape Gateway allows moving tape backups to the cloud.
AWS Storage Gateway - Volume Gateway
You can configure the AWS Storage Gateway service as a Volume Gateway to present cloud-based iSCSI block storage volumes to your on-premises applications.
Volume Gateway stores and manages on-premises data in Amazon S3 on your behalf and operates in either cached mode or stored mode.
AWS Storage Gateway - Cached Volume Gateway mode
primary data is stored in Amazon S3, while retaining your frequently accessed data locally in the cache for low latency access
AWS Storage Gateway - File Gateway
AWS Storage Gateway’s file interface, or file gateway, offers you a seamless way to connect to the cloud in order to store application data files and backup images as durable objects on Amazon S3 cloud storage. File gateway offers SMB or NFS-based access to data in Amazon S3 with local caching
AWS Storage Gateway
The service provides three different types of gateways – Tape Gateway, File Gateway, and Volume Gateway – that seamlessly connect on-premises applications to cloud storage, caching data locally for low-latency access.
Supported S3 lifecycle transitions
Transitions follow a waterfall in this order: S3 Standard, S3 Standard-IA, S3 Intelligent-Tiering, S3 One Zone-IA, S3 Glacier, and S3 Glacier Deep Archive. Supported transitions:
The S3 Standard storage class to any other storage class.
Any storage class to the S3 Glacier or S3 Glacier Deep Archive storage classes.
The S3 Standard-IA storage class to the S3 Intelligent-Tiering or S3 One Zone-IA storage classes.
The S3 Intelligent-Tiering storage class to the S3 One Zone-IA storage class.
The S3 Glacier storage class to the S3 Glacier Deep Archive storage class.
You can't transition from any storage class to Reduced Redundancy Storage (RRS).
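The waterfall can be sketched as an ordered list in which a transition is allowed only when it moves further down the list. This is a minimal sketch of the rules on this card, not an AWS API: the function name is hypothetical, the strings are the storage class identifiers used by the S3 API, and "any storage class" is assumed to mean any class that sits higher in the waterfall.

```python
# Waterfall order from the card, using S3 API storage class identifiers.
WATERFALL = [
    "STANDARD",
    "STANDARD_IA",
    "INTELLIGENT_TIERING",
    "ONEZONE_IA",
    "GLACIER",
    "DEEP_ARCHIVE",
]


def can_transition(source: str, target: str) -> bool:
    """Return True if a lifecycle rule may move objects from source to target."""
    if target == "REDUCED_REDUNDANCY":
        # Nothing can be transitioned into RRS.
        return False
    if source not in WATERFALL or target not in WATERFALL:
        raise ValueError(f"unknown storage class: {source!r} or {target!r}")
    # Only downward moves in the waterfall are supported.
    return WATERFALL.index(source) < WATERFALL.index(target)
```

For example, `can_transition("STANDARD_IA", "ONEZONE_IA")` is allowed, while `can_transition("GLACIER", "STANDARD")` is not, because lifecycle rules never move objects back up the waterfall.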
S3 Glacier Deep Archive
Use for archiving data that rarely needs to be accessed. It has a minimum storage duration of 180 days and a default retrieval time of 12 hours. If you delete, overwrite, or transition an object to a different storage class before the 180-day minimum, you are still charged for 180 days.
S3 Glacier
Use for archives where portions of the data might need to be retrieved in minutes. Data stored in the S3 Glacier storage class has a minimum storage duration of 90 days and can be accessed in as little as 1-5 minutes using expedited retrieval. If you delete, overwrite, or transition an object to a different storage class before the 90-day minimum, you are still charged for 90 days.
- Expedited retrievals are typically made available within 1-5 minutes.
- Standard retrievals typically complete within 3-5 hours.
- Bulk retrievals typically complete within 5-12 hours.
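The minimum storage duration rule on the S3 Glacier and S3 Glacier Deep Archive cards comes down to "you are billed for at least the minimum number of days". A minimal sketch, assuming the 90-day and 180-day minimums stated on those cards; the function and table names are hypothetical, and classes missing from the table are treated as having no minimum purely for the sake of the example.

```python
# Minimum storage durations (in days) taken from the two cards above.
MIN_STORAGE_DAYS = {
    "GLACIER": 90,
    "DEEP_ARCHIVE": 180,
}


def billed_days(storage_class: str, days_stored: int) -> int:
    """Days of storage charged when an object is deleted, overwritten,
    or transitioned after `days_stored` days."""
    if days_stored < 0:
        raise ValueError("days_stored must be non-negative")
    # Assumption: a class not listed in the table has no minimum here.
    minimum = MIN_STORAGE_DAYS.get(storage_class, 0)
    return max(days_stored, minimum)
```

Deleting an S3 Glacier object after 30 days is therefore billed as 90 days, while an S3 Glacier Deep Archive object kept for 200 days is billed for the 200 days it was actually stored.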
S3 IA, S3 One Zone IA, Typical Use Cases
S3 Standard-IA and S3 One Zone-IA storage classes are designed for long-lived and infrequently accessed data. (IA stands for infrequent access.) S3 Standard-IA and S3 One Zone-IA objects are available for millisecond access (similar to the S3 Standard storage class). Amazon S3 charges a retrieval fee for these objects, so they are most suitable for infrequently accessed data.
S3 Standard-IA — Use for your primary or only copy of data that can’t be re-created.
S3 One Zone-IA — Use if you can re-create the data if the Availability Zone fails, and for object replicas when setting S3 Cross-Region Replication (CRR).
S3 Reduced Redundancy
What happens if data is lost?
What is the annual expected loss percentage/decimal?
The Reduced Redundancy Storage (RRS) storage class is designed for noncritical, reproducible data that can be stored with less redundancy than the S3 Standard storage class.
For durability, RRS objects have an average annual expected loss of 0.01 percent of objects. If an RRS object is lost, when requests are made to that object, Amazon S3 returns a 405 error.
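To make the 0.01 percent figure concrete, the expected number of lost objects per year is the object count multiplied by the loss rate. A short worked sketch, with a hypothetical function name and the 0.01 percent rate from this card as the default:

```python
def expected_annual_rrs_losses(object_count: int, annual_loss_percent: float = 0.01) -> float:
    """Expected number of RRS objects lost per year.

    The percentage is converted to a fraction (0.01 percent = 0.0001)
    before it is multiplied by the object count.
    """
    if object_count < 0:
        raise ValueError("object_count must be non-negative")
    return object_count * annual_loss_percent / 100
```

Storing 10,000 objects in RRS therefore corresponds to an expected loss of about one object per year, and 1,000,000 objects to about 100 objects per year.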
S3 Standard
The default storage class. If you don’t specify the storage class when you upload an object, Amazon S3 assigns the S3 Standard storage class.
S3 Intelligent-Tiering storage class
designed to optimize costs by automatically moving data to the most cost-effective access tier, without performance impact or operational overhead. It works by storing objects in two access tiers: one tier that is optimized for frequent access and another lower-cost tier that is optimized for infrequent access. The minimum storage duration charge is 30 days
AWS Snowmobile
Exabyte-scale data transfer using a tractor-trailer
Transfers up to 100 PB per Snowmobile
Enhanced security features - GPS tracking, alarm, 24/7 video surveillance, encryption etc.
AWS Snowball & AWS Snowball Edge Optimized
Petabyte-scale transport appliance that is shipped to you so you can attach it to your local network and transfer files directly to it.
The device is then shipped back to AWS, where the data is transferred into S3. The E Ink shipping label updates automatically, and the job can be tracked via SNS or the console.
Edge Optimized - ideal for transfer scenarios that require additional compute in remote, disconnected or harsh environments
Amazon S3 Transfer Acceleration - what is it and when do you use it?
Allows fast and easy data transfer into S3 by using CloudFront’s edge locations, where the data is routed to S3 over optimized network paths
Use it when:
- You have customers all over the world uploading to a central bucket
- You transfer gigabytes or terabytes of data across continents
- You underutilize the available bandwidth when uploading to S3 over the internet
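Clients use Transfer Acceleration by sending requests to the bucket's accelerate endpoint instead of the regular one. A minimal sketch of how that endpoint name is formed, assuming the documented `<bucket>.s3-accelerate.amazonaws.com` form; the function name and the bucket name in the usage note are hypothetical.

```python
def accelerate_endpoint(bucket: str) -> str:
    """Build the Transfer Acceleration endpoint URL for a bucket."""
    if not bucket:
        raise ValueError("bucket name must not be empty")
    # Transfer Acceleration requires a DNS-compliant bucket name without periods.
    if "." in bucket:
        raise ValueError("bucket names containing periods are not supported")
    return f"https://{bucket}.s3-accelerate.amazonaws.com"
```

For a bucket named `example-uploads`, this yields `https://example-uploads.s3-accelerate.amazonaws.com`. Acceleration must also be enabled on the bucket itself before the endpoint can be used.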
AWS Transfer for SFTP
Fully managed, highly available SFTP service that enables applications to transfer files over SFTP directly to S3
You create a server, set up user accounts, and associate the server with one or more S3 buckets
AWS DataSync
How is the service billed?
Online data transfer service that simplifies, automates, and accelerates copying large amounts of data between on-premises storage systems and AWS Storage services, as well as between AWS Storage services.
DataSync can copy data between Network File System (NFS), Server Message Block (SMB) file servers, Hadoop Distributed File Systems (HDFS), self-managed object storage, AWS Snowcone, Amazon Simple Storage Service (Amazon S3) buckets, Amazon Elastic File System (Amazon EFS) file systems, and Amazon FSx for Windows File Server file systems.
Only pay for the data you copy
AWS S3 CORS
How is it configured?
Allows configuration of your bucket to allow cross origin requests by defining:
- Origins that you allow to access bucket
- The HTTP methods supported for each origin
- Other operation-specific information (e.g. allowed headers, max age, etc.)
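The three items above map onto the fields of an S3 CORS rule. A minimal sketch that builds a configuration in the shape accepted by boto3's `put_bucket_cors` (`CORSRules` containing `AllowedOrigins`, `AllowedMethods`, `AllowedHeaders`, and `MaxAgeSeconds`); the helper function itself and the example origin are hypothetical.

```python
# HTTP methods that S3 CORS rules accept.
SUPPORTED_METHODS = {"GET", "PUT", "POST", "DELETE", "HEAD"}


def build_cors_configuration(origins, methods, headers=None, max_age_seconds=None):
    """Build a single-rule CORS configuration dictionary for an S3 bucket."""
    unsupported = set(methods) - SUPPORTED_METHODS
    if unsupported:
        raise ValueError(f"unsupported HTTP methods: {sorted(unsupported)}")
    rule = {
        "AllowedOrigins": list(origins),   # origins allowed to access the bucket
        "AllowedMethods": list(methods),   # HTTP methods allowed for those origins
    }
    if headers:
        rule["AllowedHeaders"] = list(headers)
    if max_age_seconds is not None:
        # How long a browser may cache the preflight response, in seconds.
        rule["MaxAgeSeconds"] = max_age_seconds
    return {"CORSRules": [rule]}


config = build_cors_configuration(
    origins=["https://www.example.com"],
    methods=["GET", "PUT"],
    headers=["*"],
    max_age_seconds=3000,
)
# The result could then be applied with boto3, for example:
# boto3.client("s3").put_bucket_cors(Bucket="my-bucket", CORSConfiguration=config)
```

The same rule can also be entered as JSON in the bucket's permissions settings in the console; the dictionary form is shown here only because it is easy to build and check in code.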