Storage Flashcards
You have an instance store on an EC2 instance, the instance stops and is restarted but the data is gone! What happened?
Instance stores are ephemeral
What happens if you reboot an instance that has an instance store?
The data IS NOT deleted
AWS S3 - at what filesize should you consider using Multipart upload
100MB
Clients need to transfer gigabytes to terabytes of data quickly and on a regular basis to an S3 bucket. Which S3 feature will enable these long distance data transfers in a secure and fast manner?
Transfer Acceleration
What happens to a EC2 root volume on instance deletion by default
root volume is deleted
What protcol does EFS support?
NFS
4 services EFS can easily be mounted on?
EC2
ECS
EKS
Lambda
What are the storage modes EFS supports?
Standard and IA
MultiAZ by default.
Can also use lifecycle policies to age off from standard to IA
File storage service that can serve as dropin replacement for ONTAP deployments
Amazon FSx for NetApp ONTAP
What OS does Amazon FSx for NetApp ONTAP support?
Linux, Windows, and macOS compute instances running in AWS or on premises.
What protcol(s) does Amazon FSx for NetApp ONTA support?
SCSI for block storage
NFS and SMB for file storage
SMB vs NFS
NFS is for Linux
SMB is for Windows
What is Amazon FSx for OpenZFS?
- Fully managed file storage service that helps you to move data residing in on-premises ZFS
- or other Linux-based file servers to AWS without changing your application code or how you manage data
- Great for latency-sensitive and small-file workloads with NAS data management capabilities (snapshots, and cloning), at a lower price than commercially licensed alternatives.
Protocol Support for Amazon FSx for OpenZFS
NFS
SMB is not supported on initial launch
Throughput and IOPS of Amazon FSx for OpenZFS
12.5 gigabytes per second (GB/s) of throughput and up to 1 million IOPS
Which file storage option should I use if I want to easily integrate with Microsoft Active Directory?
Amazon FSx for Windows File Server
Protocol Support for Amazon FSx for Windows File Server
SMB
Max storage size of Amazon FSx for Windows File Server per file system
64 TB
Ways that I can deploy a Microsoft SQL Server in AWS?
Amazon FSx for Windows File Server
Availability of Amazon FSx for Windows File Server
1 AZ by default but can support multi-az
Which file storage option should I use if I have an HPC, Machine Learning, parallel processing app
Amazon FSx for Lustre
Availability of Amazon FSx for Lustre
Single AZ Only
Throughput and IOPS of Amazon FSx for Lustre
Highest throughput
1+ TB/s throughput
IOPS (millions)
Protocol of Amazon FSx for Lustre
Posix compliant
Only file system that supports snapshots stored in S3
Amazon FSx for Lustre
True or False: EBS supports cross region attachments to EC2
FALSE
True or False: EBS is automatically replicated and encrypted
FALSE
It is automatically replicated by not encrypted (must be turned on)
What is EBS Multi-Attach?
New service that allows connecting of EBS on mutliple EC2s.
Only supports OPS SSD (io1 or io2) volumes and EC2 instances must be in the same AZ
What is the largest volume size that EBS supports
64 TB
What type of volumes (SSD or HDD) are good for transactional workloads with frequent read/write operations with small I/O size
SSD
What type of volumes (SSD or HDD) large streaming workloads that need high throughput performance.
HDD
General Purpose SSD (gp2)
Balance Price and Performance
- 1GB - 16TB
- 16,000 max IOps
- 250 - 1,000 MB/s Throughput
- Multiattach not supported
Provisioned IOPs SSD (io1)
Designed for latency sensitive transaction workloads
- 4GB- 16TB
- 64,000 Max
- 1,000 MB/s Thoughput
- Multiattach supported
Provisioned IOPs (Express) SSD
Designed for latency sensitive transaction workloads
- up to 64TB
- 256,000 MAX IOPS
- 4,000 MB/s Throughput
- Multiattach supported
Throughput Optimized HDD (st1)
Low cost for frequently accesses throughput intensive loads (Data Warehousing)
- 125GB-16TB
- 500 max IOPS
- 500 MB/s Throughput
Cold HDD (sc1)
Lowest cost for less frequently accessed workloads
- 125GB-16TB
- 250 max IOPS
- 250 MB/s Throughput
How to make a bucket public in S3
Remove Block all Public Access
Allow ACL creation
What native S3 service can I use to transform a S3 object and return result back to application?
S3 Object Lambda
AWS Services that can run queries directly on objects in S3
Athena
Amazon Redshift
Amazon Redshift Spectrum (complex queries and large datasets (up to exabytes))
How to grant another account access to a bucket
S3 Bucket Policy
How to grant access to a single object in S3
ACL
How to grant S3 time-limited access to others with temporary URLs
Query String Authentication
How to simplify managing data access to shared datasets in S3 by creating access points with names and permissions specific to each application or sets of applications.
S3 Access Points
How to privately connect to S3 from your on-premise environment
PrivateLink
How to connect your on-premise applications with S3 over AWS Direct Connect or AWS VPN?
VPC endpoint
Can S3 Versioning be turned off?
No, but you can suspend versioning which will retain old versions but will not create new versions
How to replicate objects in S3 from a source bucket to 1 or more destination buckets across multiple regions
S3 Cross Region Replication (CRR)
How to replicate objects in S3 from a source bucket to 1 or more destination buckets across a single regions
S3 Single Region Replication (SRR)
S3 Object Lock mode that allows specific IAM users to remove lock
Governance Mode
S3 Object lock mode that does not allow any removal of lock (can never delete object)
Compliance Mode
AWS S3 service that provides metrics on size, quantity, and characteristics of your storage.
Provides suggestions on how to optimize data storage
S3 Storage Lens
AWS S3 service that provides suggestions on when to transition less used data to save $$$
Use to improve lifecylce policies
S3 Storage Class Analysis
Glacier Flexible Retrieval times
1-5 minutes
For free bulk 5-12 hours
Glacier Deep Archive Retrival times
12 hours
AWS Storage Gateway Encryption
Encryption in tranit using SSL
For S3 default is SSE but can use custom keys
AWS Storage Gateway caching
5min - 3 days
Data in cache is NOT ENCRYPTED
Use cases of AWS Storage Gateways
Backing up data to the cloud from on-prem
AWS S3 File Gateway protocols
SMB or NFS
AWS S3 File Gateway Cache size
up to 64 TB
Min recommended 150 GiB
AWS FSx File Gateway Use Case
I need a Windows Shared FSx system in AWS with SMB protocol
Block Storage for volumes for your on-premise applications
AWS Volume Gateway
Storage Gateway to replace physical tapes on premises with virtual tapes stored in S3 and S3 Glacier
Tape Gateway
How to bring AWS into your data center as a rack for extreme low latency
AWS Outposts
AWS Outposts servie offerings
EC2, EBS, S3
Pricing options of AWS Outposts
3 years
All upfront, partial upfront, no upfront
S3 Object Lock: What is a legal hold?
Can be placed on a version and prevents overwritting or updating of a version
S3 Object Lock: What is a retention period?
Lock on a version of an object that prevents it for begin overwritten or updated for a specific period of time expires
Max number of SNS and SQS queues or lambda destinations to a bucket’s Amazon S3 events notification
1 maximum
If you need to process with 2 or more queues you will need to establish a fan out pattern
Scalable, high-available POSIX-compliant shared file system
EFS
Does EFS support windows?
No only linux
For Glacier setting that allow you to quickly access your data when occasional urgent requests for a subset of archives are required. If you need data in 1-5 min.
Expedited Retrieval
For Glacier, ensures that your retrieval capacity for expedited retrievals is available when you need it. Each unit of capacity provides that at least three expedited retrievals can be performed every five minutes and provides up to 150 MB/s of retrieval throughput
Provisioned Capacity
True or False, you pay for an EBS Volume only when it is attached to an EC2 instance
FALSE
S3, I want an another account to access my bucket, but I want them to have to pay for all of the data requests of data out of bucket. What can I do to set this up?
Enable the Requester Pays on the S3 bucket