Storage Extras Flashcards
What is the snow family?
What are some of the challanges that the snow family helps overcome?
What is the rule of thumb when it comes to using a snow device?
Where does the data go when it is uploaded to AWS?
Highly Secure, portable devices to collect and process data at the edge and migrate data into and out of aws. They’re offline devices that allow us to perform data migrations
Time it takes to upload/download Limited connectivity/Connection stability Limited bandwidth Shared bandwidth (Can't maximize line) High network cost
If it will take more than a week to transfer your data.
To s3 buckets
What is snowball edge?
What are the two flavors of snowball edge?
What are the usecases for snowball edge?
Physical data transport solution to move TBs or PBs of data in or out of AWS.
Snowball Edge Storage Optimized - 80TB HDD for block volume and S3 compatible object storage
Snowball Edge Compute Optimized - 42 TB of HDD capacity for block volume and S3 compatible object storage
Large data cloud migrations, data center decommissions, disaster recovery up to 10PB
What is AWS snowcone?
How much storage do you get with snowcone?
When should you use snowcone?
What are the two ways that you can get data to sync from your snowcone to AWS?
Small portable computing anywhere, rugged and secure. Built to withstand harsh environments.
8TB storage
Use snowcone where snowball does not fit and up to 24 TB
You can ship it to AWS or you can connect it to a network and use aws datasync.
What is AWS snowmobile?
What is the storage capacity of AWS snowmobile?
What is a good use case for this?
It is a physical truck that comes to pick your data up. The truck is temperature controlled, has GPS and 24/7 security.
100PB
When you need to transfer more than 10PB of data
What is edge computing.
What are the show family devices that can be used for edge computing?
What are the usecases of edge computing?
Edge computing allows you to process data being created at an edge location that has limited/no internet access or limited/no access to computing power.
Snowball and snowcone
Preprocessing data
Machine learning at the edge
Transcoding media streams
What are the specs on a snowcone edge
Snowball edge compute optimized?
Snowball edge storage optimized.
What type of compute runs on these boxes?
How can you get discounted pricing of these devices?
2 CPUs 4gb ram. Wired or wireless access. USB-C power using a cord or an optional battery
52 vCPUs 208 GB of Ram, optional GPU (useful for video processing) and 42TB of storage
up to 40 vCPUs, 80GB ram. Object storage clustering available
EC2 instaces and lambda functions using AWS IoT Greengrass
By using longterm deployment options of 1-3 years.
What is AWS opshub?
You can use ops hub to
Unlock and configure single or clustered devices
Transfer files
Launch and manage instances running on snow family devices
Monitor device metrics (storage capacity, active instances etc)
Launch compatible AWS services on your devices
How can you migrate data from from the snow family to glacier?
By creating a lifecycle policy for your s3 bucket to move the data from the s3 bucket to glacier.
What problem is FSx for Windows solving?
So what is FSx?
The elastic file servic (the infinitely scalable netowrk atached drive) is built on top of a posix system which means it’s only available for Linux. This service solves that problem by making a “EFS” that is mountable by windows.
FSx is a fully managed windows file system share drive that supports SMB protocal and Windows NTFS? Thin “shared storage for windows”
Tell me about the directory service for FSx.
How does FSx Scale?
How can it be accessed?
Hoes does HA and Backups work for FSx?
It utilizes Micorsoft Active Directory integration, ACLs and user Quotas
It’s built on SSD to scale up to 10s of GB/s, millions of IOPS, 100s PB of data
From the cloud and from on prem infrastructure.
FSx can be configured to be Multi-AZ and the data is backed up daily to S3
What is Amazon FSx for Luster?
What are some of the use-cases for FSx for Luster?
How does scaling work for FSx for Luster?
How does FSx have a seamless integration with S3?
How can it be accessed?
Its a high performance computing cluster for linux that has a file system that is shared with high IOPS, high throughput, low latency and integration with S3 as a backend.
High performance computing, machine learning
Ex: Video processing, financial modeling, eletronic design automation
Scales up to 100s GBs, millions of IOPS, sub-ms latencies
It uses S3 as a filesystem
From the cloud and from on prem infrastructure.
What are the two file systems for FSx?
What is a scratch file system?
What is a persistent file system?
Scratch and Persistent.
Temporary Storage
Data is not replicated (Doesn’t persist if server fails
High burst (6x faster, 200mbps per TiB)
Used for short-term processing, optimize costs
Long term storage
Data replicated within same AZ
Replace failed files within minutes
Used for long-term processing, sensitive data
What is the AWS Transfer family?
How are you charged for the transfer family?
How does authentication work for the transfer family?
What is the use-case for the transfer family?
The AWS Transfer family gives you a way to add data to an S3 bucket or EFS though FTP (within vpc), SFTP, or FTPS. It is scalable, reliable, and highly available (multi-az)
You pay per provisioned endpoint per hours + data transfer in GB
You can store and manage users credentials witin the service, Integrate with an existting authentication system such as MSAD, LDAP, Okta, Amazon cognito, custom
When yo u just want to upload using ftp/sftp . Sharing files, public data sets, CRM ERP
What is AWS Storage Gateway?
What are the use-cases of AWS Storage Gateway?
What are the 3 types of storage gateway?
Storage gateway bridges your on-prem data and the storage in the cloud in S3.
Disaster recovery, backup and restore, tiered storage. Gives you a way to expand your NFS on-prem by leveraging amazon s3
Files gateway
Volume gateway
Tape gateway
How is your data stored using the file gateway?
What S3 types are supported?
How is access provided between the file gateway and buckets?
How does caching work for a file gateway?
How can you authenticate users who want to access files via the file gateway?
Configured S3 buckets are exposed via NFS and SMB protocol
Standard S3, S3 IA, S3 One Zone IA
IAM Roles.
The most recently used data is cached in the file gateway
Active directory authentication