Snow Family Flashcards
Highly-secure, portable devices to collect and process data at the edge,
and migrate data into and out of AWS
AWS Snow Family
What are 2 processes you can use the AWS Snow Family for?
Data Migration
Edge Computing
What are the 3 Snow Family products for Data Migration?
Snowcone
Snowball Edge
Snowmobile
What are the 2 Snow Family products for Edge Computing?
Snowcone
Snowball Edge
What are the 5 challenges of data migrations that the AWS Snow Family can help with?
- Limited connectivity
- Limited bandwidth
- High network cost
- Shared bandwidth (can’t maximize the line)
- Connection stability
Offline devices to perform data migrations
AWS Snow Family
If it takes more than a ______ to transfer over the network, use Snowball devices!
week
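As a rough illustration of the one-week rule above, the sketch below estimates how long a network transfer would take. The function names and the 80% default link utilization are assumptions made for this example, not AWS guidance.

```python
SECONDS_PER_DAY = 86_400


def transfer_days(data_tb: float, link_mbps: float, utilization: float = 0.8) -> float:
    """Estimate the days needed to move data_tb (decimal TB) over a link_mbps link."""
    bits = data_tb * 1e12 * 8                      # terabytes -> bits
    effective_bps = link_mbps * 1e6 * utilization  # usable bits per second (assumed share)
    return bits / effective_bps / SECONDS_PER_DAY


def should_use_snowball(data_tb: float, link_mbps: float, utilization: float = 0.8) -> bool:
    """Apply the flashcard rule: more than a week over the network -> use Snowball."""
    return transfer_days(data_tb, link_mbps, utilization) > 7


# 100 TB over a fully used 1 Gbps line takes roughly 9.26 days, so Snowball wins.
print(round(transfer_days(100, 1000, utilization=1.0), 2))
print(should_use_snowball(100, 1000, utilization=1.0))
```

On a shared or unstable line, lowering `utilization` shows how quickly the estimate grows past one week.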
- Physical data transport solution: move TBs or PBs of data in or out of AWS
- Alternative to moving data over the network (and paying network fees)
- Pay per data transfer job
- Provide block storage and Amazon S3-compatible object storage
Snowball Edge (for data transfers)
What are the 2 flavors of Snowball Edge?
Storage Optimized
Compute Optimized
80 TB of HDD capacity for block volume and S3 compatible object storage
Snowball Edge Storage Optimized
42 TB of HDD or 28 TB of NVMe capacity for block volume and S3 compatible object storage
Snowball Edge Compute Optimized
What are 3 use cases for Snowball Edge?
large data cloud migrations
DC decommission
disaster recovery
- Small, portable computing, anywhere, rugged & secure, withstands harsh environments
- Light (4.5 pounds, 2.1 kg)
- Device used for edge computing, storage, and data transfer
AWS Snowcone
2 types of Snowcones?
Snowcone
Snowcone SSD
8 TB of HDD Storage
Snowcone
14 TB of SSD Storage
Snowcone SSD
Where to use Snowcones?
Where Snowball does not fit (space-constrained environment)
For Snowcones, do you have to provide your own battery / cables?
YES
Snowcones can be sent back to ________, or connected to the internet to send data using _________
AWS offline
AWS DataSync
- Transfer exabytes of data (1 EB = 1,000 PB = 1,000,000 TBs)
- Each Snowmobile has 100 PB of capacity (use multiple in parallel)
- High security: temperature controlled, GPS, 24/7 video surveillance
- Better than Snowball if you transfer more than 10 PB
AWS Snowmobile
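The capacity figures on this card lend themselves to a quick sizing calculation. The sketch below is a minimal illustration using only the numbers above (100 PB per Snowmobile, 10 PB as the point where Snowmobile beats Snowball); the function and constant names are hypothetical.

```python
import math

SNOWMOBILE_CAPACITY_PB = 100   # per-truck capacity from the card
SNOWMOBILE_THRESHOLD_PB = 10   # above this, the card prefers Snowmobile


def snowmobiles_needed(data_pb: float) -> int:
    """Number of Snowmobiles required to hold data_pb petabytes."""
    return math.ceil(data_pb / SNOWMOBILE_CAPACITY_PB)


def suggest_device(data_pb: float) -> str:
    """Pick a device family using the 10 PB rule of thumb from the card."""
    return "Snowmobile" if data_pb > SNOWMOBILE_THRESHOLD_PB else "Snowball Edge"


# 1 EB = 1,000 PB, which needs 10 Snowmobiles running in parallel.
print(snowmobiles_needed(1000))
print(suggest_device(5))
```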
What are the 6 steps of the Snow Family usage process?
- Request Snowball devices from the AWS console for delivery
- Install the Snowball client / AWS OpsHub on your servers
- Connect the Snowball to your servers and copy files using the client
- Ship back the device when you’re done (goes to the right AWS facility)
- Data will be loaded into an S3 bucket
- Snowball is completely wiped
Snow Family – Edge Computing -
* 2 CPUs, 4 GB of memory, wired or wireless access
* USB-C power using a cord or the optional battery
Snowcone
Snowcone SSD (smaller)
Snow Family – Edge Computing -
* 104 vCPUs, 416 GiB of RAM
* Optional GPU (useful for video processing or machine learning)
* 28 TB NVMe or 42 TB HDD usable storage
Snowball Edge – Compute Optimized
Snow Family – Edge Computing -
* Up to 40 vCPUs, 80 GiB of RAM, 80 TB storage
* Object storage clustering available
Snowball Edge – Storage Optimized
Can you run EC2 Instances and AWS Lambda Functions on Snow Family?
YES
What AWS product is used to run AWS Lambda functions on Snow Family devices?
AWS IoT Greengrass
What are the 2 long-term deployment options for discount pricing on Snow Family products?
1 and 3 years
What AWS software do you use to manage Snow Family Devices?
AWS OpsHub
- Unlocking and configuring single or clustered devices
- Transferring files
- Launching and managing instances running on Snow Family Devices
- Monitor device metrics (storage capacity, active instances on your device)
- Launch compatible AWS services on your devices (ex: Amazon EC2 instances, AWS DataSync, Network File System (NFS))
AWS OpsHub
Can you import from Snowball directly into S3 Glacier?
NO
How do you import from Snowball into S3 Glacier?
You must use Amazon S3 first, in combination with an S3 lifecycle policy
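The lifecycle policy mentioned in this answer can be sketched as a plain configuration dictionary. This is a minimal illustration: the rule ID, the `snowball-import/` prefix, and the 1-day delay are assumptions chosen for the example; the dictionary follows the shape that boto3's `put_bucket_lifecycle_configuration` accepts as `LifecycleConfiguration`.

```python
import json


def glacier_lifecycle_config(prefix: str, days: int = 1) -> dict:
    """Build a lifecycle configuration that moves objects under prefix to S3 Glacier."""
    return {
        "Rules": [
            {
                "ID": "snowball-import-to-glacier",  # hypothetical rule name
                "Filter": {"Prefix": prefix},        # only objects under this prefix
                "Status": "Enabled",
                "Transitions": [{"Days": days, "StorageClass": "GLACIER"}],
            }
        ]
    }


# Hypothetical prefix where the Snowball import job landed the data.
config = glacier_lifecycle_config("snowball-import/")
print(json.dumps(config, indent=2))
```

With boto3 installed and credentials configured, `config` could then be passed to `put_bucket_lifecycle_configuration` together with the target bucket name.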
- Launch 3rd party high-performance file systems on AWS
- Fully managed service
Amazon FSx
What are the 4 file systems supported on FSx?
Lustre
Windows File Server
NetApp ONTAP
OpenZFS
- FSx for Windows is a fully managed Windows file system share drive
- Supports SMB protocol & Windows NTFS
- Microsoft Active Directory integration, ACLs, user quotas
- Can be mounted on Linux EC2 instances
- Supports Microsoft’s Distributed File System (DFS) Namespaces (group files across multiple FS)
Amazon FSx for Windows (File Server)
Amazon FSx for Windows (File Server) - Scale up to ____ of GB/s, _______ of IOPS, _______ PB of data
10s
millions
100s
Amazon FSx for Windows (File Server) Storage Options
- SSD – latency sensitive workloads (databases, media processing, data analytics, …)
- HDD – broad spectrum of workloads (home directory, CMS, …)
Can Amazon FSx for Windows (File Server) be accessed from your on-premises infrastructure?
YES
(VPN or Direct Connect)
Can Amazon FSx for Windows (File Server) be configured to be Multi-AZ (high availability)?
YES
How often are Amazon FSx for Windows (File Server) data backups done?
Daily to S3
- Machine Learning, High Performance Computing (HPC)
- Video Processing, Financial Modeling, Electronic Design Automation
Amazon FSx for Lustre
Amazon FSx for Lustre - Scales up to _____ GB/s, ________ of IOPS, _______ latencies
100s
millions
sub-ms
Amazon FSx for Lustre Storage Options
- SSD – low-latency, IOPS intensive workloads, small & random file operations
- HDD – throughput-intensive workloads, large & sequential file operations
Amazon FSx for Lustre - Can “read S3” as a file system (through FSx)
READ THAT AGAIN
Amazon FSx for Lustre - Can write the output of the computations back to S3 (through FSx)
READ THAT AGAIN
Can Amazon FSx for Lustre be used from on-premises servers?
YES
(VPN or Direct Connect)
FSx Lustre - File System Deployment Options (2)
Scratch File System
Persistent File System
FSx Lustre - File System Deployment Options -
* Temporary storage
* Data is not replicated (doesn’t persist if file server fails)
* High burst (6x faster, 200 MBps per TiB)
* Usage: short-term processing, optimize costs
Scratch File System
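The per-TiB figure on this card means throughput scales with provisioned storage. The sketch below is a minimal illustration that takes the 200 MBps per TiB number from the card at face value; it does not model the 6x burst, and the function name is hypothetical.

```python
SCRATCH_MBPS_PER_TIB = 200  # per-TiB throughput figure from the card


def scratch_throughput_mbps(storage_tib: float) -> float:
    """Aggregate throughput (MBps) for a Scratch file system of storage_tib TiB."""
    if storage_tib <= 0:
        raise ValueError("storage_tib must be positive")
    return SCRATCH_MBPS_PER_TIB * storage_tib


# A 4 TiB Scratch file system would offer about 800 MBps under this assumption.
print(scratch_throughput_mbps(4))
```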
FSx Lustre - File System Deployment Options -
* Long-term storage
* Data is replicated within same AZ
* Replace failed files within minutes
* Usage: long-term processing, sensitive data
Persistent File System
- File System compatible with NFS, SMB, iSCSI protocols
- Move workloads running on ONTAP or NAS to AWS
- Works with:
- Linux
- Windows
- MacOS
- VMware Cloud on AWS
- Amazon WorkSpaces & AppStream 2.0
- Amazon EC2, ECS and EKS
- Storage shrinks or grows automatically
- Snapshots, replication, low-cost, compression and data de-duplication
- Point-in-time instantaneous cloning (helpful for testing new workloads)
Amazon FSx for NetApp ONTAP
- Managed OpenZFS file system on AWS
- File System compatible with NFS (v3, v4, v4.1, v4.2)
- Move workloads running on ZFS to AWS
- Works with:
- Linux
- Windows
- MacOS
- VMware Cloud on AWS
- Amazon WorkSpaces & AppStream 2.0
- Amazon EC2, ECS and EKS
- Up to 1,000,000 IOPS with < 0.5 ms latency
- Snapshots, compression and low-cost
- Point-in-time instantaneous cloning (helpful for testing new workloads)
Amazon FSx for OpenZFS
How do you expose the S3 data on-premises?
AWS Storage Gateway
What are 4 reasons to use hybrid cloud storage (AWS Storage Gateway)?
- disaster recovery
- backup & restore
- tiered storage
- on-premises cache & low-latency files access
4 Types of Storage Gateway
- S3 File Gateway
- FSx File Gateway
- Volume Gateway
- Tape Gateway
- Configured S3 buckets are accessible using the NFS and SMB protocols
- Most recently used data is cached in the file gateway
- Supports S3 Standard, S3 Standard-IA, S3 One Zone-IA, S3 Intelligent-Tiering
- Transition to S3 Glacier using a Lifecycle Policy
- Bucket access using IAM roles for each File Gateway
- SMB Protocol has integration with Active Directory (AD) for user authentication
Amazon S3 File Gateway
- Native access to Amazon FSx for Windows File Server
- Local cache for frequently accessed data
- Windows native compatibility (SMB, NTFS, Active Directory…)
- Useful for group file shares and home directories
Amazon FSx File Gateway
- Block storage using iSCSI protocol backed by S3
- Backed by EBS snapshots which can help restore on-premises volumes!
- Cached volumes: low latency access to most recent data
- Stored volumes: entire dataset is on premises, scheduled backups to S3
Volume Gateway
- Some companies have backup processes using physical tapes (!)
- With Tape Gateway, companies use the same processes, but in the cloud
- Virtual Tape Library (VTL) backed by Amazon S3 and Glacier
- Back up data using existing tape-based processes (and iSCSI interface)
- Works with leading backup software vendors
Tape Gateway
- Works with File Gateway, Volume Gateway, Tape Gateway
- Has the required CPU, memory, network, SSD cache resources
- Helpful for daily NFS backups in small data centers
Storage Gateway – Hardware appliance
Look at slide 367
A fully-managed service for file transfers into and out of Amazon S3 or Amazon EFS using the FTP protocol
* Managed infrastructure, Scalable, Reliable, Highly Available (multi-AZ)
* Pay per provisioned endpoint per hour + data transfers in GB
* Store and manage users’ credentials within the service
* Integrate with existing authentication systems (Microsoft Active Directory, LDAP, Okta, Amazon Cognito, custom)
* Usage: sharing files, public datasets, CRM, ERP, …
AWS Transfer Family
AWS Transfer Family - Supported Protocols
- AWS Transfer for FTP (File Transfer Protocol (FTP))
- AWS Transfer for FTPS (File Transfer Protocol over SSL (FTPS))
- AWS Transfer for SFTP (Secure File Transfer Protocol (SFTP))
- Move large amounts of data to and from:
- On-premises / other cloud to AWS (NFS, SMB, HDFS, S3 API…) – needs agent
- AWS to AWS (different storage services) – no agent needed
- Can synchronize to:
- Amazon S3 (any storage classes – including Glacier)
- Amazon EFS
- Amazon FSx (Windows, Lustre, NetApp, OpenZFS…)
- Replication tasks can be scheduled hourly, daily, weekly
- File permissions and metadata are preserved (NFS POSIX, SMB…)
- One agent task can use 10 Gbps, can set up a bandwidth limit
AWS DataSync
Object Storage
S3
Object Archival
S3 Glacier
Network storage for one EC2 instance at a time
EBS volumes
Physical storage for your EC2 instance (high IOPS)
Instance Storage
Network File System for Linux instances, POSIX filesystem
EFS
Network File System for Windows servers
FSx for Windows
High Performance Computing Linux file system
FSx for Lustre
High OS Compatibility
FSx for NetApp ONTAP
Managed ZFS file system
FSx for OpenZFS
S3 & FSx File Gateway, Volume Gateway (cache & stored), Tape Gateway
Storage Gateway
FTP, FTPS, SFTP interface on top of Amazon S3 or Amazon EFS
Transfer Family
Schedule data sync from on-premises to AWS, or AWS to AWS
DataSync
Move large amounts of data to the cloud, physically
Snowcone / Snowball / Snowmobile
For specific workloads, usually with indexing and querying
Database