Storage Flashcards
Types of storage in AWS
Block
File
Object
Block Storage
Examples
Dedicated, low-latency storage for each host.
Raw storage: data is organized as an array of unrelated blocks.
This storage is similar to direct-attached storage (DAS) or a storage area network (SAN).
Block-based cloud storage is provisioned with each virtual server and offers the ultra-low latency required for high-performance workloads.
Examples: hard disks, storage area networks (SANs), storage arrays, EBS
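To make "an array of unrelated blocks" concrete, here is a minimal sketch of a toy in-memory block device. The class name ToyBlockDevice, its methods, and the 512-byte block size are hypothetical choices for illustration only; they are not an AWS or EBS API.

```python
BLOCK_SIZE = 512  # bytes per block; an illustrative value, not an EBS setting


class ToyBlockDevice:
    """Hypothetical in-memory stand-in for a block volume."""

    def __init__(self, num_blocks: int) -> None:
        # The device is an array of equally sized blocks with no
        # relationship between them (no files, names, or metadata).
        self.blocks = [bytes(BLOCK_SIZE) for _ in range(num_blocks)]

    def write_block(self, index: int, data: bytes) -> None:
        if len(data) > BLOCK_SIZE:
            raise ValueError("data does not fit in one block")
        # Pad short writes with zero bytes so every block keeps the same size.
        self.blocks[index] = data.ljust(BLOCK_SIZE, b"\x00")

    def read_block(self, index: int) -> bytes:
        return self.blocks[index]


device = ToyBlockDevice(num_blocks=8)
device.write_block(3, b"hello")
first_bytes = device.read_block(3)[:5]
```

The caller addresses data only by block number. Anything like file names or directories has to be layered on top by a file system.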
File storage
Examples
Unrelated data blocks managed by a file-serving system. The native file system places the data on disk.
Examples: network-attached storage (NAS) appliances, Windows file servers, EFS, FSx
Object storage
Examples
Stores virtual containers that encapsulate the data, data attributes, metadata, and object IDs.
Examples: Ceph, OpenStack Swift, S3, S3 Glacier
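As a rough illustration of a container that bundles data, metadata, and an object ID, the sketch below models a toy object store. ToyObject, ToyObjectStore, and the key "reports/2024.csv" are hypothetical names chosen for this example; this is not the S3 API.

```python
import uuid
from dataclasses import dataclass, field


@dataclass
class ToyObject:
    """Hypothetical container: the data plus its metadata and an ID."""
    data: bytes
    metadata: dict = field(default_factory=dict)
    object_id: str = field(default_factory=lambda: str(uuid.uuid4()))


class ToyObjectStore:
    """Hypothetical flat key-to-object store (no directories, no blocks)."""

    def __init__(self) -> None:
        self._objects: dict[str, ToyObject] = {}

    def put(self, key: str, data: bytes, **metadata: str) -> str:
        # The whole object is stored and replaced as one unit.
        obj = ToyObject(data=data, metadata=dict(metadata))
        self._objects[key] = obj
        return obj.object_id

    def get(self, key: str) -> ToyObject:
        return self._objects[key]


store = ToyObjectStore()
object_id = store.put("reports/2024.csv", b"a,b\n1,2\n", content_type="text/csv")
fetched = store.get("reports/2024.csv")
```

Unlike the block model, the unit of access here is the whole object, and its metadata travels with it.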
Amazon EFS
Amazon EFS provides a scalable, elastic file system for Linux-based workloads for use with AWS Cloud services and on-premises resources.
You can create a file system, mount it on an Amazon EC2 instance, and then read and write data to and from your file system. You can mount an Amazon EFS file system in your VPC by using the Network File System version 4.0 and 4.1 (NFSv4) protocols. You do not need to take action to expand the file system as your storage needs grow.
EC2 instances in your VPC can access Amazon EFS file systems concurrently, so applications that scale beyond a single connection can access a file system.
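The mount step can be sketched as a small helper that builds the NFS mount command for an EFS file system. The function name efs_mount_command, the file system ID fs-12345678, and the mount point /mnt/efs are placeholders, not values from these notes. The DNS name pattern and the NFS options follow the mount settings AWS recommends for Linux clients, and the target directory is assumed to exist already.

```python
def efs_mount_command(file_system_id: str, region: str, mount_point: str) -> str:
    """Build the shell command that mounts an EFS file system over NFSv4.1."""
    # EFS file systems resolve at <file-system-id>.efs.<region>.amazonaws.com
    dns_name = f"{file_system_id}.efs.{region}.amazonaws.com"
    # NFS client options recommended by AWS for mounting EFS on Linux.
    options = ",".join([
        "nfsvers=4.1",
        "rsize=1048576",
        "wsize=1048576",
        "hard",
        "timeo=600",
        "retrans=2",
        "noresvport",
    ])
    return f"sudo mount -t nfs4 -o {options} {dns_name}:/ {mount_point}"


# Placeholder ID, Region, and mount point for illustration only.
command = efs_mount_command("fs-12345678", "us-east-1", "/mnt/efs")
```

Running the same command on several EC2 instances in the VPC gives each of them concurrent access to the same file system.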
Amazon EFS Availability & Durability
Availability and durability refer to the redundancy with which an Amazon EFS file system stores data within an AWS Region. You have the following choices for your file system's availability and durability:
* Standard storage class - creates a file system that stores file system data and metadata redundantly across all Availability Zones within an AWS Region
* One Zone storage class - creates a file system that stores file system data and metadata redundantly within a single Availability Zone; only a single mount target is allowed
EFS Benefits
Scale throughput based on storage use or provision throughput independent of storage
Automatically grows and shrinks storage without provisioning
Fully managed service, which lowers total cost of ownership (TCO)
FSx
Launch, run, and scale feature-rich, high-performing file systems.
Provides four file systems to choose from:
FSx for Windows
FSx for Lustre
FSx for NetApp ONTAP
FSx for OpenZFS
FSx for Windows
Amazon FSx for Windows File Server provides fully managed Microsoft Windows file servers that are backed by a native Windows file system.
Built on Windows Server, Amazon FSx delivers a wide range of administrative features such as data deduplication, end-user file restore, and Microsoft Active Directory integration.
FSx for Lustre
FSx for Lustre is a fully managed service that provides high-performance, cost-effective storage. FSx for Lustre is compatible with the most popular Linux-based AMIs.
FSx for NetApp ONTAP
Fully managed shared storage in the AWS Cloud with the popular data access and management capabilities of ONTAP.
FSx for OpenZFS
Provides fully managed shared file storage built on the OpenZFS file system. It is powered by the AWS Graviton family of processors and is accessible through the NFS protocol.
AWS Data Migration Tools, Online vs Offline?
Online
AWS Storage Gateway
AWS DataSync
AWS Transfer Family
Offline
AWS Snowball Edge
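One way to reason about online versus offline migration is to estimate how long a network transfer would take. The sketch below is a back-of-the-envelope calculation, not an AWS tool. The function name transfer_days and the 80% link utilization default are assumptions made for illustration.

```python
SECONDS_PER_DAY = 86_400


def transfer_days(data_terabytes: float, link_mbps: float,
                  utilization: float = 0.8) -> float:
    """Estimate days needed to move data over a network link.

    utilization is an assumed fraction of the link that the transfer
    can actually use (0.8 is an illustrative guess, not a measured value).
    """
    total_bits = data_terabytes * 1e12 * 8          # decimal terabytes to bits
    effective_bps = link_mbps * 1e6 * utilization   # usable bits per second
    return total_bits / effective_bps / SECONDS_PER_DAY


days_small = transfer_days(10, 1000)     # 10 TB over a 1 Gbps link
days_large = transfer_days(1000, 1000)   # 1 PB over the same link
```

Under these assumptions, 10 TB takes a little over a day, which suits an online tool, while 1 PB takes roughly 116 days, which is where shipping a Snowball Edge device starts to look attractive.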
AWS Storage Gateway
Seamlessly connect and extend your on-premises applications to AWS storage. It supports multiple protocols: Server Message Block (SMB), Network File System (NFS), and Internet Small Computer Systems Interface (iSCSI).
Sync data from on premises to AWS by using the SMB, NFS, and iSCSI protocols.
AWS DataSync
A data transfer service that facilitates moving data between on-premises storage and Amazon EFS, Amazon FSx, and Amazon S3.
By default, data is encrypted in transit by using TLS 1.2.
Sync files from on-premises file storage to Amazon EFS, Amazon FSx, and Amazon S3.
DataSync deploys as a single software agent that can connect to multiple shared file systems and run multiple tasks.
AWS Transfer Family
Transfer files into and out of Amazon S3 or Amazon EFS by using SFTP, FTPS, FTP, and AS2.
AWS Snowball Edge
Device with onboard storage and compute power for select AWS capabilities. Snowball Edge can process data locally, run edge computing workloads, and transfer data to or from the AWS Cloud.
Includes Snowball Edge and Snowball Edge Optimized.
Move petabytes of data to AWS by using appliances that are designed for secure, physical transport.
Create a job in the console, and a Snowball appliance will be shipped to you.
Storage Gateway types
Amazon S3 File Gateway
Amazon FSx for Windows File Server
Tape Gateway
Volume Gateway
Amazon S3 File Gateway
Amazon S3 File Gateway - provides a native file interface to store files as objects in Amazon S3. It uses the NFS and SMB file protocols.
Amazon FSx for Windows File Server
Amazon FSx for Windows File Server - provides fully managed Microsoft Windows file servers, backed by a fully native Windows file system.
FSx for Windows File Server has the features, performance, and compatibility to easily lift and shift enterprise applications to the AWS Cloud.
Tape Gateway
Tape Gateway - presents an iSCSI-based virtual tape library (VTL) of virtual tape drives and a virtual media changer to your on-premises backup application. Tape Gateway stores your virtual tapes in Amazon S3 and creates new ones automatically.
Volume Gateway
Volume Gateway - presents block storage volumes to your applications by using the iSCSI protocol. You can asynchronously back up data that is written to these volumes as point-in-time snapshots, which are stored in the cloud as Amazon EBS snapshots.
What protocols does AWS Storage Gateway support for on-premises data?
NFS or SMB for files
iSCSI for volumes
iSCSI VTL for tapes
What destinations can AWS Storage Gateway move data to?
* Amazon S3 (Amazon S3 File Gateway, Tape Gateway)
* Amazon S3 Glacier (Amazon S3 File Gateway, Tape Gateway)
* Amazon FSx for Windows File Server
* Amazon EBS (Volume Gateway)
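The protocol and destination cards can be combined into one lookup table, sketched below. The dictionary GATEWAYS and the two helper functions are hypothetical names for this example. The FSx entry is keyed as "Amazon FSx File Gateway", the AWS name for the gateway type that fronts FSx for Windows File Server, and its SMB protocol is taken from AWS documentation rather than from these cards.

```python
# Gateway type -> on-premises protocols and AWS destinations (from the cards above).
GATEWAYS = {
    "Amazon S3 File Gateway": {
        "protocols": ["NFS", "SMB"],
        "destinations": ["Amazon S3", "Amazon S3 Glacier"],
    },
    # Assumption: SMB comes from AWS documentation, not from these notes.
    "Amazon FSx File Gateway": {
        "protocols": ["SMB"],
        "destinations": ["Amazon FSx for Windows File Server"],
    },
    "Tape Gateway": {
        "protocols": ["iSCSI VTL"],
        "destinations": ["Amazon S3", "Amazon S3 Glacier"],
    },
    "Volume Gateway": {
        "protocols": ["iSCSI"],
        "destinations": ["Amazon EBS"],
    },
}


def gateways_for_protocol(protocol: str) -> list[str]:
    """Return gateway types that expose the given on-premises protocol."""
    return sorted(name for name, info in GATEWAYS.items()
                  if protocol in info["protocols"])


def gateways_for_destination(destination: str) -> list[str]:
    """Return gateway types that can move data to the given AWS service."""
    return sorted(name for name, info in GATEWAYS.items()
                  if destination in info["destinations"])


smb_gateways = gateways_for_protocol("SMB")
ebs_gateways = gateways_for_destination("Amazon EBS")
```

Querying the table in both directions mirrors the two flashcard questions: which gateway speaks a given protocol, and which gateway lands data in a given service.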
Snowball Edge Optimized
Snowball Edge Optimized is ideal for edge processing use cases that require additional computing, memory, and storage power in remote, disconnected, or harsh environments.