AWS Storage Extras - Datasync/Different Storage Options Flashcards
What is AWS DataSync?
AWS DataSync is a service for moving and synchronizing large amounts of data between on-premises, other cloud locations, and AWS services such as Amazon S3, EFS, or FSx. It supports scheduled tasks and preserves metadata and file permissions.
What AWS services can AWS DataSync synchronize data with?
Amazon S3 (all storage classes, including Glacier)
Amazon EFS
Amazon FSx (all file systems)
Is AWS DataSync a continuous or scheduled service?
AWS DataSync is scheduled, meaning tasks can run hourly, daily, or weekly, but not continuously.
Does AWS DataSync preserve file metadata and permissions?
Yes, AWS DataSync preserves metadata and file permissions, including compliance with NFS POSIX and SMB security models.
Can AWS DataSync work in reverse, from AWS to on-premises?
Yes, AWS DataSync can synchronize data from AWS storage services back to on-premises servers.
What alternative can you use if your network capacity is insufficient for DataSync?
You can use an AWS Snowcone device, which comes pre-installed with the DataSync agent. It collects data on-premises and is shipped to AWS for synchronization.
Can AWS DataSync synchronize data between AWS storage services?
Yes, AWS DataSync can synchronize data between services like Amazon S3, EFS, and FSx, while preserving metadata and permissions.
Amazon S3 (Object Storage)
Use Case: Store and retrieve unstructured data as objects (e.g., images, videos, backups).
Key Features: Highly scalable, secure, durable, and supports a variety of storage classes.
Archiving: Use S3 Glacier for cost-effective, long-term data archiving.
Amazon EBS (Block Storage)
Use Case: Attach storage to a single EC2 instance for persistent block-level storage.
Key Features:
Multiple volume types (e.g., GP3 for general-purpose, IO1/IO2 for high-performance).
Multi-Attach: Allows IO1/IO2 volumes to be attached to multiple EC2 instances.
EC2 Instance Store (Ephemeral Storage)
Use Case: Temporary, high-performance storage directly attached to EC2 instances.
Key Features:
Very high IOPS.
Data persists only during the lifecycle of the instance.
Amazon EFS (Network File System)
Use Case: Shared file storage for Linux-based workloads across multiple availability zones.
Key Features:
POSIX-compliant.
Automatically scales based on usage.
Amazon FSx (Specialized File Systems)
FSx for Windows File Server: For Windows-based workloads with SMB support.
FSx for Lustre: High-performance computing with Lustre file system for HPC and ML workloads.
FSx for NetApp ONTAP: Network file system with NetApp ONTAP compatibility.
FSx for OpenZFS: Managed ZFS file system for advanced storage features.
Databases
Use Case: Store structured data with indexing and querying capabilities.
Examples: RDS, DynamoDB, Aurora (covered in separate sections).