Migration - Datasync Flashcards
What is datasync?
online data transfers that simplify and automate copying of large amounts of data from on-prem to AWS storage systems and between AWS storage systems
What type of data does Datasync copy?
between NFS, SMB, EFS, S3, FSx (windows file server) and self managed object storage
Why use AWS Datasync?
copy large datasets without the need for custom solutions to AWS, archive data from on-prem; replicate data to AWS for BC and transfer data for analysis and processing
What problem does datasync solve?
reduces complexity and cost of online data transfer; making it simple to transfer datasets between on-prem and AWS storage services
How does it work?
- deploy an agent and associate to an AWS account via MC or API. 2. Create data transfer task from data source to destination. 3. start transfer and monitor it with CloudWatch
What is an agent?
VM used to read/write data to and from on-prem location
What is a location?
any source or destination used in transfer
What is a task?
includes 2 locations and how data is transferred between the 2 locations. The conf settings include how data is treated
What is task execution?
An individual run of a task; includes start time, end time, bytes written and status
Which protocol does it use to transfer data?
AWS-designed transfer protocol, decoupled from storage protocol to increase data transfer
How fast is it?
a single task up to 10 GBps on a network link
Is data encrypted?
all data is transferred using TLS; S3 SSE-S# and EFS encryption for data at rest
Datasync vs Storage gateway
Storage gateway on-prem access to S3; Datasync transfers large amounts of data to and from AWS services over the internet or Direct Connect
How much does it cost?
pay for the amount of data you copy; charged for request; storage, and data transfer
What are some limitation?
100 max tasks per region; 50 Mil files per task; 10 GBps throughput