AWS DataSync Flashcards
What is AWS DataSync?
It enables organizations to securely and efficiently transfer large amounts of data between on-premises storage systems, AWS storage services, and other cloud storage providers.
It can move data between AWS storage services (for example, S3 to EFS) without an agent. To move data from on-premises storage to AWS, you must install the DataSync agent on premises.
It can synchronize to S3, EFS, and FSx
Replication tasks can be scheduled hourly, daily, or weekly. Replication is not continuous; it runs only on the schedule you set.
It preserves file permissions and metadata when moving files from one place to another.
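As a rough illustration of the scheduling and metadata points above, the sketch below builds the arguments one might pass to the boto3 DataSync `create_task` call. The helper name `build_task_request`, the interval labels, and the placeholder ARNs are assumptions made for this example; the rate expressions are one plausible way to express hourly, daily, and weekly schedules.

```python
# Hypothetical helper: builds keyword arguments for DataSync create_task.
# Interval labels and the function name are assumptions for illustration.
SCHEDULES = {
    "hourly": "rate(1 hour)",
    "daily": "rate(1 day)",
    "weekly": "rate(7 days)",
}


def build_task_request(source_location_arn, destination_location_arn, interval="daily"):
    """Return a dict of create_task arguments for a scheduled transfer."""
    if interval not in SCHEDULES:
        raise ValueError(f"unsupported interval: {interval!r}")
    return {
        "SourceLocationArn": source_location_arn,
        "DestinationLocationArn": destination_location_arn,
        # The task runs only when the schedule fires; it is not continuous.
        "Schedule": {"ScheduleExpression": SCHEDULES[interval]},
        # Ask DataSync to keep POSIX permissions and modification times.
        "Options": {"PosixPermissions": "PRESERVE", "Mtime": "PRESERVE"},
    }


# Placeholder ARNs, not real resources.
request = build_task_request(
    "arn:aws:datasync:us-east-1:111122223333:location/loc-source",
    "arn:aws:datasync:us-east-1:111122223333:location/loc-dest",
    interval="hourly",
)

# With boto3 installed and credentials configured, this could be sent as:
#   import boto3
#   boto3.client("datasync").create_task(**request)
```

Keeping the request as a plain dict makes it easy to inspect before anything is sent to AWS.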
What can you do if you want to run DataSync but don't have the network capacity?
You can use AWS Snowcone, because it comes with the DataSync agent pre-installed.
How do you connect AWS DataSync privately from an on-premises server?
Connect the on-premises DataSync agent to AWS over Direct Connect. In AWS, create a VPC endpoint (AWS PrivateLink) for DataSync. The agent then reaches the DataSync service through that VPC endpoint over the Direct Connect link, so the traffic stays off the public internet.
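To make the private setup more concrete, here is a minimal sketch of the arguments one might pass to the boto3 DataSync `create_agent` call when the agent is tied to a VPC endpoint. The helper name `build_private_agent_request` and every identifier value are placeholders assumed for this example; the Direct Connect link itself is set up separately and is not shown.

```python
# Hypothetical helper: builds keyword arguments for DataSync create_agent
# when the agent should communicate through a VPC endpoint (PrivateLink).
def build_private_agent_request(activation_key, agent_name, vpc_endpoint_id,
                                subnet_arn, security_group_arn):
    """Return a dict of create_agent arguments for a privately connected agent."""
    return {
        "ActivationKey": activation_key,
        "AgentName": agent_name,
        # The interface VPC endpoint the agent connects to.
        "VpcEndpointId": vpc_endpoint_id,
        # Subnet and security group associated with that endpoint.
        "SubnetArns": [subnet_arn],
        "SecurityGroupArns": [security_group_arn],
    }


# All values below are placeholders, not real resources.
agent_request = build_private_agent_request(
    activation_key="AAAAA-BBBBB-CCCCC-DDDDD-EEEEE",
    agent_name="on-prem-agent",
    vpc_endpoint_id="vpce-0123456789abcdef0",
    subnet_arn="arn:aws:ec2:us-east-1:111122223333:subnet/subnet-0123456789abcdef0",
    security_group_arn="arn:aws:ec2:us-east-1:111122223333:security-group/sg-0123456789abcdef0",
)

# With boto3 installed and credentials configured, this could be sent as:
#   import boto3
#   boto3.client("datasync").create_agent(**agent_request)
```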
What is AWS Data Exchange?
It allows you to find, subscribe to and use third-party data in the cloud. Once subscribed, you can use AWS Data Exchange API to load data directly into S3 and analyze it.
There is also AWS Data Exchange for Redshift, which does the same but lets you query the subscribed data with Redshift.
There is also AWS Data Exchange for APIs, which you can use to find and subscribe to third-party APIs.
Example data providers include Reuters and Foursquare.
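Loading subscribed data into S3 through the AWS Data Exchange API can be sketched roughly as follows. The helper name `build_export_job_request`, the bucket name, and all IDs are placeholders assumed for this example; the dict is shaped for the boto3 `dataexchange` client's `create_job` call with an S3 export job type.

```python
# Hypothetical helper: builds keyword arguments for a Data Exchange job
# that exports one asset of a subscribed data set into an S3 bucket.
def build_export_job_request(data_set_id, revision_id, asset_id, bucket, key):
    """Return a dict of create_job arguments for an export-to-S3 job."""
    return {
        "Type": "EXPORT_ASSETS_TO_S3",
        "Details": {
            "ExportAssetsToS3": {
                "DataSetId": data_set_id,
                "RevisionId": revision_id,
                # Each entry maps one asset to its destination object in S3.
                "AssetDestinations": [
                    {"AssetId": asset_id, "Bucket": bucket, "Key": key}
                ],
            }
        },
    }


# All IDs and the bucket name are placeholders, not real resources.
job_request = build_export_job_request(
    data_set_id="example-data-set-id",
    revision_id="example-revision-id",
    asset_id="example-asset-id",
    bucket="my-analytics-bucket",
    key="third-party/example.csv",
)

# With boto3 installed and credentials configured, the job could be run as:
#   import boto3
#   client = boto3.client("dataexchange")
#   job = client.create_job(**job_request)
#   client.start_job(JobId=job["Id"])
```

Once the export job finishes, the object is available in the bucket for analysis with whichever tool you prefer.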