Migration Flashcards
Service used to replicate large amounts of data from on-premise to AWS
AWS DataSync
Compatible systems with AWS DataSync
NFS and SMB
Units of time that AWS DataSync can replicate data
Hourly, Daily, or Weekly
How to use AWS DataSync
Install DataSync agent to start replication (typically on premise) or can be used to replicate EFS to EFS by installing on EC2 agent.
Sources that AWS DataSync can replicate to?
on-premise
EFS to EFS (install on EC2 agent)
S3, EFS, FSx for Windows ect.
AWS Service for big disk transfers of data into and out of AWS up to 80TB.
Snowball edge
What range of size Snowball edge come in?
48TB
81TB
How does AWS Snowball edge protect your data?
Tamper-resistant enclosure
256 encryption
TPM
Completely wipe data
Common uses for AWS Snowball edge?
Import data to S3
Export data from S3
AWS Service used to store and perform large computations of up to 100TB on prem?
Snowball Edge
AWS Service used to store and perform large computations of up to 8TB on prem?
Snowcone
AWS Service used to transfer HUGE amounts of data up to 100PB
Snowmobile
What is AWS Storage Gateway used for?
Connect on-premise software appliance with cloud-based storage
Virtual or physical device send stuff to AWS
Can download as VM and install in datacenter
What are the 3 types of Storage Gateways?
File Gateway
Volume Gateway
Tape Gateway
Describe Storage Gateway’s File Gateway
NFS & SMB
For Flat files
Store stuff in S3 access through NFS mount point connects via Direct Connect, VPC, or internet and then connect to S3.
Can act as a file system mount on S3
Describe Storage Gateway’s Volume Gateway (iSCSI)
STORED VOLUMES -
Store locally, durable off site backups
Entire dataset
Snapshot of Volume set to S3
CACHED VOLUMES-
Cache frequently used data stored on site
Entire dataset sent to s3
VITUAL HARD DISK - async backup of volumes stored in cloud as EBS snapshots
Describe Storage Gateway’s Tape Gateway
Virtual Tape Library
Way of getting rid of tapes (virtual tape cartridges)
AWS Service that is an interactive query service which enables you to analyse and query data located in S3 using standard SQL.
Athena
Uses for AWS Athena
Used to query log files in S3 (common topic)
Generate business reports on data stored in S3
Analyse AWS cost and usage reports
Run queries on click-stream data
Data types AWS Athena Allows
JSON
Apache ORC
Apache Parquet
TRUE or FALSE: AWS Athena is Serverless
TRUE
What do you pay for In Athena?
Pay per query / per TB scanned
Difference between S3 Select and Athena?
Athena is more powerful than S3 Select and therefore costs $$
AWS ML Service for analyzing S3 for PII
Macie
AWS Orchestration Service for running multi node parallel jobs that span multiple EC2 instances
AWS Batch
AWS Orchestration Service to Easily deploy and manage HPC clusters and automate creation
AWS ParallelCluster
What model does AWS Parallel Cluster use to provision the reosources you need
text file model
How to use GraphQL in AWS
AWS AppSync
What is GraphQL
Used for fetching app data, declarative coding, front-end data app fetching
Service to use when you have an older app using lecacy transfer protocols that can’t be changed but want to start storing files in S3
AWS Transfer Family
GUI service that integrates with SMS and DMS migration services to help schedule and track migrations
AWS Migration Hub
2 Discovery types offered by AWS Discovery Service
Agentless - use OVA file via vCenter to help identify hosts and collect IP, memory, and hostname to help simplify migration
Agent Based - AWS Application Discover Agent installed on each service (for Windows and Linux) & collects a bit more data than agentless
Service that provides automated lift and shift into AWS through replication & move into AWS
AWS MGN (Application Migration Service)
What can you use SCT (Schema Conversion Tool) on
RDS
Aurora
Redshift
EC2
S3