Chapter 5 Flashcards
What would you use for Tabular data with a well defined schema?
AWS Relational Database Service
What would you use for analytics and reporting workloads that are heavy ?
A data warehouse like RedShift
What is the difference between RDS and Redshift?
RDS stores data using row-level storage where as Redshift uses column based.
What would you use for structured very large datasets?
RedShift
If you data is semi-structured what repository should you consider?
DynamoDB
How does DynamoDB store data?
As key-value pairs
What can you use to store data that doesn’t really have a schema?
DynamoDB
If you data currently lives in an open-source NoSQL store like MongoDB how can you migrate it easily to AWS?
Amazon DocumentDB
What would you use to centrally manage and govern data access across multiple repositories?
AWS Data Lake Formation
What tools could be used to run analytics or ETL workstreams on data in the data lake?
Amazon RedShift or EMR