Databases and Analytics Flashcards
What is Amazon RDS?
Amazon RDS (Relational Database Service) is a managed DB service for DBs that usse SQL as a query language.
What is Amazon Aurora?
Aurora is a proprietary technology from AWS that supports PostgreSQL and MySQL as a DBMS.
Cloud Optimised Service
What is a read replica for RDS?
Allows you to scale the read workload of your DB.
Can create up to 15 Read Replicas, but remember that data is only written to the main DB.
If you have a multi-region RDS deployment, do local applications write to the RDS read replica or to the main RDS DB?
Read replicas allow faster performance for applications in the same region.
Writes are done by applications to the main RDS DB, not the read replica.
What is Amazon ElastiCache?
Amazon ElastiCache provides managed Redis or Memcached. These are in-memory databases.
What Amazon service would you use if you needed an in-memory database with high performance and low latency?
Amazon ElastiCache.
What is the purpose of a Cache?
Caches help to reduce load off of databases to reduce read intensive workloads.
What is DynamoDB?
DynamoDB is a fully managed and highly available NoAQL database that is fast (low latency), massively scalable and serverless.
What is another name for DAX?
DynamoDB Accelerator.
What is DynamoDB Accelerator?
A fully managed, in memory-cache for DynamoDB.
If you need a cache for DynamoDB, what service would you use and why?
You would use DynamoDB Accelerator.
This is because it is purpose built for DynamoDB and is fully integrated into the database service.
What does OLAP stand for?
Online Analytical Processing
What is Redshift?
What are the use cases for Redshift?
Database based on PostgreSQL for OLAP.
Redshift is used for Data Analytics and Data Warehousing.
How often is data loaded into Redshift?
Every hour.
Does Redshift support visualization?
Yes it integrates with BI tools like AWS QuickSight or Tableau.
What does Amazon EMR stand for?
Amazon MapReduce.
What is Amazon EMR used for?
EMR helps create (provision/configure) Hadoop clusters to analyse and process large amounts of data.