Databases Section Flashcards
Relational Database
- Looks just like Excel spreadsheets, with links
* Can use the SQL language to perform queries / lookups
AWS RDS
- It’s a managed DB service for DB use SQL as a query language.
- It allows you to create databases in the cloud that are managed by AWS
RDS is a managed service:
• Automated provisioning, OS patching
• Continuous backups and restore to specific timestamp (Point in Time Restore)!
• Monitoring dashboards
• Read replicas for improved read performance
• Multi AZ setup for DR (Disaster Recovery)
• Maintenance windows for upgrades
• Scaling capability (vertical and horizontal)
• Storage backed by EBS (gp2 or io1)
• BUT you can’t SSH into your instances
Amazon Aurora
• Aurora is a proprietary technology from AWS (not open sourced)
• PostgreSQL and MySQL are both supported as Aurora DB
• Aurora is “AWS cloud optimized” and claims 5x performance improvement
over MySQL on RDS
RDS Deployment, Read Replicas
- Scale the read workload of your DB
- Can create up to 5 Read Replicas
- Data is only written to the main DB
RDS Deployment, Multi AZ
- Failover in case of AZ outage (high availability)
- Data is only read/written to the main database
- Can only have 1 other AZ as failover
RDS Deployment, Multi-Region (Read Replicas)
- Disaster recovery in case of region issue
- other AZs read from their DB but write on the main DB
- Local performance for global reads
- Replication cost
Amazon Elasticache
• ElastiCache is to get managed Redis or Memcached
• Caches are in-memory databases with high performance, low latency
• Helps reduce load off databases for read intensive workloads
• AWS takes care of OS maintenance / patching, optimizations, setup,
configuration, monitoring, failure recovery and backups
Dynamo DB
- Fully Managed Highly available with replication across 3 AZ
- NoSQL, serverless
- Single-digit millisecond latency – low latency retrieval
- Integrated with IAM for security, authorization and administration
- Low cost and auto scaling capabilities
- Key/value database
DynamoDB Accelerator - DAX
- Fully Managed in-memory cache for ONLY DynamoDB
* 10x performance improvement
DynamoDB Global Tables
- Make a DynamoDB table accessible with low latency in multiple-regions
- Active-Active replication (read/write to any AWS Region)
Redshift Overview
- PostGre SQL OLAP – online analytical processing (analytics and data warehousing)
- Load data once every hour, not every second
- 10x better performance than other data warehouses, scale to PBs of data
- Columnar storage of data (instead of row based)
- pay as you go
Amazon EMR
- EMR stands for “Elastic MapReduce”
- EMR helps creating Hadoop clusters (Big Data) to analyze and process vast amount of data
- The clusters can be made of hundreds of EC2 instances
- EMR takes care of all the provisioning and configuration
Amazon Athena
- Serverless query service to analyze data stored in Amazon S3
- Uses standard SQL language to query the files
Amazon QuickSight
- Serverless machine learning-powered business intelligence service to create interactive dashboards
- Integrated with RDS, Aurora,Athena, Redshift, S3
DocumentDB
- Aurora is an “AWS-implementation” of PostgreSQL / MySQL …
* DocumentDB is the same for MongoDB (which is a NoSQL database)