Database and Analytics Flashcards
RDS and Aurora
Managed SQL databases; Aurora is AWS's proprietary DB (compatible with PostgreSQL and MySQL)
Multi-AZ setup for disaster recovery (DR)
Vertical and horizontal scaling
You CANNOT SSH into your instance
RDS Deployment
Read Replicas - Read from more than 1 instance and write to only 1
Multi-AZ - Read and Write from 1 and on Failover move to another
Multi-Region - Region 1 (main) reads and writes on its own instance; Region 2 reads from its local replica, but its writes go to Region 1
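The read replica idea above (many readers, one writer) can be sketched as a toy model. This is not the RDS API: the class `ReplicatedDatabase` and its methods are hypothetical, plain dictionaries stand in for database instances, and replication is an explicit step here, whereas real replicas catch up asynchronously.

```python
import itertools


class ReplicatedDatabase:
    """Toy model: one writable primary plus read-only replicas."""

    def __init__(self, replica_count):
        self.primary = {}
        self.replicas = [{} for _ in range(replica_count)]
        # Round-robin iterator over replica indexes for spreading reads.
        self._next_replica = itertools.cycle(range(replica_count))

    def write(self, key, value):
        # All writes go to the single primary instance.
        self.primary[key] = value

    def replicate(self):
        # Copy the primary's data to every replica (explicit in this sketch).
        for replica in self.replicas:
            replica.update(self.primary)

    def read(self, key):
        # Reads are served by replicas, so they may lag behind the primary.
        index = next(self._next_replica)
        return index, self.replicas[index].get(key)


db = ReplicatedDatabase(replica_count=2)
db.write("user:1", "alice")
stale = db.read("user:1")   # replica 0 has not caught up yet
db.replicate()
fresh = db.read("user:1")   # replica 1, after replication
```

The first read returns nothing because the replica has not yet received the write, which is why reads from replicas are usually described as eventually consistent.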
ElastiCache
Managed in-memory cache (Redis or Memcached) that speeds up reads and takes load off databases
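A common way to use an in-memory cache in front of a database is the cache-aside pattern: check the cache first and fall back to the database on a miss. The sketch below is only an illustration under stated assumptions: a Python dict stands in for the cache, an in-memory SQLite database stands in for the backing database, and the `products` table and `get_product_name` helper are hypothetical.

```python
import sqlite3

# sqlite3 stands in for the backing database; a dict stands in for the cache.
db = sqlite3.connect(":memory:")
db.execute("CREATE TABLE products (id INTEGER PRIMARY KEY, name TEXT)")
db.execute("INSERT INTO products VALUES (1, 'keyboard')")
db.commit()

cache = {}
stats = {"hits": 0, "misses": 0}


def get_product_name(product_id):
    """Return the product name, checking the cache before the database."""
    if product_id in cache:
        stats["hits"] += 1
        return cache[product_id]
    stats["misses"] += 1
    row = db.execute(
        "SELECT name FROM products WHERE id = ?", (product_id,)
    ).fetchone()
    name = row[0] if row else None
    if name is not None:
        cache[product_id] = name  # populate the cache for later reads
    return name


first = get_product_name(1)   # miss: goes to the database
second = get_product_name(1)  # hit: served from memory
```

A real deployment would also need an expiry or invalidation strategy so cached values do not go stale after updates.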
DynamoDB
Fully managed NoSQL DB
Serverless
Standard and Infrequent Access (IA) table classes
DynamoDB Accelerator (DAX)
In-memory cache for faster reads from DynamoDB
DynamoDB Global Tables
It’s a way for you to make DynamoDB table accessible with low latency in multiple regions
Redshift
Based on PostgreSQL; runs on provisioned clusters, with a Serverless option (pay for what you use)
Only for OLAP and not OLTP
Used for Data warehousing
Uses Massively Parallel Processing (MPP) query execution
Integrates with BI tools like Amazon QuickSight or Tableau
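The MPP idea can be sketched as: split the data into slices, aggregate each slice independently, then merge the partial results. This is a rough illustration of the principle only, not how Redshift is implemented. The `sales` rows and the helpers `partial_sum` and `merge` are hypothetical, and threads stand in for compute nodes.

```python
from concurrent.futures import ThreadPoolExecutor

# Hypothetical sales rows: (region, amount).
sales = [
    ("eu", 10), ("us", 20), ("eu", 5), ("us", 7),
    ("ap", 3), ("eu", 1), ("ap", 4), ("us", 2),
]


def partial_sum(rows):
    """Aggregate one slice of the data, as a single worker node would."""
    totals = {}
    for region, amount in rows:
        totals[region] = totals.get(region, 0) + amount
    return totals


def merge(partials):
    """Combine per-slice results, as a coordinating node would."""
    merged = {}
    for totals in partials:
        for region, amount in totals.items():
            merged[region] = merged.get(region, 0) + amount
    return merged


# Split the rows into 4 slices and aggregate them concurrently.
slices = [sales[i::4] for i in range(4)]
with ThreadPoolExecutor(max_workers=4) as pool:
    partials = list(pool.map(partial_sum, slices))

totals_by_region = merge(partials)
```

This shape of work (scan many rows, group, aggregate) is typical of OLAP queries, which is why a warehouse suits it better than a transactional database.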
EMR (Elastic MapReduce)
Helps create Hadoop clusters (Big Data)
Clusters are made up of many EC2 instances
Autoscaling and integrated with spot instances
ML, Data Processing, Web Indexing, Big Data
Athena
Serverless; performs analytics on data stored in S3
Uses SQL
Used for BI, analytics, and analyzing VPC Flow Logs, CloudTrail, and ELB logs
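To give a feel for the kind of SQL one might run over log data, here is a small sketch. It does not call Athena: an in-memory SQLite table stands in for log files in S3, and the table `flow_logs` and its columns (`src_ip`, `dst_port`, `verdict`, `byte_count`) are simplified, hypothetical stand-ins for real flow log fields.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE flow_logs "
    "(src_ip TEXT, dst_port INTEGER, verdict TEXT, byte_count INTEGER)"
)
conn.executemany(
    "INSERT INTO flow_logs VALUES (?, ?, ?, ?)",
    [
        ("10.0.0.5", 22, "REJECT", 40),
        ("10.0.0.5", 22, "REJECT", 40),
        ("10.0.0.9", 443, "ACCEPT", 1200),
        ("10.0.0.7", 3389, "REJECT", 60),
    ],
)

# Which sources were rejected most often?
rejected = conn.execute(
    """
    SELECT src_ip, COUNT(*) AS attempts
    FROM flow_logs
    WHERE verdict = 'REJECT'
    GROUP BY src_ip
    ORDER BY attempts DESC, src_ip
    """
).fetchall()
```

The point is that the analysis is ordinary SQL; with Athena the data would stay in S3 rather than being inserted into a table first.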
QuickSight
Serverless to create interactive dashboards
Integrates with RDS, Aurora, Athena, Redshift, S3, and other data sources
DocumentDB
The MongoDB-compatible counterpart to Aurora (Aurora is for PostgreSQL/MySQL, DocumentDB is for MongoDB)
NoSQL DB
Neptune
Fully Managed Graph DB
Good for highly connected data, such as social networks
QLDB
Quantum Ledger DB
Recording Financial Transactions
To review history of all the changes made to your application data over time
Has a central authority (centralized, not decentralized)
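The idea of a ledger whose history can be reviewed and checked can be sketched with a hash chain: each entry's hash covers the previous entry's hash, so editing an old entry breaks every later link. This is an illustration of the concept only, not QLDB's actual journal format or API. The `Journal` class and the sample transactions are hypothetical.

```python
import hashlib
import json


class Journal:
    """Append-only log where each entry's hash covers the previous hash."""

    def __init__(self):
        self.entries = []

    def append(self, data):
        previous_hash = self.entries[-1]["hash"] if self.entries else "0" * 64
        payload = json.dumps(data, sort_keys=True)
        entry_hash = hashlib.sha256((previous_hash + payload).encode()).hexdigest()
        self.entries.append(
            {"data": data, "previous_hash": previous_hash, "hash": entry_hash}
        )

    def verify(self):
        """Recompute every hash; any edited entry breaks the chain."""
        previous_hash = "0" * 64
        for entry in self.entries:
            payload = json.dumps(entry["data"], sort_keys=True)
            expected = hashlib.sha256((previous_hash + payload).encode()).hexdigest()
            if entry["previous_hash"] != previous_hash or entry["hash"] != expected:
                return False
            previous_hash = entry["hash"]
        return True


journal = Journal()
journal.append({"account": "A", "change": -50})
journal.append({"account": "B", "change": 50})
valid_before = journal.verify()

# Tampering with history is detectable: the stored hash no longer matches.
journal.entries[0]["data"]["change"] = -5
valid_after = journal.verify()
```

Here a single owner (the central authority) holds the journal; the hash chain only makes changes to it detectable.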
Managed Blockchain
Join public blockchain networks or create your own private network; supports Hyperledger Fabric and Ethereum
No central authority
Glue
Serverless and does ETL
Takes data from S3 or RDS, transforms it, and loads it into Redshift
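The extract, transform, load flow above can be sketched in a few lines. This is not Glue code: a CSV string stands in for an object in S3, an in-memory SQLite table stands in for the warehouse, and the `orders` data and column names are hypothetical.

```python
import csv
import io
import sqlite3

# Extract: raw CSV text stands in for an object stored in S3.
raw_csv = """order_id,country,amount
1,us,19.99
2,US,5.00
3,de,
4,DE,12.50
"""
rows = list(csv.DictReader(io.StringIO(raw_csv)))

# Transform: drop rows with no amount, normalize country codes, cast types.
clean = [
    (int(r["order_id"]), r["country"].upper(), float(r["amount"]))
    for r in rows
    if r["amount"]
]

# Load: an in-memory SQLite table stands in for the warehouse.
warehouse = sqlite3.connect(":memory:")
warehouse.execute(
    "CREATE TABLE orders (order_id INTEGER, country TEXT, amount REAL)"
)
warehouse.executemany("INSERT INTO orders VALUES (?, ?, ?)", clean)

loaded = warehouse.execute(
    "SELECT country, COUNT(*) FROM orders GROUP BY country ORDER BY country"
).fetchall()
```

The row with a missing amount is filtered out during the transform step, so only clean, consistently formatted rows reach the warehouse table.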