Databases Section Flashcards

1
Q

Relational Database

A
  • Looks just like Excel spreadsheets, with links

* Can use the SQL language to perform queries / lookups

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

AWS RDS

A
  • It’s a managed DB service for DB use SQL as a query language.
  • It allows you to create databases in the cloud that are managed by AWS

RDS is a managed service:
• Automated provisioning, OS patching
• Continuous backups and restore to specific timestamp (Point in Time Restore)!
• Monitoring dashboards
• Read replicas for improved read performance
• Multi AZ setup for DR (Disaster Recovery)
• Maintenance windows for upgrades
• Scaling capability (vertical and horizontal)
• Storage backed by EBS (gp2 or io1)

• BUT you can’t SSH into your instances

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Amazon Aurora

A

• Aurora is a proprietary technology from AWS (not open sourced)
• PostgreSQL and MySQL are both supported as Aurora DB
• Aurora is “AWS cloud optimized” and claims 5x performance improvement
over MySQL on RDS

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

RDS Deployment, Read Replicas

A
  • Scale the read workload of your DB
  • Can create up to 5 Read Replicas
  • Data is only written to the main DB
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

RDS Deployment, Multi AZ

A
  • Failover in case of AZ outage (high availability)
  • Data is only read/written to the main database
  • Can only have 1 other AZ as failover
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

RDS Deployment, Multi-Region (Read Replicas)

A
  • Disaster recovery in case of region issue
  • other AZs read from their DB but write on the main DB
  • Local performance for global reads
  • Replication cost
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Amazon Elasticache

A

• ElastiCache is to get managed Redis or Memcached
• Caches are in-memory databases with high performance, low latency
• Helps reduce load off databases for read intensive workloads
• AWS takes care of OS maintenance / patching, optimizations, setup,
configuration, monitoring, failure recovery and backups

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Dynamo DB

A
  • Fully Managed Highly available with replication across 3 AZ
  • NoSQL, serverless
  • Single-digit millisecond latency – low latency retrieval
  • Integrated with IAM for security, authorization and administration
  • Low cost and auto scaling capabilities
  • Key/value database
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

DynamoDB Accelerator - DAX

A
  • Fully Managed in-memory cache for ONLY DynamoDB

* 10x performance improvement

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

DynamoDB Global Tables

A
  • Make a DynamoDB table accessible with low latency in multiple-regions
  • Active-Active replication (read/write to any AWS Region)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Redshift Overview

A
  • PostGre SQL OLAP – online analytical processing (analytics and data warehousing)
  • Load data once every hour, not every second
  • 10x better performance than other data warehouses, scale to PBs of data
  • Columnar storage of data (instead of row based)
  • pay as you go
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Amazon EMR

A
  • EMR stands for “Elastic MapReduce”
  • EMR helps creating Hadoop clusters (Big Data) to analyze and process vast amount of data
  • The clusters can be made of hundreds of EC2 instances
  • EMR takes care of all the provisioning and configuration
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Amazon Athena

A
  • Serverless query service to analyze data stored in Amazon S3
  • Uses standard SQL language to query the files
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Amazon QuickSight

A
  • Serverless machine learning-powered business intelligence service to create interactive dashboards
  • Integrated with RDS, Aurora,Athena, Redshift, S3
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

DocumentDB

A
  • Aurora is an “AWS-implementation” of PostgreSQL / MySQL …

* DocumentDB is the same for MongoDB (which is a NoSQL database)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Amazon Neptune

A
  • Fully managed graph database
  • A popular graph dataset would be a social network
  • Highly available across 3 AZ, with up to 15 read replicas
  • Great for knowledge graphs (Wikipedia), fraud detection, recommendation engines, social networking
17
Q

Amazon QLDB

A
  • A ledger is a book recording financial transactions
  • Fully Managed, Serverless, High available, Replication across 3 AZ
  • Used to review history of all the changes made to your application data over time
  • Immutable and centralized
18
Q

Amazon Managed Blockchain

A
  • decentralized way to build applications where multiple parties can execute transactions
  • Compatible with the frameworks Hyperledger Fabric & Ethereum
19
Q

AWS Glue

A
  • Managed extract, transform, and load (ETL) service
  • Useful to prepare and transform data for analytics
  • Fully serverless service