Databases Flashcards

1
Q

What are DBs used for?

A

To stored structured data in AWS

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are relational db’s used for?

A
  • Related data with SQL queries

- Can be scaled up

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What are NoSql DB’s used for?

A
  • Flexible schemas
  • data not related between tables
  • Flexibility, scalability, very high performance
  • Can be scaled out <== Key
  • JSON data storage
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What are AWS Managed Databases?

A
  • Very useful because the DB infrastructure is handled by AWS (Backups, Operations, Upgrades, Monitoring, Alerts)
  • Multi AZ for disaster recovery
  • Replicas, Dashboards
  • If you install a DB on EC2, you have to do all this
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is AWS RDS?

A
  • Relational Database Service
  • AWS Managed
  • Performance is better than a customer-managed database instance.
  • Read replicas for increased performance / horizontal scaling - can be multi-region
  • Multi-AZ for high availability - replicate to another AZ
  • Can choose from PostgreSQL, MySQL, Oracle, SQL Server, Aurora
  • Cannot SSH to DB instance
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is Amazon Aurora?

A
  • Proprietary AWS RDS
  • Cloud optimized
  • can auto-grow as data grows
  • no free tier
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is ElastiCache?

A
  • In memory DB
  • High performance
  • Reduce load on databases by handing certain requests
  • AWS managed
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is DynamoDB?

A
  • AWS managed NOSQL DB
  • Server-less
  • Scalable to massive workloads
  • Extremely low-latency retrieval - fast
  • DynamoDB Accelerator: Super Fast
  • Low cost
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is RedShift?

A
  • OLAP (On line Analytical Processing)
  • Analytics and Data Warehousing
  • Columnar data store
  • Can store huge amounts of data
  • SQL queries, dashboards
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is EMR?

A
  • AWS Elastic Map Reduce
  • Hadoop clusters - hundreds of EC2 instances
  • Process vast amounts of data
  • Use: data processing, machine learning, big data
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is Athena?

A
  • Server-less db to query S3 data

- Use: one-time SQL queries, S3 log analytics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is AWS Glue?

A
  • Serverless
  • ETL : extract, transform, load service
  • Data Catalog of datasets
  • Prepare and transform data for analytics
  • Data-wrangling - data from different sources (S3, RDS)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is DMS?

A
  • Data Migration Service
  • Migrate data from 1 DB to another
  • Homogeneous - same to same (oracle to Oracle)
  • Heterogeneous - from one to another (SQL server to Aurora)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is AWS Quicksight?

A
  • Create dashboard for analytics

- Integrates with RDS, Aurora, Redshift….

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is DocumentDB?

A
  • a no-sql database

- based on MongoDB

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is Amazon Neptune?

A
  • managed graph database
17
Q

What is QLDB?

A
  • Quantum Ledger Database (financial transactions/ledger)
  • review history of all changes made to data
  • 1 organization
  • not decentralized, only 1 database - this is different from blockchain
18
Q

What is AWS Managed Blockchain?

A
  • blockchain: HyperLedger Fabric, Ethereum
  • decentralized
  • multi-organinzation?