Databases Flashcards

1
Q

What are disks vs. databases?

A

Disks and storage: EFS, EBS, EC2 Instance Store, S3 (object storage)

Databases: RDS, Aurora, ElastiCache, DynamoDB, Redshift, EMR, Athena, QuickSight, DocumentDB, Neptune, QLDB, Managed Blockchain, DMS, Glue

2
Q

What does a database let you do that a disk can’t?

A

Structure data, build indexes to query and search it efficiently, and define relationships between your datasets
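A minimal sketch of those three capabilities, using Python's built-in sqlite3 module as a stand-in for a managed database. The table names, column names, and sample rows are made up for illustration.

```python
import sqlite3

# In-memory SQLite database; table and column names are illustrative only.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE customers (id INTEGER PRIMARY KEY, name TEXT NOT NULL);
    CREATE TABLE orders (
        id INTEGER PRIMARY KEY,
        customer_id INTEGER NOT NULL REFERENCES customers(id),  -- relationship
        total REAL NOT NULL
    );
    -- Index so lookups by customer do not have to scan the whole table.
    CREATE INDEX idx_orders_customer ON orders(customer_id);
""")
conn.executemany("INSERT INTO customers VALUES (?, ?)", [(1, "Ada"), (2, "Lin")])
conn.executemany(
    "INSERT INTO orders VALUES (?, ?, ?)",
    [(1, 1, 20.0), (2, 1, 5.5), (3, 2, 12.0)],
)

# Query across the relationship: total spend per customer.
rows = conn.execute("""
    SELECT c.name, SUM(o.total)
    FROM customers AS c JOIN orders AS o ON o.customer_id = c.id
    GROUP BY c.id ORDER BY c.name
""").fetchall()
print(rows)
```

A plain disk or object store would hold the same bytes, but the schema, the index, and the join are what the database layer adds.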

3
Q

Benefits of NoSQL databases? What form does the data take?

A

Flexible, scalable, high-performance, and highly functional

Data is commonly stored as JSON documents
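A small sketch of what "flexible" can mean for JSON documents: two records in the same collection carry different fields, and the reader tolerates missing ones. The `users` collection and its fields are hypothetical.

```python
import json

# Two documents in the same hypothetical "users" collection; fields differ per document.
users = [
    {"id": "u1", "name": "Ada", "emails": ["ada@example.com"]},
    {"id": "u2", "name": "Lin", "address": {"city": "Oslo"}},
]

# Serialize to JSON and back, as a document store client typically would.
encoded = [json.dumps(u, sort_keys=True) for u in users]
decoded = [json.loads(s) for s in encoded]

# Readers handle missing fields instead of relying on a fixed schema.
cities = [u.get("address", {}).get("city") for u in decoded]
print(cities)
```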

4
Q

What is the benefit of using AWS databases?

A

AWS manages them:

  • Quick Provisioning, High Availability, Vertical and Horizontal Scaling
  • Automated Backup & Restore, Operations, Upgrades
  • Operating System Patching is handled by AWS
  • Monitoring, alerting
5
Q

What are relational databases?

A

RDS & Aurora (SQL)

6
Q

In-memory Database

A

ElastiCache

7
Q

Key/Value Database? Can it be used serverless? What’s its caching mechanism?

A

DynamoDB (serverless)

DAX (cache for DynamoDB)

8
Q

Data warehouse

A

Redshift (SQL)

9
Q

Hadoop Cluster

A

EMR

10
Q

How can you query data on Amazon S3?

What can it handle? (options below)
servers
serverless
SQL
NoSQL
A

Athena

Serverless and can handle SQL

11
Q

What service provides managed Hyperledger Fabric and Ethereum blockchains?

A

Amazon Managed Blockchain

12
Q

Managed ETL (Extract Transform Load) and Data Catalog service

A

Glue

13
Q

DMS & traits?

A

Database Migration Service: allows intervals; quickly and securely migrates databases to AWS; resilient and self-healing; the source database remains available during the migration

14
Q

Graph database

A

Neptune

15
Q

Dashboards on your data (serverless)

A

QuickSight

16
Q

“Aurora for MongoDB” (JSON – NoSQL database)

A

DocumentDB

17
Q
What allows you to create databases managed by AWS for the following:
• Postgres
• MySQL
• MariaDB
• Oracle
• Microsoft SQL Server
• Aurora (AWS Proprietary database)
A

RDS

18
Q

What are the pros & cons to RDS vs. DB on EC2?

A

RDS is a managed service:
• Automated provisioning, OS patching
• Continuous backups and restore to a specific timestamp (Point in Time Restore)
• Monitoring dashboards
• Read replicas for improved read performance
• Multi-AZ setup for DR (Disaster Recovery)
• Maintenance windows for upgrades
• Scaling capability (vertical and horizontal)
• Storage backed by EBS (gp2 or io1)
• BUT you can’t SSH into your instances

19
Q

Which database engines does Aurora (which is cloud-optimized) support?

A

PostgreSQL and MySQL

20
Q

What are 3 unique features of RDS deployments?

A
  1. Read replicas: scale the read workload of your DB (e.g. one app reading from 3 Amazon RDS instances: 1 main and 2 replicas)
  2. Multi-AZ: failover in case of an AZ outage (high availability); reads and writes go to the main RDS instance, with replication across AZs to one other RDS instance
  3. Multi-Region: local performance for global reads and disaster recovery in case of region issues, but replication is charged
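The read-replica idea in item 1 can be sketched as a toy router: writes always go to the main instance, while reads rotate across the main instance and its replicas. The `Router` class and the endpoint names are hypothetical and are not part of any AWS SDK; real applications usually point at separate writer and reader endpoints instead.

```python
from itertools import cycle


class Router:
    """Toy router: writes go to the main instance, reads rotate over all instances."""

    def __init__(self, main, replicas):
        self.main = main
        # Reads may be served by the main instance or any replica.
        self._readers = cycle([main] + list(replicas))

    def endpoint_for(self, sql):
        # Naive classification: anything that starts with SELECT counts as a read.
        is_read = sql.lstrip().upper().startswith("SELECT")
        return next(self._readers) if is_read else self.main


router = Router("main-db", ["replica-1", "replica-2"])
queries = ["SELECT * FROM orders", "INSERT INTO orders VALUES (1)", "SELECT 1", "SELECT 2"]
targets = [router.endpoint_for(q) for q in queries]
print(targets)
```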
21
Q

What do you use ElastiCache for?

A

To get managed Redis or Memcached; caches are in-memory databases that reduce the load on the DB for read-intensive workloads and allow quicker reads/writes from the cache
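How a cache takes read load off a database can be sketched with the cache-aside pattern. Here a plain dictionary stands in for Redis or Memcached, and `slow_db_lookup` is a hypothetical placeholder for a real query; the TTL value is arbitrary.

```python
import time

db_reads = 0  # counts how often the "database" is actually hit


def slow_db_lookup(key):
    """Hypothetical stand-in for a database query."""
    global db_reads
    db_reads += 1
    return f"value-for-{key}"


cache = {}  # key -> (value, expiry timestamp); stand-in for Redis/Memcached


def get(key, ttl_seconds=60.0):
    entry = cache.get(key)
    if entry is not None and entry[1] > time.monotonic():
        return entry[0]  # cache hit: the database is not touched
    value = slow_db_lookup(key)  # cache miss: read from the database
    cache[key] = (value, time.monotonic() + ttl_seconds)
    return value


results = [get("user:1") for _ in range(3)]
print(results, db_reads)
```

Three reads reach the database only once; the other two are served from memory.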

22
Q

What is a NoSQL DB that can scale to massive workloads as a distributed, serverless DB with single-digit millisecond latency?

A

DynamoDB

23
Q

What is a fully managed in-memory cache for DynamoDB that improves performance by up to 10x?

A

DAX

24
Q

What is a PostgreSQL-based OLAP data warehouse that uses columnar storage of data?

A

Redshift
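A rough illustration of why columnar storage suits OLAP queries: an aggregate over one column only has to touch that column's values. This is a conceptual sketch with made-up data, not a description of Redshift's internal storage format.

```python
# Row-oriented layout: each record is stored together.
rows = [
    {"order_id": 1, "region": "EU", "total": 20.0},
    {"order_id": 2, "region": "US", "total": 5.5},
    {"order_id": 3, "region": "EU", "total": 12.0},
]

# Column-oriented layout: one contiguous list per column.
columns = {name: [r[name] for r in rows] for name in rows[0]}

# An analytic aggregate reads a single column instead of every full record.
total_revenue = sum(columns["total"])
print(columns["region"], total_revenue)
```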

25
Q

What helps create Hadoop clusters (big data) to analyze and process vast amounts of data, where a cluster can consist of hundreds of EC2 instances, and is used for data processing, machine learning, web indexing, and big data?

A

Elastic MapReduce

26
Q

What is a fully serverless query service with SQL capabilities?

A

Athena

27
Q

What are the use cases of Athena?

A

one-time SQL queries, serverless queries on S3, log analytics

28
Q

What is the relationship between S3 and Athena?

A

Query data in S3 and get output to S3

29
Q

What is the pricing of Athena?

A

Pay per query

30
Q

What is a serverless machine learning-powered business intelligence service to create interactive dashboards?

A

Amazon QuickSight

31
Q

What is AWS's NoSQL database for storing, querying, and indexing JSON data?

A

DocumentDB

32
Q

What is a fully managed graph database (e.g. for a social network) that is highly available (across 3 AZs) and supports up to 15 read replicas?

A

Amazon Neptune

33
Q

What can be used to review history of all the changes made to your application data over time, and is immutable?

A

Amazon QLDB (Quantum Ledger Database)
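One general way to make a change history tamper-evident is a hash-chained, append-only journal, sketched below. This illustrates the concept only and is not QLDB's actual implementation or API; the function names and the account data are hypothetical.

```python
import hashlib
import json


def entry_hash(prev_hash, data):
    """Hash an entry together with the hash of the entry before it."""
    payload = json.dumps({"prev": prev_hash, "data": data}, sort_keys=True)
    return hashlib.sha256(payload.encode()).hexdigest()


def append(journal, data):
    prev = journal[-1]["hash"] if journal else ""
    journal.append({"data": data, "prev": prev, "hash": entry_hash(prev, data)})


def verify(journal):
    """Recompute every hash; any edit to past data breaks the chain."""
    prev = ""
    for entry in journal:
        if entry["prev"] != prev or entry["hash"] != entry_hash(prev, entry["data"]):
            return False
        prev = entry["hash"]
    return True


journal = []
append(journal, {"account": "A", "balance": 100})
append(journal, {"account": "A", "balance": 80})
ok_before = verify(journal)

journal[0]["data"]["balance"] = 1_000_000  # tamper with history
ok_after = verify(journal)
print(ok_before, ok_after)
```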

34
Q

How is QLDB different from Amazon Managed Blockchain?

A

No decentralization component, in accordance with financial regulation rules

35
Q

What is a serverless AWS service used to manage extract, transform, and load (ETL) jobs, i.e. useful for preparing and transforming data for analytics?

A

AWS Glue
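The three ETL stages can be sketched in plain Python. This does not use the Glue API; the CSV content, field names, and the `extract`/`transform`/`load` helpers are all hypothetical.

```python
import csv
import io
import json

# Raw input with messy spacing, a missing amount, and lowercase currency codes.
raw = "id,amount,currency\n1, 20.00 ,usd\n2,,usd\n3,12.50,eur\n"


def extract(text):
    """Extract: parse the raw CSV into dictionaries."""
    return list(csv.DictReader(io.StringIO(text)))


def transform(records):
    """Transform: drop incomplete rows, fix types, normalize currency codes."""
    cleaned = []
    for r in records:
        amount = r["amount"].strip()
        if not amount:
            continue
        cleaned.append(
            {"id": int(r["id"]), "amount": float(amount), "currency": r["currency"].upper()}
        )
    return cleaned


def load(records):
    """Load: emit JSON Lines, a format analytics tools commonly read."""
    return "\n".join(json.dumps(r, sort_keys=True) for r in records)


cleaned = transform(extract(raw))
loaded = load(cleaned)
print(loaded)
```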

36
Q

What services can use the Glue Data Catalog?

A

Athena, Redshift, EMR (Elastic MapReduce)

37
Q

What service can be used to quickly and securely migrate databases to AWS, and is resilient and self-healing?

A

Database Migration Service (DMS)