Databases Flashcards

1
Q

What is an RDBMS (Aurora/Postgres) good for?

A

SQL, joins, OLTP etc

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is OLTP?

A

Online Transaction Processing

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is a NoSQL DB (Dynamo/Elasticache) good for?

A

key/value pairs, no joins, no SQL

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is object store (S3) good for?

A

Big objects and data backups

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is a data warehouse (RedShift/Athena) good for?

A

SQL analytics, business insights

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What 2 RDBMS is the Aurora API compatible with?

A
  • Postgres

- MySQL

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Where is the Aurora data stored?

A

It is stored in 6 replicas across 3 availability zones

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Why would Aurora be Global?

A

It can be Global for disaster recovery or latency purposes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

How big can the Aurora Auto Scaling storage be?

A

Between 10GB and 128GB

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is Aurora serverless good for?

A

Unpredictable or intermittent workloads

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is Aurora Multi-Master good for?

A

For continuous writes failover

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is Elasticache?

A

It is a managed Redis or Memcached

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is RedShift used for?

A

It is used for OLAP (Online Analytical Processing)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

How does RedShift store data?

A

It is columnar storage, not row based

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What does MPP stand for when talking about RedShift?

A

Massive Parallel Query Processing

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

How many nodes can RedShift have?

A

From 1 up to 128 nodes

17
Q

How much space does each node have in RedShift?

A

128TB

18
Q

What does the Leader node do in RedShift?

A

Its used for query planning and result aggregation

19
Q

What are the Compute nodes in RedShift used for?

A

They’re used for performing queries and sending results to the Leader node

20
Q

What is RedShift Spectrum?

A

Its used to perform queries directly on S3, no need to load

21
Q

What is RedShift Enhanced VPC Routing used for?

A

It’s used for COPY/UNLOAD foes through VPC rather than the public internet to S3

22
Q

How do you copy a RedShift snapshot to another AWS region?

A

You can configure RedShift to automatically copy snapshots of a cluster to another region

23
Q

What is AWS Glue used for?

A

Its used for ETL (Extract, transform and load)

24
Q

What is AWS Neptune?

A

It is a fully managed Graph database

25
Q

What apps would use a graph db?

A

A social media platform etc

26
Q

What has ElasticSearch been renamed to?

A

OpenSearch

27
Q

What makes up the ELK stack?

A

ElasticSearch, Kibana and Logstash