Databases Flashcards

1
Q

Is RDS OLTP or OLAP?

A

RDS is OLTP as it’s row-based and not made for data warehousing.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What OLTP Databases are available on AWS?

A
MySQL
MariaDB
Aurora
PostgreSQL
Oracle
SQL Server
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What OLAP service is there on AWS?

A

Redshift (dataware housing)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What’s the difference between OLAP and OLTP?

A

OLAP is for analytics, OLTP is for transaction processing.

OLAP uses columnar architecture to achieve high IO performance (same data type in a data column) as well as very good compression. Queries can span multiple tables and process/join multiple sources of data which can be very intensive.

OLTP uses row-based architecture which can be easily queried without spanning too many databases.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is AWS Redshift?

A

Redshift is a data warehousing service for AWS. It has fast performance and high storage capacity. It is a columnar type architecture with multi-parallel processing and advanced data compression.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Is AWS Redshift encrypted?

A

At rest encryption: AES-256

In transit encryption: SSL/TLS

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is Multi-AZ in an RDS Instance?

A

Multi-AZ is used to synchronize your database writes to a copy in another AZ so that it can act as a failover replica in the case your primary database fails.

Used for High Availability

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is Read-Replicas in an RDS instance?

A

Read-Replicas are read-only replicas of your primary database that you can use to gain performance/separate reads from your primary database.

This is used to gain a performance advantage.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What are the available database services in AWS?

A

OLAP: Redshift
OLTP: RDS – MySQL, SQL Server, PostgreSQL, Oracle, Aurora, MariaDB

In-Memory: Elasticache (redis, memcached)

No-SQL: DynamoDB

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

How does Aurora Scaling/Failover work?

A

Two copies of your database in each AZ. Each Aurora instance has a minimum of three AZs (6 copies of physical storage media)

Designed to transparently handle the loss of 2 copies of data without affecting database write availability and up to three copies without affecting read availability

Self-healing, data blocks and disks are continuously scanned for errors and repaired automatically.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

How many copies of data can Aurora handle?

A

Aurora is able to handle up to 2 copies of lost data without affecting write operations and up to 3 copies of data without affecting read availability

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What stops an Aurora instance from suffering data corruption/drive corruption?

A

Self-healing. Aurora constantly scans for errors and repairs them automatically

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Difference between scaling DynamoDB vs scaling RDS?

A

DynamoDB can scale with “push button scaling” aka you click a button and you can increase the performance (WCU,RCU) units and size. No downtime required.

With an RDS instance you have to use read-replicas to scale out and eventually hit a limit or use a bigger instance type.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What are the storage consistencies of Dynamo DB?

A

Eventual Read Consistency (default) : Can take 1 second to reach consistency after a write. (.5 WCU)

Strongly Consistent Reads: Less than a second to reach consistency after a write (1 WCU)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

How many data centers are DynamoDB instances spread across?

A

DynamoDB instances are spread across 3 geographically distinct data centers

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What type of storage are DynamoDB instances stored on?

A

SSD Storage

17
Q

What types of Redshift Configurations are there?

A

Single node (164GB): Used for small/med business

Multi-Node
Leader Node: Manages client connections and receives queries

Compute Nodes (up to 128 nodes): store and perform queries and computations

18
Q

What is Elasticache?

A

A web service that caches critical data that requires high performance reads. Increases performance by storing data in-memory instead of on disk.

19
Q

What are the two Elasticache caching engines?

A

Memcached

redis

20
Q

When would you use Elasticache?

A

When you need to offload reads or higher read performance from data.

21
Q

When would you use Redshift?

A

When you’ve got queries that span too many tables, databases, taking up tons of I/O

22
Q

How many copies of your data are stored by default with Aurora? (physically)

A

6 copies (2 copies across 3 different AZs)

23
Q

What is Amazon Athena?

A

Athena is a query service that makes it easy to analyze data in S3 using standard SQL commands.

Supported data formats: JSON, Apache Parquet, Apache ORC

24
Q

What happens to the I/O of a single-AZ RDS instance during a DB snapshot or backup?

A

You may experience I/O being suspended briefly as the backup process initializes (under a few seconds). You may also experience higher latency for a period.

25
Q

Default MySQL port number?

A

3306

26
Q

With new RDS DB instances are automated backups enabled by default?

A

Yes, automated backups are enabled by default

27
Q

What is the maximum retention interval for RDS backups?

A

35 days

28
Q

Do you have to pay a transfer charge when replicating data from your primary RDS instance to your secondary RDS instance?

A

Nope, there is no charge associated with the action.

29
Q

Are you charged if you have an RDS database that isn’t being used?

A

Yes, RDS databases are charged based on running time