Databases Flashcards

(29 cards)

1
Q

Is RDS OLTP or OLAP?

A

RDS is OLTP as it’s row-based and not made for data warehousing.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What OLTP Databases are available on AWS?

A
MySQL
MariaDB
Aurora
PostgreSQL
Oracle
SQL Server
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What OLAP service is there on AWS?

A

Redshift (dataware housing)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What’s the difference between OLAP and OLTP?

A

OLAP is for analytics, OLTP is for transaction processing.

OLAP uses columnar architecture to achieve high IO performance (same data type in a data column) as well as very good compression. Queries can span multiple tables and process/join multiple sources of data which can be very intensive.

OLTP uses row-based architecture which can be easily queried without spanning too many databases.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is AWS Redshift?

A

Redshift is a data warehousing service for AWS. It has fast performance and high storage capacity. It is a columnar type architecture with multi-parallel processing and advanced data compression.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Is AWS Redshift encrypted?

A

At rest encryption: AES-256

In transit encryption: SSL/TLS

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is Multi-AZ in an RDS Instance?

A

Multi-AZ is used to synchronize your database writes to a copy in another AZ so that it can act as a failover replica in the case your primary database fails.

Used for High Availability

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is Read-Replicas in an RDS instance?

A

Read-Replicas are read-only replicas of your primary database that you can use to gain performance/separate reads from your primary database.

This is used to gain a performance advantage.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What are the available database services in AWS?

A

OLAP: Redshift
OLTP: RDS – MySQL, SQL Server, PostgreSQL, Oracle, Aurora, MariaDB

In-Memory: Elasticache (redis, memcached)

No-SQL: DynamoDB

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

How does Aurora Scaling/Failover work?

A

Two copies of your database in each AZ. Each Aurora instance has a minimum of three AZs (6 copies of physical storage media)

Designed to transparently handle the loss of 2 copies of data without affecting database write availability and up to three copies without affecting read availability

Self-healing, data blocks and disks are continuously scanned for errors and repaired automatically.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

How many copies of data can Aurora handle?

A

Aurora is able to handle up to 2 copies of lost data without affecting write operations and up to 3 copies of data without affecting read availability

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What stops an Aurora instance from suffering data corruption/drive corruption?

A

Self-healing. Aurora constantly scans for errors and repairs them automatically

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Difference between scaling DynamoDB vs scaling RDS?

A

DynamoDB can scale with “push button scaling” aka you click a button and you can increase the performance (WCU,RCU) units and size. No downtime required.

With an RDS instance you have to use read-replicas to scale out and eventually hit a limit or use a bigger instance type.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What are the storage consistencies of Dynamo DB?

A

Eventual Read Consistency (default) : Can take 1 second to reach consistency after a write. (.5 WCU)

Strongly Consistent Reads: Less than a second to reach consistency after a write (1 WCU)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

How many data centers are DynamoDB instances spread across?

A

DynamoDB instances are spread across 3 geographically distinct data centers

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What type of storage are DynamoDB instances stored on?

17
Q

What types of Redshift Configurations are there?

A

Single node (164GB): Used for small/med business

Multi-Node
Leader Node: Manages client connections and receives queries

Compute Nodes (up to 128 nodes): store and perform queries and computations

18
Q

What is Elasticache?

A

A web service that caches critical data that requires high performance reads. Increases performance by storing data in-memory instead of on disk.

19
Q

What are the two Elasticache caching engines?

A

Memcached

redis

20
Q

When would you use Elasticache?

A

When you need to offload reads or higher read performance from data.

21
Q

When would you use Redshift?

A

When you’ve got queries that span too many tables, databases, taking up tons of I/O

22
Q

How many copies of your data are stored by default with Aurora? (physically)

A

6 copies (2 copies across 3 different AZs)

23
Q

What is Amazon Athena?

A

Athena is a query service that makes it easy to analyze data in S3 using standard SQL commands.

Supported data formats: JSON, Apache Parquet, Apache ORC

24
Q

What happens to the I/O of a single-AZ RDS instance during a DB snapshot or backup?

A

You may experience I/O being suspended briefly as the backup process initializes (under a few seconds). You may also experience higher latency for a period.

25
Default MySQL port number?
3306
26
With new RDS DB instances are automated backups enabled by default?
Yes, automated backups are enabled by default
27
What is the maximum retention interval for RDS backups?
35 days
28
Do you have to pay a transfer charge when replicating data from your primary RDS instance to your secondary RDS instance?
Nope, there is no charge associated with the action.
29
Are you charged if you have an RDS database that isn't being used?
Yes, RDS databases are charged based on running time