Module 6 - Database Services Flashcards

1
Q

What relational databases does AWS offer?

A

Amazon RDS
Amazon RedShift
Aurora

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What NoSQL databases does AWS offer?

A

DynamoDB

ElastiCache

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What do I need to consider when deciding on a database product?

A
  • Read and write needs
  • Total storage requirements
  • Typical object size and nature of access to these objects
  • Durability requirements
  • Latency requirements
  • Maximum concurrent users to support
  • Nature of queries
  • Required strength of integrity controls
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

How do you get high availability for database workloads?

A

Use Multi-AZ deployments with RDS. It auto replicates in another AZ; in case of failure, it switches over to the backup. Then the backup becomes primary and the newly spun up db becomes the backup.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

How should I handle read-heavy workloads?

A

RDS with a Read Replicas. The primary DB is read/write, replicas are read-only. Replicas can be promoted to primary read/write in case of failure.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

How does RDS manage encryption at rest?

A

AWS KMS.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is Aurora?

A

A relational database compatible with MySQL and PostgreSQL, but faster. Built for cloud. Managed by RDS so no maintenance.

Stored across 3 AZs with 2 backup copies for each instance, with up to 15 replicas per cluster.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What database should I use if I need high availability?

A

Aurora.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What if my database needs are highly variable?

A

Use Aurora Serverless. Auto scales on-demand from a pool of prewarmed instances.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

I need a NoSQL database but I don’t want to deal with managing the cluster.

A

DynamoDB. Handles all the architecture decisions, optionally auto-scales.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What are 2 ways to auto-scale?

A

Provisioned - set a min and max, looks like steps

On-Demand - for spiky workloads

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What are the consistency options for DynamoDB?

A
Strongly consistent
Eventually consistent (default)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is an AWS option for a data warehouse?

A

Amazon RedShift. Fully managed, cloud-native.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What are some use cases for RedShift?

A
  • Business intelligence: run high-performance queries on petabytes of semi-structured and structured data
  • Operational analytics on events: Bring together structured data from your data warehouse and semi-structured data such as application logs from your S3 data lake to get real-time operational insights on your applications and systems.
  • Data as a service: Share data inside and outside your organization for secure and governed collaboration on live data with Amazon Redshift data sharing.
  • Predictive analytics: Use SQL to automatically create, train, and deploy Amazon SageMaker models on the data in your data warehouse with Amazon Redshift Machine Learning (preview)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

How does RedShift organize data?

A

By column; ideal for warehousing and analytics, large data sets.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What if I need to auto-scale RedShift?

A

Turn on concurrency scaling.

17
Q

What are the node types for RedShift?

A

3rd generation (RA3) - optimize your data warehouse by scaling and paying for compute and managed storage independently.

Dense compute nodes (DC2) - compute-intensive data warehouses with local SSD storage included.

18
Q

What are 2 caching strategies for a database?

A

Lazy-loading - update cache only on a miss

Write-through - update cache on every write

19
Q

What is ElastiCache?

A

A service for in-memory data store or cache environment in the cloud. Support Redis, Memcached. Creates a caching tier for data-intensive apps. Works as an in-memory data store.

20
Q

What is TTL?

A

An integer value that specifies the number of seconds or milliseconds until the key expires. When the key expires the cache goes back to the database to get fresh data.

21
Q

What can you use to increase response times on DynamoDB?

A

DynamoDB Accelerator(DAX). It’s a caching in memory service.

22
Q

How can you migrate an on-prem database to the cloud?

A

AWS DMS. Database Migration Service. For Oracle, PostgreSQL, SQL Server, Amazon Redshift, Aurora, MariaDB, and MySQL.

automatically handles formatting of the source data for consumption by the target database

23
Q

What if I am migrating to a different engine?

A

For heterogeneous migrations, you can use the AWS Schema Conversion Tool (AWS SCT).

Automatically converts the source database schema and code objects. The conversion includes views, stored procedures, and functions.

24
Q

What is the benefit of using RDS in a multi-AZ configuration?

A

Automatic failover across AZs.

25
Q

What type of ElastiCache installation offers sorting and ranking for data sets?

A

ElastiCache for Redis.

26
Q

What are DynamoDB global tables?

A

Multiple replica tables (one per AWS Region) that DynamoDB treats as a single unit. Tables can be in different regions.

27
Q

I need to give engineers access to ElastiCache for Redis sessions. How do you secure the session requiring a password before they can execute Redis commands?

A

Authenticate the users using Redis AUTH (make a new Redis cluster and enable auth token and transit encryption).

Using Redis AUTH improves data security by requiring users to enter a password before executing Redis commands.