Databases Flashcards

1
Q

Structured Data

A

data is often organized to support transactional and analytical applications. Structured data is most commonly stored in relational databases but can also be stored in non-relational databases.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Unstructured Data

A

data is not organized in any distinguishable or predefined manner. Common stores for unstructured data are non-relational key-value databases. Unstructured data is full of irrelevant information, which means data needs to first be processed to perform any kind of meaningful analysis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Semistructured

A

data can be just as predictable and organized as structured data. The difference is that semistructured data is flexible and can be updated without the requirement to change the schema for every single record in a table. non-relational stores

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

OLAP

A

Relational DB online Transactional Processing. Optimize for Read

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

OLTP

A

Relational DB Online Transaction Processing. Optimize for Write

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Aurora

A

Aurora is MySQL and PostgreSQL compatible.

log structured distributed storage layer

auto 6 copies of data across AZs
backed up to S3 with snapshots
serverless

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Redshift

A

Amazon Redshift delivers 10 times faster performance than other data warehouses by using machine learning, massively parallel query execution, and columnar storage on high-performance disk. You can set up and deploy a new data warehouse in minutes. Run queries across petabytes of data in your Amazon Redshift data warehouse and exabytes of data directly from your data lake built on Amazon Simple Storage Service (Amazon S3) with Amazon Redshift Spectrum.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Steps to create RDS

A
  1. VPC
  2. Subnets (requires 2 in 2 different AZs)
  3. EC2
  4. Security Group
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Aurora

A
  1. DB Instance Class (mem optimized or burstable)
    2
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Foreign Key

A

Used to create relationships between tables in a relational database

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

updating the schema

A

Adding a column in a nonelational db is not required to update. Relational update is required.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

T/F Nonrelational databases are optimized for storage?

A

False: they are optimized for compute.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Which DB type scales vertically

A

Relational

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

T/F both types of DB use OLTP and OLAP?

A

Flase. Only relations uses OLAP

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Key Value

A

Uses in Non-relational
Typicall stored in one table.
Can handle varied data
flexible

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Document DBs

A

Uses Non relations
Python and Node.js
elements are person, place or thing.Mongo DB key word

17
Q

In-Memory DB

A

Real time access to data
nonrelational
frequently accesses not frequently updated
cacheing gaming and session stortes

18
Q

Graphy DB

A

Nonrelational
store data as nodes
visualize data

19
Q

Elasticach for intensive apps

A

Amazon ElastiCache to support data-intensive apps or improve the performance of your existing apps by retrieving data from high throughput and low latency in-memory data stores. This service offers fully managed Redis and Memcached cache engines for in-memory data stores. ElastiCache is a popular choice for gaming, advertising technology (ad tech), financial service, healthcare, and Internet of Things (IoT) apps.

20
Q

Redis and Inmemory

A

” elasticache key word” Uses as noralational engines for quick acces to data

21
Q

Amazon DynamoDB

A

can handle more than 10 trillion requests per day and support peaks of more than 20 million requests per second. More than 100,000 AWS customers have chosen DynamoDB as their key-value and document store database for mobile, web, gaming, ad tech, IoT, and other applications that need low-latency data access at any scale. DynamoDB supports ACID-compliant transactions

uses partition keys for table and can use sort keys for sort

If no sort keys primary key and partition are the same.

billed for each read and write

22
Q

Neptune

A

fully managed graph database service that makes it easy to build and run applications that work with highly connected data sets used to discover potential fraudulent behavior before it happens. This starts with finding interactions between products, locations, and devices and then mapping those data points to individual users, customers, and/or employees.

Neptune graph use cases include recommendation engines, fraud detection, knowledge graphs, drug discovery, and network security.

Can be structured, semistructured, or unstructured.

23
Q

Amazon DocumentDB

A

reliable, and fully managed database service that allows you to set up, operate, and scale MongoDB-compatible databases in the cloud. With Amazon DocumentDB, you can run the same application code and use the same drivers and tools that you use with MongoDB.

Semistructured data

24
Q

RDS security groups facts

A

They don’t exist. Use DB security groups.

25
Q

What are clustered indexes used with

A

Relational DBs only or document DB

26
Q

heterogeneous migration

A

Heterogeneous migrations, where you migrate between different database engines, requires use of the AWS Schema Conversion Tool (AWS SCT) to translate your database schema to the new platform.

27
Q

Redshift comprised of

A

Amazon Redshift clusters are comprised of nodes. Compute nodes divide work among slices. Each slice is assigned a portion of the node’s memory and drive space. When you connect to an Amazon Redshift cluster, you use the SQL endpoint.

There are only two types of nodes in Amazon Redshift: a single leader node and one or more compute nodes. Amazon Redshift cannot use OLE DB drivers.

28
Q

Document DB

A

Amazon DocumentDB clusters can only be run in an Amazon VPC. Amazon DocumentDB decouples storage and compute, enabling each to scale independently. All instances within the cluster support data reads.

The basic component of Amazon DocumentDB is the cluster, which contains a storage volume and instances. Once the cluster is provisioned, you can add and remove instances as needed.

Stores data in Json forms and semistructured documents

29
Q

Benefits of Document DB

A

Flexible indexing (Correct)
Ad-hoc querying (Correct)
Powerful analytics (Correct)