B6 Flashcards

1
Q

What is a data lake?

A

Contains data that is structured and unstructured mostly being in its raw form.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is a data warehouse?

A

A very large data repository that is centralized and used for reporting and analysis rather than for transactional purposes. A data warehouse pulls data either directly from enterprise systems with transactional data or from an ODS.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is an operational data store?

A

a repository of transactional data from multiple sources and is often an interim area between a data source and data warehouses.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is a data mart?

A

focused on a specific purpose, such as marketing or logistics, then a data warehouse. A data mart is often a subset of a data warehouse.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What are the steps to reduce data redundancy?

A
  1. Normalizing data: All data has one peace of information (One Valuu Per Columb) and every record in every tale has a unique identifier.
  2. Conform the data: Requires all non key attributes to depend on the entire primary key.
  3. Ascertain: When you check to make sure that there are no non-key attributes depending on other non-key attribute.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly