B6 Flashcards
What is a data lake?
A repository that holds both structured and unstructured data, mostly in its raw form.
What is a data warehouse?
A very large, centralized data repository used for reporting and analysis rather than for transactional purposes. A data warehouse pulls data either directly from enterprise systems that hold transactional data or from an operational data store (ODS).
What is an operational data store?
A repository of transactional data from multiple sources. It is often an interim area between a data source and a data warehouse.
What is a data mart?
A repository focused on a more specific purpose, such as marketing or logistics, than a data warehouse. A data mart is often a subset of a data warehouse.
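To make the "subset of a data warehouse" idea concrete, here is a minimal sketch using Python's built-in sqlite3 module. The table name `warehouse_facts`, the view name `marketing_mart`, and the sample rows are hypothetical, chosen only for illustration. A real data mart is usually a separate store, not just a view.

```python
import sqlite3


def build_marketing_mart(conn):
    """Create a toy warehouse table and a marketing-only mart view on top of it."""
    # Hypothetical warehouse table covering several departments.
    conn.execute(
        "CREATE TABLE warehouse_facts ("
        "id INTEGER PRIMARY KEY, department TEXT, amount REAL)"
    )
    conn.executemany(
        "INSERT INTO warehouse_facts (department, amount) VALUES (?, ?)",
        [("marketing", 120.0), ("logistics", 75.5), ("marketing", 30.0)],
    )
    # The mart exposes only the rows that serve one specific purpose.
    conn.execute(
        "CREATE VIEW marketing_mart AS "
        "SELECT id, amount FROM warehouse_facts WHERE department = 'marketing'"
    )
    return conn.execute(
        "SELECT id, amount FROM marketing_mart ORDER BY id"
    ).fetchall()


conn = sqlite3.connect(":memory:")
mart_rows = build_marketing_mart(conn)
print(mart_rows)  # [(1, 120.0), (3, 30.0)]
```

The logistics row stays in the warehouse but never appears in the mart, which is the sense in which the mart is a purpose-specific subset.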
What are the steps to reduce data redundancy?
- Normalize the data: Each field holds one piece of information (one value per column), and every record in every table has a unique identifier.
- Conform the data: Every non-key attribute must depend on the entire primary key, not on just part of it.
- Ascertain: Check that no non-key attribute depends on another non-key attribute.
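The three steps above correspond to what database texts commonly call first, second, and third normal form. They can be sketched in plain Python as follows. The order records, field names, and the packed `items` format are all hypothetical, invented only to show where the redundancy goes at each step.

```python
# Hypothetical unnormalized records: "items" packs several values into one column.
raw_orders = [
    {"order_id": 1, "customer_id": "C1", "customer_city": "Austin",
     "items": "P1:Pen:2,P2:Ink:1"},
    {"order_id": 2, "customer_id": "C1", "customer_city": "Austin",
     "items": "P1:Pen:5"},
]

# Step 1 (normalize): one value per column; each row is uniquely
# identified by the composite key (order_id, product_id).
flat = []
for order in raw_orders:
    for item in order["items"].split(","):
        product_id, product_name, qty = item.split(":")
        flat.append({
            "order_id": order["order_id"],
            "product_id": product_id,
            "product_name": product_name,
            "quantity": int(qty),
            "customer_id": order["customer_id"],
            "customer_city": order["customer_city"],
        })

# Step 2 (conform): attributes that depend on only part of the key move out.
# product_name depends on product_id alone; customer data on order_id alone.
products = {r["product_id"]: r["product_name"] for r in flat}
order_lines = {(r["order_id"], r["product_id"]): r["quantity"] for r in flat}
orders_step2 = {
    r["order_id"]: (r["customer_id"], r["customer_city"]) for r in flat
}

# Step 3 (ascertain): customer_city depends on customer_id, a non-key
# attribute of the orders table, so it moves to its own table.
orders = {oid: cust_id for oid, (cust_id, _city) in orders_step2.items()}
customers = {cust_id: city for cust_id, city in orders_step2.values()}

print(products)     # {'P1': 'Pen', 'P2': 'Ink'}
print(order_lines)  # {(1, 'P1'): 2, (1, 'P2'): 1, (2, 'P1'): 5}
print(orders)       # {1: 'C1', 2: 'C1'}
print(customers)    # {'C1': 'Austin'}
```

After the last step, "Pen" and "Austin" are each stored once, so a change to a product name or a customer's city touches a single record.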