Data Storage & Database Design Flashcards
S2 M6
- a complex database schema that normalizes dimension tables into multiple related tables
- greatest number of foreign keys
- added flexibility & more detailed info about dimensions
snowflake schema
- a simple & common dimensional modeling schema with a central fact table surrounded by dimension tables
- fewer foreign keys
- easyer to understand & query
star schema
- simplest type of database schema where all data is stored in a single table
- lacks the complexity of relationships between multiple tables
- not suitable for storing detailed or structured data
flat model
database schema that organizes data in a tree-like structure where each parent has multiple child records
- one-to-many relationships
- not used for complex relational databases
hierarchical model
a very large data repository that is centralized & used for reporting & analysis rather than transactional purposes; must be continuously updated
data warehouse
a repository of transactional data from multiple sources, often an interim area between data sources & data warehouses
ODS operational data storage
a large data repository that is more focused on a specific purpose like marketing or logistics (subset of data warehouse)
data mart
a large repository that contains both structured & unstructured data, with data mostly being in its natural or raw form
data lake
a database design technique that reduces data redundancy & eliminates undesirable characteristics
normalization
List the 4 benefits associated with relational databases.
completeness
no redundancy
business rules enforcement
communication & integration of business processes
files that contain plain text with no structural interrelationships within that file (excel or CSV)
flat file
a column in a relational database table (must be unique)
attribute
a row in a relational database table
record
space created at the intersection of a column & row in a table in which data is entered
field
- each cell (field) in a table must contain only 1 piece of info
- each record in every table must be uniquely identified (primary or composite key)
1NF first normal form
all non-key attributes are required to depend on the entire primary key
2NF second normal form
ascertain that each column in a table describes only the primary key
3NF third normal form
a set of instructions to tell the database engine how to organize data to be in compliance with data models
database schema
conceptual representations of the data structures in an information system & are not restricted to relational databases only
data model
a high-level, big-picture representation of the data structures in an information system
conceptual data model
a more detailed representation of the data structures in an information system at the level of the data itself, giving more detail than a conceptual data model
logical data model
the most detailed representation of data structures that specifies how the data will be stored in the database
physical data model
table that contains measures or metrics (facts) that measure the business & have no descriptive elements
fact table