Data Concepts and Tools Flashcards
Final Exam Review
OLAP
(Online Analytical Processing): Analytical system used to answer multi-dimensional analytical queries quickly
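Example (a minimal sketch, not a real OLAP engine): the same facts are summarized along two different dimensions. The `sales` table, its columns, and the figures are hypothetical, and an in-memory SQLite database stands in for an analytical store.

```python
import sqlite3

# Hypothetical sales facts: (region, quarter, amount)
rows = [
    ("East", "Q1", 100), ("East", "Q2", 150),
    ("West", "Q1", 200), ("West", "Q2", 50),
]

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE sales (region TEXT, quarter TEXT, amount INTEGER)")
conn.executemany("INSERT INTO sales VALUES (?, ?, ?)", rows)

# Summarize the same facts along one dimension, then along another
by_region = dict(conn.execute(
    "SELECT region, SUM(amount) FROM sales GROUP BY region"))
by_quarter = dict(conn.execute(
    "SELECT quarter, SUM(amount) FROM sales GROUP BY quarter"))
```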
OLTP
(Online Transaction Processing): System that supports day-to-day transactions (insert, update, delete) in real time
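Example (a minimal sketch, assuming a hypothetical `accounts` table in an in-memory SQLite database): a transfer is written as one all-or-nothing transaction, so a failed update leaves no partial change behind.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE accounts ("
    "id INTEGER PRIMARY KEY, balance INTEGER CHECK (balance >= 0))"
)
conn.executemany("INSERT INTO accounts VALUES (?, ?)", [(1, 100), (2, 50)])
conn.commit()

def transfer(conn, src, dst, amount):
    """Move money between two accounts as a single transaction."""
    try:
        with conn:  # commits on success, rolls back if an exception is raised
            conn.execute(
                "UPDATE accounts SET balance = balance - ? WHERE id = ?",
                (amount, src))
            conn.execute(
                "UPDATE accounts SET balance = balance + ? WHERE id = ?",
                (amount, dst))
        return True
    except sqlite3.IntegrityError:
        return False

ok = transfer(conn, 1, 2, 30)        # succeeds
failed = transfer(conn, 1, 2, 500)   # would overdraw account 1, so it is rolled back
balances = dict(conn.execute("SELECT id, balance FROM accounts"))
```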
ETL (Extract, Transform, Load)
Process used to extract data from source systems, transform it into a suitable format, and load it into a data warehouse
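Example (a minimal sketch of the three steps): the CSV text, the `customers` table, and the cleaning rules are all hypothetical, and an in-memory SQLite database stands in for the data warehouse.

```python
import csv
import io
import sqlite3

# Extract: read rows from a (hypothetical) CSV export of a source system
source = io.StringIO(
    "name,signup_date,spend\n"
    " alice ,2024-01-05,10.50\n"
    "BOB,2024-02-11,n/a\n"
)
raw_rows = list(csv.DictReader(source))

def transform(row):
    """Trim and normalize the name; convert spend to cents (None if unparseable)."""
    try:
        cents = round(float(row["spend"]) * 100)
    except ValueError:
        cents = None
    return (row["name"].strip().title(), row["signup_date"], cents)

# Transform: clean every extracted row
clean_rows = [transform(r) for r in raw_rows]

# Load: write the cleaned rows into the warehouse table
warehouse = sqlite3.connect(":memory:")
warehouse.execute(
    "CREATE TABLE customers (name TEXT, signup_date TEXT, spend_cents INTEGER)")
warehouse.executemany("INSERT INTO customers VALUES (?, ?, ?)", clean_rows)
loaded = warehouse.execute("SELECT * FROM customers ORDER BY name").fetchall()
```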
Metadata
Data that describes other data (e.g., author, date created)
Data
Raw facts or figures with no context
Information
Data processed to be meaningful
Knowledge
Insight derived from information through experience or analysis
Data Governance
Policies, procedures, and roles that ensure data quality, privacy, and security
Data Quality
Accuracy, completeness, consistency, timeliness, and reliability of data
Databases
Structured collections of related data
Data Warehouses
Central repositories of integrated data from various sources, used for reporting and analysis
Data Marts
Subsets of data warehouses focused on specific business lines or teams
Referential Integrity
Ensures relationships between database tables are consistent (e.g., foreign keys referencing valid primary keys)
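Example (a minimal sketch, assuming hypothetical `customers` and `orders` tables in SQLite): an order that points at a nonexistent customer is rejected. SQLite only enforces foreign keys once `PRAGMA foreign_keys = ON` is set on the connection.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("PRAGMA foreign_keys = ON")  # enforcement is off by default in SQLite
conn.execute("CREATE TABLE customers (id INTEGER PRIMARY KEY, name TEXT)")
conn.execute(
    "CREATE TABLE orders ("
    "id INTEGER PRIMARY KEY, "
    "customer_id INTEGER NOT NULL REFERENCES customers(id))"
)
conn.execute("INSERT INTO customers VALUES (1, 'Ada')")
conn.execute("INSERT INTO orders VALUES (10, 1)")  # valid: customer 1 exists

try:
    conn.execute("INSERT INTO orders VALUES (11, 99)")  # no customer 99
    orphan_rejected = False
except sqlite3.IntegrityError:
    orphan_rejected = True

order_count = conn.execute("SELECT COUNT(*) FROM orders").fetchone()[0]
```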
Table Field Relationships
Define how data tables are connected (e.g., one-to-many, many-to-many)
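Example (a minimal sketch with hypothetical `students`, `courses`, and `enrollments` tables): a many-to-many relationship is commonly modeled as two one-to-many relationships that meet in a junction table.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE students (id INTEGER PRIMARY KEY, name TEXT);
    CREATE TABLE courses  (id INTEGER PRIMARY KEY, title TEXT);
    -- junction table: each row links one student to one course
    CREATE TABLE enrollments (
        student_id INTEGER REFERENCES students(id),
        course_id  INTEGER REFERENCES courses(id),
        PRIMARY KEY (student_id, course_id)
    );
    INSERT INTO students VALUES (1, 'Ada'), (2, 'Lin');
    INSERT INTO courses  VALUES (1, 'Databases'), (2, 'Statistics');
    INSERT INTO enrollments VALUES (1, 1), (1, 2), (2, 1);
""")

# Follow the relationships from the junction table out to both sides
roster = conn.execute("""
    SELECT s.name, c.title
    FROM enrollments e
    JOIN students s ON s.id = e.student_id
    JOIN courses  c ON c.id = e.course_id
    ORDER BY s.name, c.title
""").fetchall()
```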
Normalization
Organizing database structure to reduce redundancy and improve data integrity
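Example (a minimal sketch in plain Python, with hypothetical order data): customer details that repeat on every order row are moved into their own table and referenced by an ID, so each fact is stored once.

```python
# Hypothetical denormalized rows: customer details repeat on every order
flat_orders = [
    {"order_id": 1, "customer": "Ada", "email": "ada@example.com", "item": "Pen"},
    {"order_id": 2, "customer": "Ada", "email": "ada@example.com", "item": "Ink"},
    {"order_id": 3, "customer": "Lin", "email": "lin@example.com", "item": "Pad"},
]

customer_ids = {}     # email -> assigned customer_id
customer_table = []   # one row per distinct customer
order_table = []      # one row per order, pointing at a customer by ID

for row in flat_orders:
    email = row["email"]
    if email not in customer_ids:
        customer_ids[email] = len(customer_ids) + 1
        customer_table.append(
            {"customer_id": customer_ids[email],
             "name": row["customer"], "email": email})
    order_table.append(
        {"order_id": row["order_id"],
         "customer_id": customer_ids[email], "item": row["item"]})
```

After the split, changing a customer's email means updating one row in `customer_table` rather than every matching order.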
SQL Queries
Statements written in Structured Query Language (SQL) to read and modify data in relational databases (e.g., SELECT, INSERT, UPDATE, DELETE)
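Example (a minimal sketch, assuming a hypothetical `products` table in an in-memory SQLite database): the four statement types named above, run in sequence.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE products (id INTEGER PRIMARY KEY, name TEXT, price REAL)")

# INSERT: add rows
conn.execute("INSERT INTO products (name, price) VALUES (?, ?)", ("Pen", 1.5))
conn.execute("INSERT INTO products (name, price) VALUES (?, ?)", ("Ink", 4.0))

# UPDATE: change existing rows
conn.execute("UPDATE products SET price = ? WHERE name = ?", (2.0, "Pen"))

# DELETE: remove rows
conn.execute("DELETE FROM products WHERE name = ?", ("Ink",))

# SELECT: read rows back
remaining = conn.execute("SELECT name, price FROM products").fetchall()
```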
Data Validation
Ensuring entered data meets specific criteria (e.g., data type, range)
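Example (a minimal sketch): a data type check followed by a range check. The function name `validate_age` and the 0 to 120 limits are assumptions chosen for illustration.

```python
def validate_age(value):
    """Return a list of problems with the value; an empty list means it is valid."""
    errors = []
    # Data type check (bool is excluded because it is a subclass of int)
    if isinstance(value, bool) or not isinstance(value, int):
        errors.append("age must be an integer")
    # Range check, only meaningful once the type is right
    elif not 0 <= value <= 120:
        errors.append("age must be between 0 and 120")
    return errors
```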
Data Hierarchy
Levels of data organization from smallest to largest: bits → bytes → fields → records → files → databases
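Example (a minimal sketch, assuming a hypothetical fixed-width layout): two fields are packed into a record, and records are concatenated into the bytes of a file.

```python
import struct

# Hypothetical record layout: a 10-byte name field plus a 2-byte unsigned age field
RECORD = struct.Struct("<10sH")

records = [("Ada", 36), ("Lin", 29)]

# Fields -> records -> file contents
file_bytes = b"".join(
    RECORD.pack(name.encode("ascii"), age) for name, age in records)

record_size_bytes = RECORD.size          # 12 bytes per record
file_size_bits = len(file_bytes) * 8     # 8 bits per byte, so 192 bits

# Reading the file back recovers the original fields
decoded = [
    (name.rstrip(b"\x00").decode("ascii"), age)
    for name, age in RECORD.iter_unpack(file_bytes)
]
```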