Understanding Database Analytics v2 Flashcards
CRM stands for
Customer relationship management
This database is a resource containing all client information collected, governed, transformed, and shared across an organization. It includes marketing and sales reporting tools, which are useful for leading sales and marketing campaigns and increasing customer engagement.
Customer relationship management database
SCM stands for
Supply Chain Management
is management of the flow of goods, data, and finances related to a product or service, from the procurement of raw materials to the delivery of the product at its final destination.
Supply Chain Management
ETL stands for
Extract, Transform and Load
- Also known as an enterprise data warehouse, is a system used for reporting and data analysis and is considered a core component of business intelligence.
- Are central repositories of integrated data from one or more different sources.
Data Warehouse (DW)
DW stands for
Data warehouse
A centralized repository designed to store, process, and secure large amounts of structured, semistructured, and unstructured data. It can store data in its native format and process any variety of it, ignoring size limits.
Data lake
OLAP stands for
Online Analytical Processing
- A computing method that enables users to easily and selectively extract and query data in order to analyze it from different points of view.
- _____ business intelligence queries often aid in trends analysis, financial reporting, sales forecasting, budgeting and other planning purposes.
OLAP
OLTP stands for
Online Transaction Processing
A type of data processing that consists of executing a number of transactions occurring concurrently—online banking, shopping, order entry, or sending text messages, for example.
Online Transaction Processing
GSP stands for
Generalized Sequential Pattern
An algorithm used for sequence mining. The algorithms for solving sequence mining problems are mostly based on the apriori (level-wise) algorithm.One way to use the level-wise paradigm is to first discover all the frequent items in a level-wise fashion.
GSP algorithm (Generalized Sequential Pattern algorithm)
For frequent item set mining and association rule learning over relational databases. It proceeds by identifying the frequent individual items in the database and extending them to larger and larger item sets as long as those item sets appear sufficiently often in the database.
Apriori algorithm
An integrated system or database that enables the user to instantly analyze internal data and external data generated by the operation system of an enterprise over time, without the need for separate programming from multiple points of view, by integrating data by subject.
Data warehouse
Characteristics of the data warehouse (4)
- Subject-oriented
- Integrated
- Time variant
- Non-volatile
Characteristics of the data warehouse
Among the multiple types of operation system data that are managed by data business functions, the data of a specific subject needed for decision-making activities from an enterprise perspective is saved, while other data are not included.
Subject-oriented
Characteristics of the data warehouse
- The structure of a data warehouse is characterized by data consistency and physical unity through company-wide data standardization.
- When obtaining data from the operation system, a series of data conversion tasks are performed to integrate the data.
Integrated
Characteristics of the data warehouse
- To analyze past and present trends and forecast the future, a data warehouse retains data for a long time in the form of a series of snapshots.
- Users can understand the process of data change over time using the data history.
Time variant
Characteristics of the data warehouse
- A data warehouse is a read-only database that cannot be deleted or modified once it has been loaded from the operation system database.
- When a modification occurs in the operation system data, existing data are deleted. The data in the data warehouse stores the history of data at each point in time.
Non-volatile
Refers to a data modeling technique that, from a data analysis perspective, enables users to analyze large-scale data from various viewpoints.
Data warehouse modeling
Are generally organized into fact tables and dimension tables so that end users or analysts can easily analyze information.
Data
Components of Data warehouse modeling (2)
- Fact table
- Dimension table