Terms Flashcards
ACID Test
A test applied to data for atomicity, consistency, isolation and durability.
Aggregation
A process of searching, gathering and presenting data.
Algorithm
A mathematical formula or statistical process used to perform analysis of data.
Anomaly Detection
The process of identifying rare or unexpected items or events in a dataset that do not conform to other items in the dataset and do not match a projected pattern or expected behavior. Anomalies are also called outliers, exceptions, surprises or contaminants and they often provide critical and actionable information.
Big Data
Big data refers to datasets whose size is beyond the ability of typical database software tools to capture, store, manage and analyze.
Business Intelligence
The general term used for the identification, extraction and analysis of data
Classification Analysis
A systematic process for obtaining important and relevant information about data (metadata) and assigning data to a particular group or class.
Clustering Analysis
The process of identifying objects that are similar to each other and clustering them in order to understand the differences as well as the similarities within the data.
Columnar Database or Column-oriented Database
A database that stores data by column rather than by row. In a row-based database, a row might contain a name, address and phone number. In a column-oriented database, all names are in one column, addresses in another and so on. A key advantage of a columnar database is faster hard disk access.
Comparative Analysis
Data analysis that compares two or more data sets or processes to detect patterns within very large data sets.
Correlation Analysis
A means to determine a statistical relationship between variables, often for the purpose of identifying predictive factors among the variables. A technique for quantifying the strength of the linear relationship between two variables.
Dashboard
A graphical representation of analyses performed by algorithms.
Data
A quantitative or qualitative value. Common types of data include sales figures, marketing research results, readings from monitoring equipment, user actions on a website, market growth projections, demographic information and customer lists.
Data Aggregation
The process of collecting data from multiple sources for the purpose of reporting or analysis.
Data Analyst
A person responsible for the tasks of modelling, preparing and cleaning data for the purpose of deriving actionable information from it.