Exam 1 Terms Flashcards
Datasets that are too large and complex for businesses’ existing systems to handle utilizing their traditional capabilities to capture, store, manage, and analyze these datasets.
Big Data
A data approach that attempts to assign each unit in a population into a few categories potentially to help with predications.
Classification
A data approach that attempts to divide individuals (like customers ) into groups (or clusters) in a useful way.
Clustering
A data approach that attempts to discover associations between individuals based on transactions involving them.
Co-occurrence grouping
The process of evaluating data with the purpose of drawing conclusions to address business questions.
Data Analytics
Centralized repository of descriptions for all of the data attributes of the dataset.
Data dictionary
A data approach that attempts to reduce the amount of information that needs to be considered to focus on the most critical items.
Data reduction
A data approach that attempts to predict a relationship between two data items.
Link Prediction
A variable that predicts or explains another variable.
Predictor variable.
A data approach that attempts to characterize the “typical” behavior of an individual, group, or population by generating summary statistics about the data.
Profiling
A data approach that attempts to estimate or predict, for each unit, the numerical value of some variable using some type of statistical model.
Regression
A variable that responds to, or is dependent on, another.
Response variable
A data approach that attempts to identify similar individuals based on data known about them.
Similarity matching
Data that are organized and reside in a fixed field with a record or a file.
Structured data
Data that do not adhere to a predefined data model in a tabular format.
Unstructured data
A system that records, processes, reports, and communicates the results of business transactions to provide financial and nonfinancial information for decision- making purposes.
Accounting information system
A special case of primary key that exists in linking tables. (made up of two primary keys in the table that it is linking)
Composite primary key
An information system for managing all interactions between the company and its current and potential customers.
Customer Relationship Management system (CRM)
Centralized repository of descriptions for all of the data attributes of the dataset.
Data dictionary
A method for obtaining data if you do not have access to obtain the data directly yourself.
Data request form
Attributes that exist in relational databases that are neither primary nor foreign keys. Provide business information.
Descriptive attributes
A category of business management software that integrates applications from throughout the business into one system.
Enterprise Resource Planning system (ERP)
The extract, transform, and load process that is integral to mastering the data.
ETL
A means of storing data in one place, such as in an Excel spreadsheet, as opposed to storing the data in multiple tables, such as in a relational database.
Flat line