Chapter 6: Foundations of Business Intelligence: Databases and Information Management Flashcards
Analytic Platform
Preconfigured hardware-software system that is specifically designed high-speed analysis of large datasets.
Attribute
A piece of information describing a particular entity.
Big Data
Datasets with volumes so huge that they are beyond the ability of typical relational DBMS to capture, store, and analyze. The data are often unstructured or semi-structured.
Bit
A binary digit representing the smallest unit of data in a computer system. It can only have one of two states, representing 1 or 2..
Byte
A string of bits, usually eight, used to store one number or character in a computer system.
Data Administration
A special organizational function for managing the organization’s data resources, concerned with information policy, data planning, maintenance of data dictionaries, and data quality standards.
Data Cleansing
Activities for detecting and correcting data in a database or file that are incorrect, incomplete, improperly formatted, or redundant. Also known as data scrubbing.
Data Definition
DBMS capability that specifies the structure and content of the database.
Data Dictionary
An automated or manual tool for storing and organizing information about the data maintained in a database.
Data Governance
Policies and processes for managing the availability, usability, integrity, and security of the firm’s data.
Data Inconsistency
The presence of different values for same attribute when the same data are stored in multiple locations.
Data Manipulation Language
A language associated with a database management system that end users and programmers use to manipulate data in the database.
Data Mart
A small data warehouse containing only a portion of the organization’s data for a specified function or population of users.
Data Mining
Analysis of large pools of data to find patterns and rules that can be used to guide decision making decision making and predict future behavior.
Data Quality Audit
A survey and/or sample of files to determine accuracy and completeness of data in an information system.
Data Redundancy
The presence of duplicate data in multiple data files.
Data Warehouse
A database, with reporting and query tools, that stores current and historical data extracted from various operational systems and consolidated for management reporting and analysis.
Database
A group of related files.
OR A collection of data organized to service many applications at the same time by storing and managing data so that they appear to be in one location.
Database Administration
Refers to the more technical and operational aspects of managing data, including physical database design and maintenance.