IT: Chapter 5: Foundations of business Intelligence Flashcards
Database
a collection of related files containing records on people, places, or things.
Entity
generalized category representing person, place, thing on which we store and maintain information
Attributes
specific characteristics of each entity
Relational database
organize data into two-dimensional tables (relations) with columns and rows
Fields
store data representing an attribute
Rows
store data for separate records, records or tuples
Key Field
uniquely identifies each record
Primary Key
the unique identifier for all the information in any row of the table, and this primary key cannot be duplicated
Foreign Key
a look-up field to find data about the supplier of a specific part
Entity-relationship diagram
used to clarify table relationships in a relational database
Join table or an intersection relation
link two tables in a table that joins information
Normalization
the process of streamlining complex groups of data to minimize redundant data elements and awkward many-to-many relationships, and increase stability and flexibility
Referential Integrity rules
to ensure that relationships between coupled tables remain consistent
Database Management Systems DBMS
a specific type of software for creating, storing, organizing, and accessing data from a database
Logical View
how end users view data
Physical View
show s how data are actually organized and structured on physical storage media, such as a hard disk
Operations of a Relational DBMS ABBREV
SJP
Operations of a Relational DBMS
Select
Join
Project
Select
Creates a subset of all records meeting stated criteria
Join
combines relational tables to represent the server with more information than is available from individual tables
Project
permits user to create new tables containing only desired information
Data Definition
specify the structure of the content of the database
Data Dictionary
automated or manual file that stores definition of data elements and their characteristics
Data Manipulation language
used to add, change, delete, and retrieve the data in the database
Structured Query Language (SQL)
to retrieve information they needed from the database
Object Oriented DBMS
stores the data, and procedures that act on those data as objects that can be automatically retrieved and shared
Hybrid object-relational DBMS
Provide capabilities for both object-oriented and relational DBMS
Data Warehouse
a database that stores current and historical data of potential interest to decision makers throughout the company
Data Mart
a subset of a data warehouse in which a summarized or highly focused portion of the organization’s data is placed in a separate database for a specific population of users.
Online analytical Processing (OLAP)
supports multidimensional data analysis, enabling users to view the same data in different ways using multiple dimensions
Data mining
finding hidden patterns and relationships in large databases and infers rules from them to predict future behavior
Associations
occurrences inked to single event
Sequences
events linked over time
Classifications
describe the group to which an item belongs by examining existing items that have been classified and by inferring a set of rules
Clustering
works like classification when no groups have yet become defined
Forecasting
uses a series of existing values to forecast what other values will be
Predictive Analysis
uses data mining techniques, historical data and assumptions about future conditions to predict outcomes of events, such as the probability a customer will respond to an offer or purchase a specific product
Text Mining
able to extract key element from large unstructured data sets, discover patters and relationship, and summarize the information
Web Mining
the discovery and analysis of useful patterns and information from the Web
Content Mining
the process of extracting knowledge from the content of Web pages, which may include text, image, audio, and video data
Structure Mining
examines data related to the structure of a particular website
Usage Mining
examines user interaction data recorded by a Web server
Information Policy
specifies the organization’s rules for sharing, disseminating, acquiring, standardizing, classifying, and inventory information
Data administration
responsible for specific policies and procedures through which data can be managed as an organizational resource
Database administration
database design management group responsible for defining and organizing the structure and content of the database, and maintaining the database
Poor Data Quality
major obstacle to successful customer relationship management
Data Quality Audit
structured survey of the accuracy and completeness of the data
Data Cleansing
aka data scrubbing, consists of activities for detecting and correcting data in a database that are incorrect, incomplete, improperly formatted, or redundant.