CHAPTER 6 FOUNDATIONS OF BUSINESS INTELLIGENCE: DATABASES INFORMATION MANAGEMENT Flashcards
A grouping of characters into a word, a group of
words, or a complete number
field
group of related fields, such as the student’s name, the course taken, the date, and the grade
record
a group of records of the same type.
file
is a person, place, thing, or event on which we store and maintain information.
entity
characteristic or quality describing a particular entity
attribute
occurs when different groups in an organization independently collect the same piece of data and store it independently of each other.
Data redundancy
where the same attribute may have different values.
data inconsistency
refers to the coupling of data stored in files and the
specific programs required to update and maintain those files such that changes in programs require changes to the data.
Program-data dependence
is software that permits an organization to centralize data, manage them efficiently, and provide access to the stored data by application programs.
database management system (DBMS)
A group of related files also it is a collection of data organized to serve many applications efficiently by centralizing the data and controlling redundant data.
database
Relational databases represent data as two-dimensional tables (called relations). Tables may be referred to as files. Each table contains data on an entity and its attributes.
relational DBMS
The actual information about a single supplier that resides in a table is called a row. Rows are commonly referred to as records, or in very technical terms, as tuples.
tuples
The field in the table that uniquely identifies each record so that the record can be retrieved, updated, or sorted on
key field
Each table in a relational database has one field that is
designated as its primary key. This key field is the unique identifier for all the information in any row of the table and this primary key cannot be duplicated.
primary key
is essentially a lookup field to look up data about the supplier of a specific part.
foreign key
stores the data and procedures that act on those
data as objects that can be automatically retrieved and shared.
object-oriented DBMS
capability to specify the structure of the content
of the database.
data definition
is an automated or manual file that stores definitions of data elements and their characteristics.
data dictionary
used to add, change, delete, and retrieve the data in the database.
data manipulation language
The most prominent data manipulation language today
Structured Query Language, or SQL
The process of creating small, stable, yet flexible and
adaptive data structures from complex groups of data
normalization.
rules to ensure that relationships between coupled tables remain consistent
referential integrity
Database designers document their data model using
diagrams that illustrates the relationship between the entities
entity-relationship diagram
is a database that stores current and historical data of
potential interest to decision makers throughout the company.
data warehouse
is a subset of a data warehouse in which a summarized or highly focused portion of the organization’s data is placed in a separate database for a specific population of users.
data mart
supports multidimensional data analysis, enabling users to
view the same data in different ways using multiple dimensions.
online analytical processing (OLAP)
provides insights into corporate data that cannot be obtained with OLAP by finding hidden patterns and relationships in large databases and inferring rules from them to predict future behavior.
Data mining
types of information obtainable from data mining
Associations:are occurrences linked to a single event.
sequences: events are linked over time.
Classification: recognizes patterns that describe the group to which an item belongs by examining existing items that have been classified and by inferring a set of rules.
Clustering: works in a manner similar to classification when no groups have yet been defined.
forecasting:It uses a series of existing values to forecast what other values will be.
use data mining techniques, historical data, and assumptions about future conditions to predict outcomes of events, such as the probability a customer will respond to an offer or purchase a specific product.
Predictive analytics
used with unstructured data, to help businesses analyze these such data
Text mining
The discovery and analysis of useful patterns and information from the World Wide Web
Web mining
In a client/server environment, the DBMS resides on a
dedicated computer
database server
specifies the organization’s rules for sharing,
disseminating, acquiring, standardizing, classifying, and inventorying information.
information policy
is responsible for the specific policies and procedures through which data can be managed as an organizational
resource
Data administration
deals with the policies and processes for managing the availability, usability, integrity, and security of the data employed in an enterprise, with special emphasis on promoting privacy, security, data quality, and compliance with government regulations.
data governance
responsible for defining and organizing the structure and content of the database, and maintaining the database.
database administration
a structured survey of the accuracy and level of
completeness of the data in an information system. Data quality audits can be performed by surveying entire data files, surveying samples from data files, or surveying end users for their perceptions of data quality.
data quality audit
consists of activities for detecting and correcting data in a database that are incorrect, incomplete, improperly formatted, or redundant.
Data cleansing or data scrubbing
Generalized category representing person, place, thing on which we store & maintain info
Entity
-Specific characteristics of each entity
-Supplier name, address
-Part description, unit price, supplier
Attributes
-Organize data into 2-dimenstional tables w/ columns & rows
-Cone table
-Rows store data for separate records
Relational database
Collection of related files containing records on people, places or things
Database
Uniquely identifies each record
Key Field
Store data representing an attribute
Fields
-One field in each table
-Cannot be duplicated
-Provides unique identifier for all info in any row
Primary Key
-Specific type of software for creating, storing, organizing & accessing data from a database
-Separates the logical & physical views of data
Database Management System (DBMS)
How data are actually structured & organized
-Microsoft Access
-DB2
-Oracle Database
Physical View
Data manipulation lanuage
- SQL
Report generation
Querying & reporting
How end users view data
Logical View
Data definition capabilities
Specify structure of content of database
Capabilities of DBMS
-Data definition
-Data dictionary
-Querying & reporting
Automated or manual file storing definitions of data elements and their charateristics
Data dictionary
Operations of relational DBMS
-Select
-Join
-Project
Creates a subset of all records meeting stated criteria
Select
Combines relational tables to present the server with more information than is available from individual tables
Join
-Creates a subset consisting of columns in a table
-Permits user to create new tables containing only desired info
Project
A special-purpose programming language designed for managing data held in a relational database management system…
Structured Query Language (SQL)
-Stores data & procedures that act on those data as object to be retrieved and shared
-Used to manage multimedia components
-Relatively slow compared to relational DBMS
Object oriented database
-Database that stores current & historical data that may be of interest to decision makers
-Standardizes data from many systems
-Data can be accessed but not altered
Data warehouse
Subset of warehouses, highly focused and isolated for a specific population of users
Data mart
Tools for consolidating & providing access to large amounts of data to improve decision making
Business Intelligence
-supports multidimensional analysis
-allows users to view the same data in different ways
-obtain online answers to questions in a rapid time
Online Analytical Processing (OLAP)
-Finds hidden patterns & relationships in large databases & infers rules from them to predict future behavior
Data Mining
Info obtained from data mining
-Associations
-Sequences
-Classifications
-Clustering
-Forecasting
Design & Mgmt group responsible for defining & organizing the structure & content of the database, and maintaining the database
Database administration
-Uses mining techniques, historical data & assumptions about future conditions to predict outcomes of events such as the probability a customer will respond to an offer or purchase a specific product
Predictive analysis
States organization’s rules for organizing, managing, storing & sharing info
Information policy
Responsible for specific policies and procedures through which data can be managed as a resource
Data administration
Major obstacle to successful customer reltionship
Poor data quality
Structured survey of the accuracy and completeness of data
Data quality audit
Detects & corrects incorrect, incomplete, improperly formatted & redundant data
Data cleansing