CHP. 8 Flashcards
Information overload
An overabundance of irrelevant data.
Dirty data
Problematic data. Examples are a value of B for customer gender and a value of 213 for customer age. Other examples are a value of 999-999-9999 for a North American phone number, a part colour of green, and an email address of WhyMe@GuessWhoIAM.org. All these values are problematic when data mining.
Granularity
The level of detail in data. Customer name and account balance are large granularity data. Customer name, balance, and the order details and payment history of every customer order are smaller granularity.
Clickstream data
E-commerce data that describe a customer’s clicking behaviour. Such data include everything the customer does at the website.
Online transaction processing (OLTP)
Collecting data electronically and processing transactions online.
Online Analytic Processing or (OLAP) and Decision support systems (DSSs)
Systems that focus on making data collected in OLTP useful for decision making.
Online analytic processing (OLAP)
A dynamic type of reporting system that provides the ability to sum, count, average, and perform other simple arithmetic operations on groups of data. Such reports are dynamic because users can change the format of the reports while viewing them.
Drill down
With an OLAP report, to further divide the data into more detail.
Data resource challenge
Occurs when data are collected in OLTP but are not used to improve decision making.
Business intelligence (BI) system
A system that provides the right information, to the right user, at the right time. A tool produces the information, but the system ensures that the right information is delivered to the right user at the right time.
Group decision support systems (GDSSs)
An application that enables more than one individual to undertake a decision. Often includes voting and brainstorming functions.
Reporting systems
Systems that create information from disparate data sources and deliver that information to the proper users on a timely basis.
Data-mining system
Information system that processes data using sophisticated statistical techniques, such as regression analysis and decision-tree analysis, to find patterns and relationships that cannot be found by simpler operations, such as sorting, grouping, and averaging.
Market-basket analysis
A data-mining technique for determining sales patterns. A market-basket analysis shows the products that customers tend to buy together.
Knowledge management (KM) systems
Information systems for storing and retrieving organizational knowledge, whether that knowledge is in the form of data, documents, or employee know-how.