Unit 3 Flashcards
The amount of data we are creating as companies and individuals is greatly increased
Volume
Format and types of data greatly differs
Variety
New data is being created every millisecond in our world
Velocity
- Lots of data exist in old legacy systems.
- For companies to take advantage of data information must be accessible.
- Is a large database repository that consolidates silo’ed data
Data Warehouse
- Data is in too many places
- Data is “dirty” or missing values
- Data is not maintained consistently
- Data is hard to retrieve from legacy systems.
- Too much data
Problems with Operational Data
- Integrate data from multiple source
- Process data by sorting, grouping, summing, averaging, and comparing
- Results formatted into reports
- Improve decision making process
Reporting System
- Non trivial discovery of novel, valid, comprehensible and potentially useful patterns from data
- Descriptive and predictive
- Looking for patterns and relationships to anticipate events or predict outcomes
Data Mining Systems
- Create value from intellectual capital
- Collect and share hum knowledge
- Supported by five components of IS
- Foster innovation
- Increase company organizational responsive
Knowledge Management System
- Encapsulate experts knowledge
- Produce If/Then Rules
- Improve diagnosis and decision making non experts
Expert System
is extracting useful information from large datasets that would be hard to analyze without an information system
Data Mining Systems
A and B both occurred / A occurred
Confidence
Confidence / Benchmark Confidence
Lift
% B occurs overall in a dataset overall
Benchmark Confidence
A sequence of activities to accomplish an objective
Process Flow
Systematic way of creating, assessing, and altering business processes as needed
Business Process Management (BPM)