Chapter 8: Data Mining Flashcards
What is Data Mining?
The process of analyzing large amounts of data to discover patterns, relationships, and trends that cannot easily be discovered through slicing and dicing techniques.
What is: “The process of analyzing large amounts of data to discover patterns, relationships, and trends that cannot easily be discovered through slicing and dicing techniques.” ?
Data Mining
What is the greatest advantage of data mining?
The capacity for analyzing large datasets where the nuggets are difficult to identify.
What are the 7 -V’s of Big Data?
Value Volatility Volume Variety Velocity Veracity Variability
The fact, we can assert that ______ is the “big” in big data.
Volume
______ refers to the speed at which data are generated by and collected from source systems.
Velocity
What refers to how quickly data can be processed so as to provide a feedback loop.
Velocity
Changes in the meaning of data over time or in context is known as what?
Variability
In language what is an example of word variablility?
Hot, cool, fit: context of the word
______ is the reliability or truthfulness of data?
Veracity
What refers to the lifespan of data?
Volatility
What is the biggest driving force in data?
Value: does it provide value to support business decisions
What is the most predictable type of data mining system?
deterministic system
What type of system is a Chaotic system?
it is deterministic but is highly sensitive to fluctuations in input conditions
What is a stochastic system?
Non-deterministic system, they do not have deterministic laws or rules.
What are the 5 steps of the data mining process?
Data Staging Data Mining Model Validation Deployment Monitoring
_____ models use past data to analyze and discover trends, patterns, and relationships with the goal of applying results to future data to make predictions.
Predictive
_____ models for analytics are used to describe trends, patterns, and relationships in existing data without making predictions about future outcomes.
Descriptive
What is unsupervised data mining?
Another name for descriptive modeling
Predictive modeling has which other name?
supervised data mining
What is a prescriptive model?
They answer the question, what action should be taken based upon the results from prescriptive models.
Anticipatory models do what?
seek to determine what might happen in the future in order to inform business decisions.