Chapter 8: Data Mining Flashcards
What is Data Mining?
The process of analyzing large amounts of data to discover patterns, relationships, and trends that cannot easily be discovered through slicing and dicing techniques.
Which term is defined as “the process of analyzing large amounts of data to discover patterns, relationships, and trends that cannot easily be discovered through slicing and dicing techniques”?
Data Mining
What is the greatest advantage of data mining?
The capacity to analyze large datasets in which the valuable findings (the “nuggets”) are difficult to identify.
What are the seven V’s of big data?
Value, Volatility, Volume, Variety, Velocity, Veracity, Variability
In fact, we can assert that ______ is the “big” in big data.
Volume
______ refers to the speed at which data are generated by and collected from source systems.
Velocity
What refers to how quickly data can be processed so as to provide a feedback loop?
Velocity
Changes in the meaning of data over time or in context are known as what?
Variability
In language, what is an example of word variability?
Words such as “hot,” “cool,” and “fit,” whose meaning depends on the context in which they are used.
______ is the reliability or truthfulness of data.
Veracity
What refers to the lifespan of data?
Volatility
What is the biggest driving force in data?
Value: whether the data provide value that supports business decisions.
What is the most predictable type of data mining system?
A deterministic system
What type of system is a Chaotic system?
It is deterministic, but it is highly sensitive to fluctuations in input conditions.
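To make the chaotic-system card concrete, here is a minimal Python sketch. It uses the logistic map with r = 4, a standard textbook example of chaos that is chosen here only for illustration and is not taken from the chapter. The function name, starting values, and step count are all assumptions.

```python
def logistic_trajectory(x0, steps, r=4.0):
    """Iterate the logistic map x -> r * x * (1 - x), starting from x0."""
    xs = [x0]
    for _ in range(steps):
        xs.append(r * xs[-1] * (1.0 - xs[-1]))
    return xs

# Two runs whose starting values differ by only 1e-10 (illustrative values).
a = logistic_trajectory(0.2, 100)
b = logistic_trajectory(0.2 + 1e-10, 100)

# Absolute gap between the two runs at every step.
gaps = [abs(p - q) for p, q in zip(a, b)]
```

The rule contains no randomness, so repeating a run with exactly the same input reproduces the same trajectory. Comparing gaps[1] with max(gaps) should show a tiny early difference that later grows to the same scale as the values themselves.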
What is a stochastic system?
A non-deterministic system: its behavior is not governed by deterministic laws or rules.
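The contrast between a deterministic and a stochastic system can be sketched in a few lines of Python. Both functions below are hypothetical toy models, not examples from the chapter: the first always maps the same input to the same output, while the second adds a random term, so the same input can produce different outputs.

```python
import random

def deterministic_system(x):
    """Same input always gives the same output (toy rule, an assumption)."""
    return 2 * x + 1

def stochastic_system(x, rng):
    """Same toy rule plus Gaussian noise drawn from rng, so outputs vary."""
    return 2 * x + 1 + rng.gauss(0.0, 1.0)

# An unseeded generator stands in for a source of randomness. A seeded
# pseudo-random generator would itself be reproducible.
rng = random.Random()
outputs = [stochastic_system(3, rng) for _ in range(5)]
```

Calling deterministic_system(3) repeatedly returns the same number each time, whereas the values in outputs will generally differ from one another and from run to run.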