Chapter 8: Data Mining Flashcards

1
Q

What is Data Mining?

A

The process of analyzing large amounts of data to discover patterns, relationships, and trends that cannot easily be discovered through slicing and dicing techniques.

2
Q

What is “the process of analyzing large amounts of data to discover patterns, relationships, and trends that cannot easily be discovered through slicing and dicing techniques”?

A

Data Mining

3
Q

What is the greatest advantage of data mining?

A

The capacity to analyze large datasets in which the valuable nuggets of information are difficult to identify.

4
Q

What are the 7 V’s of Big Data?

A
Value
Volatility
Volume
Variety
Velocity
Veracity
Variability
5
Q

In fact, we can assert that ______ is the “big” in big data.

A

Volume

6
Q

______ refers to the speed at which data are generated by and collected from source systems.

A

Velocity

7
Q

What refers to how quickly data can be processed so as to provide a feedback loop?

A

Velocity

8
Q

Changes in the meaning of data over time or in context are known as what?

A

Variability

9
Q

In language, what is an example of word variability?

A

Words such as hot, cool, and fit: their meaning depends on the context in which they are used.

10
Q

______ is the reliability or truthfulness of data.

A

Veracity

11
Q

What refers to the lifespan of data?

A

Volatility

12
Q

What is the biggest driving force in data?

A

Value: whether the data provide value that supports business decisions.

13
Q

What is the most predictable type of data mining system?

A

Deterministic system

14
Q

What type of system is a chaotic system?

A

It is deterministic but highly sensitive to fluctuations in input conditions.
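
A minimal sketch of this behavior, assuming the logistic map as a stand-in chaotic system. The map, the parameter r = 4, the starting value 0.2, and the function name `logistic_map` are illustrative assumptions, not part of the card. Identical inputs reproduce the same trajectory exactly, while a perturbation of one billionth grows until the two trajectories no longer resemble each other.

```python
def logistic_map(x0, r=4.0, steps=50):
    """Iterate x -> r * x * (1 - x) and return the whole trajectory."""
    xs = [x0]
    for _ in range(steps):
        xs.append(r * xs[-1] * (1.0 - xs[-1]))
    return xs

a = logistic_map(0.2)
b = logistic_map(0.2)          # same input: same trajectory (deterministic)
c = logistic_map(0.2 + 1e-9)   # tiny change in the input condition

is_repeatable = a == b
early_gap = abs(a[1] - c[1])                         # still tiny after one step
max_gap = max(abs(x - y) for x, y in zip(a, c))      # large after many steps

print(is_repeatable, early_gap, max_gap)
```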

15
Q

What is a stochastic system?

A

A non-deterministic system; it does not follow deterministic laws or rules.
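
A minimal sketch, assuming a one-dimensional random walk as an example of a stochastic process. The function name `random_walk` and the seeds are hypothetical; each seed stands in for the randomness of a separate run and is used only so the sketch is reproducible. Two runs start from the same position yet follow different paths, because each step is drawn at random rather than fixed by a rule.

```python
import random

def random_walk(seed, steps=100):
    """Start at 0 and move +1 or -1 at random on every step."""
    rng = random.Random(seed)  # private generator, so runs do not interfere
    position = 0
    path = [position]
    for _ in range(steps):
        position += rng.choice([-1, 1])
        path.append(position)
    return path

run_a = random_walk(seed=1)
run_b = random_walk(seed=2)     # same start, different random draws
replay_a = random_walk(seed=1)  # replaying the same draws repeats the path

print(run_a[-1], run_b[-1])
```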

16
Q

What are the 5 steps of the data mining process?

A
Data Staging
Data Mining Model
Validation
Deployment
Monitoring
17
Q

_____ models use past data to analyze and discover trends, patterns, and relationships with the goal of applying results to future data to make predictions.

A

Predictive
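
A minimal sketch of the predictive idea, assuming an ordinary least-squares line as the model. The sales figures are made up for illustration, and `fit_line` is a hypothetical helper. The line is fitted to past months and then applied to a month that has not happened yet.

```python
def fit_line(xs, ys):
    """Return (slope, intercept) of the least-squares line through the points."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept

# Past data (illustrative values only).
months = [1, 2, 3, 4, 5]
sales = [12.0, 14.0, 16.0, 18.0, 20.0]

slope, intercept = fit_line(months, sales)

# Apply the fitted relationship to future data: month 6.
forecast = slope * 6 + intercept
print(forecast)
```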

18
Q

_____ models for analytics are used to describe trends, patterns, and relationships in existing data without making predictions about future outcomes.

A

Descriptive
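
A minimal sketch of the descriptive idea, assuming a small set of made-up shopping baskets. It counts how often pairs of items appear together, which describes a relationship in the existing data and makes no prediction about future baskets.

```python
from collections import Counter
from itertools import combinations

# Existing data (illustrative values only): one set of items per transaction.
transactions = [
    {"bread", "milk"},
    {"bread", "butter"},
    {"bread", "milk", "butter"},
    {"milk", "eggs"},
]

pair_counts = Counter()
for basket in transactions:
    # Sort so each pair is always recorded in the same order.
    for pair in combinations(sorted(basket), 2):
        pair_counts[pair] += 1

for pair, count in pair_counts.most_common():
    print(pair, count)
```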

19
Q

What is unsupervised data mining?

A

Another name for descriptive modeling

20
Q

Predictive modeling has which other name?

A

Supervised data mining

21
Q

What is a prescriptive model?

A

A model that answers the question of what action should be taken, based upon the results from predictive models.

22
Q

Anticipatory models do what?

A

They seek to determine what might happen in the future in order to inform business decisions.