Module 2 - Data Mining and Visualization Flashcards

1
Q

Which two frameworks describe ML solutions?

A
• Descriptive, Diagnostic, Predictive, Prescriptive

• Classification, Regression, Clustering, Association

2
Q

Which ML techniques are supervised, and what does that mean?

A

Classification, Regression

The algorithm is trained with labeled data
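
As a minimal sketch of what "trained with labeled data" means, the example below uses a one-nearest-neighbour classifier. The function `predict_1nn`, the data, and the labels are hypothetical and chosen only for illustration; they are not from the course material.

```python
def predict_1nn(training, x):
    """Return the label of the labeled training example closest to x."""
    nearest_value, nearest_label = min(training, key=lambda pair: abs(pair[0] - x))
    return nearest_label

# Hypothetical labeled data: each example pairs a feature value with a known label.
labeled = [(1.0, "small"), (1.5, "small"), (9.0, "large"), (10.0, "large")]

print(predict_1nn(labeled, 2.0))  # closest labeled example is 1.5
print(predict_1nn(labeled, 8.0))  # closest labeled example is 9.0
```

The labels are what make this supervised: without them, the same feature values could only be grouped, not classified.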

3
Q

Which is preferable: more and better data, or a more sophisticated algorithm?

A

More and better data generally beats more sophisticated algorithms

4
Q

Briefly explain the “Toyota way” of determining bottlenecks.

A

Each machine is in one of two states: active or not active. The bottleneck is the machine with the longest active period.
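
The idea can be sketched as follows, assuming that "longest active period" means the longest uninterrupted run of active samples and that states are logged at equal intervals. The machine names, logs, and helper functions are hypothetical.

```python
from itertools import groupby

def longest_active_period(states):
    """Return the length of the longest uninterrupted run of True (active) samples."""
    return max((sum(1 for _ in run) for active, run in groupby(states) if active), default=0)

def find_bottleneck(machine_states):
    """Return the machine whose longest uninterrupted active period is greatest."""
    return max(machine_states, key=lambda name: longest_active_period(machine_states[name]))

# Hypothetical state logs sampled at equal intervals: True = active, False = not active.
logs = {
    "M1": [True, True, False, True, False, False],
    "M2": [True, True, True, True, False, True],
    "M3": [False, True, True, False, True, True],
}

bottleneck = find_bottleneck(logs)
print(bottleneck)
```

If total active time is the intended measure instead, `longest_active_period` could be swapped for `sum(states)`.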

5
Q

Name an aspect other than long active periods that can indicate a bottleneck.

A

Unique machine behavior

6
Q

What does skewness in measures of shape refer to?

A

The distortion or asymmetry of a distribution relative to a symmetric one, such as the normal distribution

7
Q

How much skew does a normal distribution have?

A

Zero

8
Q

What does positively skewed data mean?

A

The distribution has a longer right tail; the mean is typically greater than the median
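
A small sketch of how skewness might be checked numerically, using the moment-based (Fisher-Pearson) coefficient, which is one common definition. The sample values are hypothetical.

```python
from statistics import mean, median

def skewness(values):
    """Moment-based skewness: third central moment divided by variance ** 1.5."""
    n = len(values)
    m = mean(values)
    m2 = sum((x - m) ** 2 for x in values) / n  # second central moment (variance)
    m3 = sum((x - m) ** 3 for x in values) / n  # third central moment
    return m3 / m2 ** 1.5

# Hypothetical sample with a long right tail.
sample = [1, 2, 2, 3, 3, 3, 4, 20]

print(skewness(sample) > 0)            # positive skew
print(mean(sample) > median(sample))   # mean is pulled toward the tail
print(skewness([1, 2, 3, 4, 5]))       # a symmetric sample has zero skew
```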

9
Q

How would you describe a cluster analysis?

A

A technique for finding patterns in unclassified data by grouping objects into clusters, so that objects within a cluster are similar to each other but different from objects in other clusters.
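
To make this concrete, here is a minimal sketch using k-means on one-dimensional data. k-means is only one of many clustering methods and is chosen here as an assumption; the points and starting centers are hypothetical.

```python
def kmeans_1d(points, centers, iterations=10):
    """Tiny k-means on 1-D data: assign each point to its nearest center, then recompute centers."""
    centers = list(centers)
    for _ in range(iterations):
        clusters = [[] for _ in centers]
        for p in points:
            nearest = min(range(len(centers)), key=lambda i: abs(p - centers[i]))
            clusters[nearest].append(p)
        # Keep the old center if a cluster ends up empty.
        centers = [sum(c) / len(c) if c else centers[i] for i, c in enumerate(clusters)]
    return clusters, centers

# Hypothetical unlabeled measurements that form two visible groups.
points = [1.0, 1.2, 0.8, 8.0, 8.3, 7.9]
clusters, centers = kmeans_1d(points, centers=[0.0, 10.0])
print(clusters)
```

No labels are supplied: the groups emerge purely from how close the points are to each other.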

10
Q

What is the role of a data producer?

A

Generate data

Must understand why the data is collected

11
Q

What is the role of a data user?

A

Use the data and derive relevance from it

12
Q

What is the role of a data custodian?

A

Store and maintain data; also responsible for its security
