8.4 Predictive Analytics Data Flow Flashcards

1
Q

What is the first step in the data flow for predictive analytics?

A

Load the data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

After loading the data, what should you do next?

A

Split the data into training and testing sets.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Why is some data set aside and not used in training or tuning?

A

It’s used as a validation set to ensure unbiased evaluation during hyperparameter tuning.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

When should you train the model?

A

After splitting the data and setting aside validation data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What are the three main types of predictive models to choose from?

A

Regression, Classification, and Clustering.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What do you evaluate your model on first?

A

The training data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is hyperparameter tuning?

A

Adjusting settings like number of neighbors in KNN or neurons/layers in ANN to optimize performance.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What are examples of hyperparameters in different models?

A
  • KNN: Number of neighbors
  • ANN: Number of neurons and layers
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

After tuning, what data is used to evaluate the model?

A

The testing data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Why evaluate using the test data more than once?

A

To ensure consistent and reliable performance across evaluations.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is the next step after evaluating the model?

A

Consider whether all errors are equal (evaluate error impact and cost).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly