L8 - Gradient Descent and Classifier Performance Flashcards
1
Q
- What is fitting a model?
A
- The process of finding parameter values such that the model fits the data.
2
Q
- In classification, what are 2 sub-optimal approaches to fitting a model?
A
- Randomly sampling parameter values
- Grid Search
3
Q
- Through what process can we improve parameter fitting?
A
- Learning
- Minimise error and maximise fit
4
Q
- When parameter fitting, what is the goal?
A
- To minimise error of the model
- I.e. find the point of minimum error
5
Q
- When establishing the minimum error using a Generic Parameter Fitting model, when do we stop?
A
- When the error starts increasing again.
6
Q
- In Deterministic Parameter Fitting, when do we stop?
A
- After N steps, since every step is fully determined in advance.
7
Q
- Describe the Stochastic Parameter Fitting method…
A
- Pick a random point P1 and calculate its L2-norm error
- Pick a random point P2 near P1 and calculate its L2-norm error
- Repeat until the L2 norm is less than an error threshold or N steps have been taken (see the sketch after this card).
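A minimal Python sketch of this procedure. The one-parameter model y = w * x, the sampling bounds, and the step size are illustrative assumptions, as is keeping the candidate point only when it improves the error:

```python
import random

def l2_error(w, xs, ys):
    # L2 norm of the residuals for a simple one-parameter model y = w * x
    return sum((y - w * x) ** 2 for x, y in zip(xs, ys)) ** 0.5

def stochastic_fit(xs, ys, threshold=1e-3, max_steps=1000, step_size=0.1):
    # Pick a random starting point P1 and calculate its L2 norm
    w = random.uniform(-10.0, 10.0)
    err = l2_error(w, xs, ys)
    for _ in range(max_steps):
        # Stop once the error is below the threshold
        if err < threshold:
            break
        # Pick a random point P2 near P1 and calculate its L2 norm
        candidate = w + random.uniform(-step_size, step_size)
        cand_err = l2_error(candidate, xs, ys)
        # Move to P2 if it has lower error, then repeat
        if cand_err < err:
            w, err = candidate, cand_err
    return w, err

# Illustrative data drawn from y = 2x
xs, ys = [1, 2, 3, 4], [2, 4, 6, 8]
w_best, final_err = stochastic_fit(xs, ys)
```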
8
Q
- What is the purpose of the gradient descent algorithm?
A
- Iteratively find the minimum of a function or model.
- In this context, to find the minimum error of the classification model.
9
Q
- When does the gradient descent algorithm stop?
A
- When the error is 0, or in practice when the loss falls below a threshold or N steps have been taken (see card 10).
10
Q
- Describe the steps of the gradient descent algorithm
A
- Start at random point P1
- Calculate loss of P1
- Choose P2 by stepping in the direction of steepest descent, i.e. along the negative gradient
- Repeat until the loss value is below a threshold or N steps have been taken (see the sketch after this card)
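A minimal sketch of these steps in Python, assuming a single scalar parameter and a hand-supplied gradient function; the learning rate lr is an illustrative hyperparameter not named in the card:

```python
def gradient_descent(loss, grad, p1, lr=0.1, threshold=1e-6, max_steps=1000):
    # Start at point P1 (chosen at random in the card)
    p = p1
    for _ in range(max_steps):
        # Calculate the loss at the current point; stop once it is below the threshold
        if loss(p) < threshold:
            break
        # Choose the next point by stepping along the negative gradient (steepest descent)
        p = p - lr * grad(p)
    return p

# Example: minimise L(p) = (p - 3)^2, whose gradient is 2 * (p - 3); converges to p = 3
p_min = gradient_descent(lambda p: (p - 3) ** 2, lambda p: 2 * (p - 3), p1=10.0)
```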
11
Q
- Why do we want to minimise the loss function of a model?
A
- In order for the model to better fit the data
12
Q
- What is Precision? And what does it mean if a model has High Precision?
A
- Positive Predictive Value -> i.e. those labelled as positive are indeed positive
- The confusion matrix shows few False Positives
13
Q
- What is Recall? And what does it mean if a model has High Recall?
A
- Recall is sensitivity -> Out of all the actual positive instances, how many did the model correctly identify?
- Measures how well the model avoids False Negatives (FNs).
- Model is good at finding positive instances, but may not be precise.
14
Q
- What are the equations for Precision and Recall?
A
- Precision = TP / ( TP + FP )
- Recall = TP / ( TP + FN )
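These two equations transcribe directly into Python; the confusion-matrix counts in the usage example are illustrative values, not from the cards:

```python
def precision(tp, fp):
    # Precision = TP / (TP + FP): of everything labelled positive, how much is truly positive
    return tp / (tp + fp)

def recall(tp, fn):
    # Recall = TP / (TP + FN): of all actual positives, how many were found
    return tp / (tp + fn)

print(precision(tp=80, fp=20))  # 0.8
print(recall(tp=80, fn=40))     # 0.666...
```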
15
Q
- What does it mean if a model has High Recall and Low Precision?
A
- Most positive instances are found, but there are likely to be many False Positives.
16
Q
- What does it mean if a model has Low Recall and High Precision?
A
- Misses many positive instances, but the positives it does predict are likely correct.
17
Q
- What is the F1 measure?
A
- Score that combines the Precision and Recall measures into a single number.
- F1 = 2 × ( Precision × Recall ) / ( Precision + Recall ) -> the harmonic mean of the two.
18
Q
- In a classification model, what is the difference between Specificity and Sensitivity?
A
- Sensitivity -> Measures the model's ability to identify positive instances (the TP rate); the same quantity as Recall.
- Specificity -> Measures the model's ability to identify negative instances (the TN rate): Specificity = TN / ( TN + FP )
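As a companion to the precision/recall helpers above, a small sketch of both rates; the TN / (TN + FP) formula for specificity is the standard definition, spelled out here since the card only says "TN accuracy":

```python
def sensitivity(tp, fn):
    # Sensitivity (TP rate) = TP / (TP + FN) -- the same quantity as Recall
    return tp / (tp + fn)

def specificity(tn, fp):
    # Specificity (TN rate) = TN / (TN + FP) -- how well actual negatives are identified
    return tn / (tn + fp)
```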
19
Q
- What does the ROC curve show?
A
- Combines specificity and sensitivity to show the trade-off between the TP rate and the FP rate
20
Q
- What is the threshold of the ROC curve? How do we decide it?
A
- The threshold is a hyperparameter; moving it produces new TP and FP rates.
- The threshold is a design choice -> it asks: which mistake is worse, a False Positive or a False Negative?
21
Q
- Regarding TP and FP, what is the perfect classifier?
A
- A high TP rate and a low FP rate
22
Q
- What is the ideal value of Area Under Curve?
A
- As close to 1 as possible -> a higher TP rate
23
Q
- How is a ROC curve created?
A
- Iteratively move the threshold across the classifier's output scores, calculating the sensitivity and specificity at each point.
- Plot the results on a scatter graph.
- The result is the ROC curve (see the sketch after this card).
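A pure-Python sketch of this construction, assuming the classifier outputs a real-valued score per instance and that both classes appear in the labels (the example data is illustrative); the auc helper computes the metric covered in the next card:

```python
def roc_points(scores, labels):
    # Sweep the threshold over every observed score (highest first), recording
    # the FP rate (1 - specificity) and TP rate (sensitivity) at each setting.
    points = [(0.0, 0.0)]        # threshold above every score: nothing predicted positive
    pos = sum(labels)            # total actual positives = TP + FN
    neg = len(labels) - pos      # total actual negatives = TN + FP
    for t in sorted(set(scores), reverse=True):
        preds = [s >= t for s in scores]
        tp = sum(1 for p, y in zip(preds, labels) if p and y)
        fp = sum(1 for p, y in zip(preds, labels) if p and not y)
        points.append((fp / neg, tp / pos))
    return sorted(points)

def auc(points):
    # Area under the ROC curve via the trapezoidal rule
    pts = sorted(points)
    return sum((x2 - x1) * (y1 + y2) / 2
               for (x1, y1), (x2, y2) in zip(pts, pts[1:]))

scores = [0.9, 0.8, 0.7, 0.4, 0.3, 0.1]
labels = [1, 1, 0, 1, 0, 0]
print(auc(roc_points(scores, labels)))  # ~0.889
```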
24
Q
- What is the main metric that can be calculated from the ROC curve?
A
- Area Under the Curve (AUC) -> tells us the model's overall ability to identify True Positives across all thresholds.