Lecture 6 - Machine Learning Flashcards
What is the training process in machine learning?
The training process is an iterative optimization problem. This leaves us with the “best” model we can get.
How do we measure a classification models performance?
Accuracy
Precision
Recall
F-1 Score
How do we measure a linear regression models performance?
Mean Squared Error (MSE)
R2 score
What is accuracy?
Correct / total
What are some weaknesses of accuracy?
Does not discriminate between type I and type II errors, does not handle class imbalance
What is precision?
True positives / Truepositives + Falsepositives
It is the fraction of relevant instances among the retrieved instances.
What is recall?
True positiveness / truepositive + falsenegative
Recall is the fraction of relevant instances that have been retrieved over the total amount of relevant instances.
What is F-1 Score?
2 (( Precision * recall) / (precision + Recall)
Pros: It’s a single number that captures a lot of detail Cons: It’s a single number that captures a lot of detail, so some is lost or “glossed over”
What is a Type I error?
A false positive
What is a Type II error?
A false negative
What is overfitting?
Abstractly, overfitting is when you come close to “memorizing” your data set. It means you perform very well on the training data set, but your model doesn’t generalize well to unseen data