Regression Flashcards
Y = f(X) + e
Y = numeric output
e = error term
f = systematic information X provides about Y
Goal: find an f^(x) that approximates the true function f(x) as well as possible
MSE
Represents averaged squared difference of a prediction ^y = ^f(x) from its true value y.
MSE= E[(y-6f(x))^2]
How to minimize reducible error
Using a more sophisticated or appropriate model.
Collecting more relevant data or improving data quality.
Tuning model hyperparameters for better performance.
Bias
High bias can cause an algorithm to miss the relevant relations between features and target outputs (underfitting)
Variance
High variance may result from an algorithm modeling the random noise in the training data (overfitting)