model evaluation Flashcards
What is the requirement for a goodness metric for backpropagation to work?
has to be differentiable
How do you define the goodness of your model?
Ability to generalize to unseen data
What can affect generalizability?
- algorithm
- hyperparameter values
- training data
- random initialization (can affect accuracy)
What are 4 methods of quantifying generalization goodness?
1) accuracy, precision, recall
2) mean absolute error
3) RMSE
4) area under ROC curve
Come up with a metric for Predicting lung cancer from chest x-rays
Recall, precision, FN/P
Come up with a metric for predicting high school GPA
MAE
Come up with a metric for evaluating search engine results
recall
Come up with a metric for predicting the location of an object in 3D space
euclidean/cosine distance
Come up with a metric for predicting if twitter user is a liberal or conservative
AUC
What is sensitivity?
True positive rate, also recall
What is specificity?
True negative rate
What is precision?
A measure of exactness. The percentage of tuples labeled as positive and are actually such
What is the F-measure?
Harmonic mean of precision and recall, giving equal weight to each
What metrics can you use for regression?
MAE, RMSE
Why do we take the square of metrics?
Taking the square results in steeper gradients when searching and gives higher penalities. RMSE accentuates the impact of outliers. Otherwise might take too long t