WEEK Five Flashcards
Classifying a new case in BN
Take value for all of the attributes and introduce it as evidence- clamp the corresponding BN node to that value
Goal of a classifier
To predict an outcome class based on a set of attributes
Describe Machine Learning (ML)
Learning from experience with respect to task and performance measures
Model Training Process
Modifies overtime as you learn something- trying to build a better representation of the data
It is going to classify something for us
Model Testing Process
Makes a prediction without seeing the actual answer.
The human has access to the answer to know if the model is right or not.
Ex. Self driving cars.
Model Application
Model is used on data where the answer is not known
First we train, then we test, then we model application
What data to use for training vs testing- Holdout Method
Split the data into a training and testing set
Typically use more data for training than testing - but various splits possible
What data to use for training vs testing- Cross Validation Method
Equal size parts (folds)
We train and test on every portion of the data then calculate the average of all 4.
Calculating Accuracy & The Limitations
# of correctly classified cases/ total number of cases Does not tell us how good the model is at predicting a certain class just tells us about the overall model of performance.
Can be correct: When it predicts __ and is is __
Can be incorrect: When it predicts __ but it is actually ___
Confusion Matrix
How good is the model at making predictions Fine grain information on a performance for the class under consideration.
P REFERS TO THE MODEL UNDER PREDICTION N MEANS THE MODEL DIDN'T PREDICT IT TP - Predicts robin, it is robin TN- Predicts not robin, it is not robin FP- Predicts robin, it is a raven FN- Predicts not a robin, is a robin
Baseline Model
Predicts the most frequent outcome without taking into account any attributes- predicts the more frequent outcome
True positive rate (TPR)
True Positives/ True positives + false negatives
Good to have a high true positive rate- its predicting what it is supposed to predict. False Negatives have to be low in order to have a high TPR
Note * TPR is not TP
True Negative Rate (TNR)
True negatives / True negatives + false positives