Lecture 1 Flashcards
Statistical vs machine learning
Statistical learning focuses on inferences while machine learning focuses on prediction.
Prediction
Predictions: Ability to predict what the responses are going to be to future input variables
Supervised
Characterized by the presence of “answers” in the data set which are utilized to supervise the algorithm
Training vs Testing
Training: Will be the “Old data” used to fit data
Testing: Is strictly used to evaluate performance
Inference
Gain insights about underlying data generation process, relationships between variables, or the impact of certain factors.
To draw conclusions or make statements about the population or data generating mechanism
unsupervised
Attempts to learn relationships and patterns from data that are not labeled in any way.