Supervised Learning Flashcards
1
Q
- What is being learned through model training and what is it learning from.
A
- Patterns, relationships, weights, parameters rules and functions which consists of features and labels. Learned from internet or any source of training data.
2
Q
- What is a prediction algorithm?
A
- Input to output
3
Q
- What are the different ways to judge the quality of a trained model?
A
Performance metrics (Confusion matrix, ROC and AUC curves, misfit)
4
Q
- Predictions on the training and the test datasets are called in sample and out-of-sample tests. Explain why both have a role in constructing a good model.
A
- In-sample – how good at calibrating
- Out-of-sample – how good at generalising
5
Q
- What trade-offs to think about when dividing our dataset into training and testing sets? Can this be done in more than one way?
A
- Training vs testing set size
- Yes, many ways such as random, time based or balanced
6
Q
- What factors to consider when selecting features for model training?
A
- How good the predictive skill on training set
7
Q
- What factors we consider when selecting a model to learn from the data?
A
- problem type, accuracy requirements, interpretability, complexity, scalability, feature space, and domain knowledge.
8
Q
- Supervision implies that the skilled operator is providing direction to the learning algorithm. What is this direction?
A
- Label data
- Defining objective
- Model performance
- Tuning algorithms
- Selecting features
9
Q
- Can the same structure be used to predict a binary label/target
A
Yes