7.1 - Types of Learning and Overfitting Problems Flashcards
_______ _________ uses labeled training data to guide the ML program toward superior forecasting accuracy.
Supervised learning
Typical tasks for supervised learning include _________ and __________.
classification ; regression
If the target variable is ___________, the model involved is a regression model.
continuous
Multiple regression is an example of ________ learning.
supervised
Classification models are used in cases where the target variable is ________ or _________ (e.g., ranking).
categorical ; ordinal
Algorithms can be designed for __________ classification (e.g., classifying companies as likely to default vs. not likely to default) or _____________ classification (e.g., a ratings class for bonds).
binary ; multicategory
In ____________ learning, the ML program is not given labeled training data; instead, inputs (i.e., _________) are provided without any _____________ about those inputs.
unsupervised ; features ; conclusions
___________ is an example of an unsupervised ML program.
Clustering
In __________ learning, in the absence of any ________ variable, the program seeks out structure or interrelationships in the data.
unsupervised ; target
_______ _________ algorithms are used for complex tasks such as facial (image) recognition, natural language processing, etc.
Deep learning
Programs that learn from their own prediction errors are called _________ ____________ algorithms.
reinforced learning
Deep learning algorithms and reinforced learning algorithms are based on ________ ________ , a group of ML algorithms applied to problems with significant _____________ .
neural networks; nonlinearities
__________ is an issue with supervised ML that results when a large number of ________ (i.e., independent variables) are included in the data sample.
Overfitting; features
In overfitting, ___________ is misperceived to be a pattern, resulting in high __-_____ __-________.
randomness; in-sample R-squared
Overfitting has occurred when the noise in the ________ ________ seems to improve the model fit.
target variables
Overfit models do not _________ ________ to new data (i.e., ____-___-____ ___-______ will be low).
generalize well ; out-of-sample R-squared
You “train” the model using __-_____ data.
in-sample
When a model _______ ______, it means that the model retains its explanatory power when it is applied to new, (i.e., ___-__-_____) data.
generalizes well; out-of-sample