Module 2 Flashcards

1
Q

In __________, labeled training data refers to a dataset that includes both the input data and the corresponding correct output.

A

Supervised Learning

2
Q

This refers to data with both input data and a corresponding correct output.

A

Labeled Training Data

3
Q

______ is used to train a machine learning model to make predictions or decisions without being explicitly programmed.

A

Labeled Data

4
Q

The primary objective of ________ is to find a function mapping the input variable to the output variable.

A

Supervised Learning

5
Q

What are the two categories under Supervised Learning?

A

Regression (Prediction) and Classification (Description)

6
Q

This category of Supervised Learning refers to algorithms that address classification problems where the output variable is categorical.

A

Classification

7
Q

This category of Supervised Learning predicts one of the possible class labels.

A

Classification

8
Q

What are some types of Classification?

A
  • Binary Classification - classification of two classes.
  • Multi-class Classification - classification of three or more classes.
9
Q

What are some examples of Classification algorithms?

A
  • Random Forest Algorithm
  • Decision Tree Algorithm
  • Logistic Regression Algorithm
  • Support Vector Machine Algorithm
10
Q

This category of Supervised Learning handles regression problems where the input and output variables have a linear relationship.

A

Regression

11
Q

This category of Supervised Learning predicts continuous numbers (real numbers).

A

Regression

12
Q

What are some examples of Regression algorithms?

A
  • Simple Linear Regression Algorithm
  • Multivariate Regression Algorithm
  • Decision Tree Algorithm
  • Lasso Regression
13
Q

True or False.
Supervised ML has three phases: the usual training and validation, data prediction, and deployment.

A

False
(TWO phases only. The usual training and validation, followed by prediction.)

14
Q

True or False.
Model complexity is loosely tied to the variation of inputs contained within the training dataset.

A

False.
(it is INTIMATELY tied to the variation of inputs)

15
Q

True or False.
Regarding model complexity, the larger the variety of data points the data set contains, the more complex a model can be used without overfitting.

A

True.

16
Q

True or False.
Collecting more data points will yield more variety, so that larger datasets allow for building more complex models.

A

True.

17
Q

True or False.
Duplicating similar data points or collecting very similar data is usually helpful.

A

False.

18
Q

True or False.
In supervised learning, it is important to build a model on the training data and then be able to make accurate predictions on previously observed data.

A

False.
(make accurate predictions on NEW, UNSEEN data that has the SAME CHARACTERISTICS as the training set that we used.)

19
Q

If a model is able to make accurate predictions on unseen data, we say it is able to _________ from the training set to the test set.

A

Generalize

20
Q

This occurs when a model learns the training data too well, including its noise and outliers.

A

Overfitting

21
Q

_______ occurs when you fit a model too closely to the particularities of the training set and obtain a model that works well on the training set but is not able to generalize to new data.

A

Overfitting

22
Q

True or False.
An overfitted model performs exceptionally well on training data but poorly on new, unseen data.

A

True

23
Q

Choosing a model that is too simple is called “______”.

A

Underfitting

24
Q

This occurs when your model is too simple: you might not be able to capture all the aspects of and variability in the data, and your model will do badly even on the training set.

A

Underfitting

25
Q

True or False.
An underfitted model performs poorly on the training data but excels on new, unseen data.

A

False.
(underfitted models perform poorly on both training and new data)

26
Q

True or False.
The more complex the model, the better we can fit the training data.

A

True.

27
Q

True or False.
The most complex models are almost always the most optimal choice for predictions.

A

False.
(overly complex models focus too much on each individual point in the training set)

28
Q

These are errors from wrong or overly simple assumptions in the learning algorithm.

A

Bias

29
Q

These are errors resulting from sensitivity to the noise / fluctuations in the training data.

A

Variance

30
Q

This is arguably the simplest machine learning algorithm.

A

k-NN (k-Nearest Neighbor) Algorithm

31
Q

Building this model consists only of storing the training dataset.

A

k-NN (k-Nearest Neighbor) Algorithm

32
Q

To make a prediction for a new data point, the algorithm finds the closest data points in the training dataset.

A

k-NN (k-Nearest Neighbor) Algorithm

33
Q

True or False.
In its simplest version, the k-NN algorithm only considers exactly two nearest neighbors, the closest training data points to the point we want to make a prediction for.

A

False.
(exactly ONE nearest neighbor)

34
Q

In k-NN Algorithm, _______ is used to assign a label when considering more than one neighbor.

A

Voting

35
Q

What is the code for importing the k-NN Classifier?

A

from sklearn.neighbors import KNeighborsClassifier

36
Q

What is the code for creating an instance of the k-NN Classifier?

A

variable_name = KNeighborsClassifier(n_neighbors=x)
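Putting the import and instantiation cards together, a minimal runnable sketch; the toy dataset, the variable names, and k = 3 are made up here for illustration:

```python
from sklearn.neighbors import KNeighborsClassifier

# Toy training set (illustrative only): two clusters of 2-D points
X_train = [[0, 0], [1, 1], [2, 2], [9, 9], [10, 10], [11, 11]]
y_train = [0, 0, 0, 1, 1, 1]

# Create an instance with k = 3 neighbors
knn = KNeighborsClassifier(n_neighbors=3)

# "Building" the model consists only of storing the training dataset
knn.fit(X_train, y_train)

# The 3 closest training points vote on the label of each new point
print(knn.predict([[1, 2], [10, 10]]))  # [0 1]
```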

37
Q

In the k-NN Classifier, as the number of neighbors (k) increases, the model becomes _______.

A

Less complex

38
Q

In the regression variant of k-NN, when using multiple neighbors, the prediction of the model is the ____ or ______ of the relevant neighbors.

A

Average; Mean
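A hedged sketch of the regression variant, using toy 1-D data invented for illustration:

```python
from sklearn.neighbors import KNeighborsRegressor

# Toy regression data (illustrative only)
X_train = [[1], [2], [3], [10], [11], [12]]
y_train = [1.0, 2.0, 3.0, 10.0, 11.0, 12.0]

# With multiple neighbors, the prediction is the mean of their targets
reg = KNeighborsRegressor(n_neighbors=3)
reg.fit(X_train, y_train)

# The 3 nearest neighbors of x = 2 are x = 1, 2, 3, so the
# prediction is the average of their targets: (1.0 + 2.0 + 3.0) / 3
print(reg.predict([[2]]))  # [2.]
```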

39
Q

The R-squared score (R²) is also known as the _________.

A

Coefficient of Determination

40
Q

This is a measure of goodness of a prediction for a regression model.

A

R-squared Score (R²)
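The R² score can be computed with scikit-learn's `r2_score`; the numbers below are invented just to show the two reference points of the scale:

```python
from sklearn.metrics import r2_score

y_true = [1.0, 2.0, 3.0, 4.0]

# Perfect predictions give the maximum score of 1.0
print(r2_score(y_true, [1.0, 2.0, 3.0, 4.0]))  # 1.0

# Always predicting the mean of y_true gives 0.0
print(r2_score(y_true, [2.5, 2.5, 2.5, 2.5]))  # 0.0
```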

41
Q

What are the two important parameters of the KNeighborsClassifier?

A
  • Number of neighbors
  • How you measure the distance between data points (Euclidean distance is used by default)
42
Q

What are the strengths of the KNeighbors Classifier?

A
  • Easy to Understand
  • Works well without any special adjustments
  • Suitable as a first model to try
43
Q

What are the weaknesses of the KNeighbors Classifier?

A
  • If the number of features or samples is large, the prediction is slow and data preprocessing is important
  • Does not work well with sparse datasets.
44
Q

These models generate a formula to create a best-fit line to predict unknown values.

A

Linear Models

45
Q

These models make a prediction using a linear function of the input features.

A

Linear Models

46
Q

True or False.
Linear models are called linear because they assume that there is a linear relationship between the outcome variable and each of its predictors.

A

True

47
Q

This is the algorithm for solving regression problems in linear models.

A

Linear Regression

48
Q

What is the final output of linear regression models?

A

Numeric values (Numeric predictions)

49
Q

This linear model is used for classification problems.

A

Logistic Regression

50
Q

Linear regression is also known as “_________”

A

Ordinary Least Squares (OLS)

51
Q

This is the simplest and most classic linear method for regression.

A

Linear Regression

52
Q

This model finds the parameters w and b that minimize the mean squared error between predictions and the true regression targets, y, on the training set.

A

Linear Regression

53
Q

This is the mean of the squared differences between the predictions and the true values.

A

Mean Squared Error (MSE)
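A minimal sketch tying the two cards above together; the noise-free toy data (y = 2x + 1) is made up for illustration, so OLS recovers w and b almost exactly and the MSE is essentially zero:

```python
import numpy as np
from sklearn.linear_model import LinearRegression

# Noise-free toy data following y = 2x + 1 (illustrative only)
X = np.array([[1.0], [2.0], [3.0], [4.0]])
y = 2 * X.ravel() + 1

# OLS finds the w and b that minimize the mean squared error
# between predictions and the true targets on the training set
lr = LinearRegression().fit(X, y)
w, b = lr.coef_[0], lr.intercept_   # w ≈ 2.0, b ≈ 1.0

# MSE: the mean of the squared differences between predictions
# and true values (essentially 0 here, since the data is noise-free)
mse = np.mean((lr.predict(X) - y) ** 2)
```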

54
Q

It is one of the most commonly used alternatives to standard linear regression.

A

Ridge Regression

55
Q

In this model, the coefficients (w) are chosen not only so that they can predict well on the training data, but also to fit an additional constraint.

A

Ridge Regression

56
Q

In this model, the magnitude of the coefficients should be as small as possible; in other words, all entries of w should be close to zero (approaching zero, but not exactly zero).

A

Ridge Regression

57
Q

Each feature should have as little effect on the outcome as possible (which translates to having a small slope), while still predicting well. This constraint is an example of what is called _________.

A

Regularization

58
Q

________ means explicitly restricting a model to avoid overfitting.

A

Regularization

59
Q

The particular kind of regularization used by ridge regression is known as “________”.

A

L2 Regularization

60
Q

True or False.
In ridge regression, if α is smaller, the penalty becomes smaller and w should be smaller.

A

False.
(If α is BIGGER, the penalty becomes BIGGER and w should be SMALLER)
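The α-to-penalty relationship from this card can be checked directly with scikit-learn's `Ridge`; the random toy data and the two α values are invented for illustration:

```python
import numpy as np
from sklearn.linear_model import Ridge

# Toy data with slope ~3 plus a little noise (illustrative only)
rng = np.random.RandomState(0)
X = rng.rand(20, 1)
y = 3 * X.ravel() + 0.1 * rng.rand(20)

# Bigger alpha -> bigger penalty -> entries of w pushed closer to zero
loose = Ridge(alpha=0.01).fit(X, y)
strict = Ridge(alpha=100.0).fit(X, y)

print(abs(loose.coef_[0]) > abs(strict.coef_[0]))  # True
```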

61
Q

It is an alternative to Ridge for regularizing linear regression.

A

Lasso Regression

62
Q

The particular kind of regularization used by lasso regression is known as “_______”.

A

L1 Regularization

63
Q

True or False.
In L1 Regularization, coefficients can reach zero which means that certain features are entirely ignored by the model.

A

True
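This zeroing-out behaviour can be observed with scikit-learn's `Lasso`; the toy data (only the first of three features matters) and the α value are chosen here purely for illustration:

```python
import numpy as np
from sklearn.linear_model import Lasso

# Toy data: the target depends only on the first feature (illustrative)
rng = np.random.RandomState(0)
X = rng.rand(50, 3)
y = 5 * X[:, 0]

# The L1 penalty can drive the coefficients of irrelevant features
# exactly to zero, so those features are entirely ignored
lasso = Lasso(alpha=0.1).fit(X, y)
print(lasso.coef_)  # with this toy data, only the first entry is nonzero
```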

64
Q

True or False.
Lasso is generally preferred over Ridge because L1 penalty is preferred over L2 penalty.

A

False.
(Ridge is generally preferred over Lasso; the L2 penalty is preferred over the L1 penalty.)