Exam Flashcards
What is a classification problem?
A problem that requires machine learning algorithms that learn how to assign a class label to examples from the problem domain
What is a regression problem?
A problem that requires learning to predict a continuous output variable
What algorithms are used for regression problems?
- Linear Regression
- Support Vector Regression
- Regression Tree
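The three regressors above can be sketched side by side. A minimal sketch, assuming scikit-learn and NumPy are available; the toy data is made up for illustration:

```python
# Fit each of the three regression algorithms to the same toy 1-D dataset.
import numpy as np
from sklearn.linear_model import LinearRegression
from sklearn.svm import SVR                      # Support Vector Regression
from sklearn.tree import DecisionTreeRegressor   # regression tree

X = np.arange(10, dtype=float).reshape(-1, 1)    # single feature
y = 2.0 * X.ravel() + 1.0                        # continuous target

for model in (LinearRegression(), SVR(), DecisionTreeRegressor()):
    model.fit(X, y)
    print(type(model).__name__, model.predict([[5.0]]))
```

Each model learns a continuous mapping from the feature to the target, which is what makes this regression rather than classification.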
Give an example of a classification problem
Getting a machine to classify different images, e.g. distinguishing apple [1,0,0], banana [0,1,0] and cherry [0,0,1] (one-hot class labels)
What is Underfitting?
When a model cannot capture the underlying trend of the data
Why does Underfitting occur?
The algorithm is too simple to fit the data, or there is not enough data
What happens to the bias and variance in underfitting?
High bias and low variance
What is Bias?
The assumptions made by a model to make the target function easier to learn
What is Variance?
How much the model's error changes when the training data changes: a high-variance model obtains a low error on its training data but a high error when the training data is changed
How to prevent underfitting?
- Increase model complexity
- Increase the number of features (feature engineering)
- Remove noise
- Increase epochs
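"Increase model complexity" can be demonstrated with a polynomial fit. A NumPy-only sketch on made-up quadratic data: a degree-1 model underfits, a degree-2 model captures the trend:

```python
# A straight line (degree 1) underfits quadratic data; raising the model
# complexity to degree 2 captures the underlying trend.
import numpy as np

x = np.linspace(-3, 3, 50)
y = x ** 2                       # quadratic trend, no noise

def fit_error(degree):
    """Mean squared error of a polynomial fit of the given degree."""
    coeffs = np.polyfit(x, y, degree)
    return np.mean((y - np.polyval(coeffs, x)) ** 2)

print(fit_error(1), fit_error(2))  # the degree-2 error is near zero
```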
What is overfitting?
When a model starts to learn from the noise and inaccurate data entries in the training set; the model has too much freedom and builds an unrealistic model that does not generalize
What is overfitting in terms of variance and bias?
High variance and low bias
How to reduce overfitting?
- Increase training data
- Reduce model complexity
- Early stopping
- L1 & L2 regularisation
- Dropout (if a neural network)
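Early stopping from the list above can be sketched as a small loop. A pure-Python sketch; the validation-loss values and the `patience` parameter are made up for illustration:

```python
# Stop training once the validation loss has not improved for
# `patience` consecutive epochs, before the model starts to overfit.
val_losses = [0.9, 0.7, 0.6, 0.55, 0.56, 0.58, 0.60, 0.61]
patience = 2
best, wait, stopped_at = float("inf"), 0, None

for epoch, loss in enumerate(val_losses):
    if loss < best:
        best, wait = loss, 0       # improvement: reset the counter
    else:
        wait += 1                  # no improvement this epoch
        if wait >= patience:
            stopped_at = epoch     # halt before overfitting sets in
            break

print(best, stopped_at)  # -> 0.55 5
```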
What is regularisation?
The technique of calibrating machine learning models to minimise the loss and prevent overfitting or underfitting
What does noise mean?
The data points in a dataset that don’t really represent the true properties of your data
What does bias mean in terms of regularisation?
The difference between the actual and predicted values. High bias means less consideration of the data pattern, giving oversimplified, underfit models
What does variance mean in terms of regularisation?
A measure of the flexibility of the model: it decides how sensitive the model is to changes in the patterns of the input data
What happens to the training and testing error when the bias is high?
They will also be high
What happens to the training and testing error when the variance is high?
The training error will be low, but the testing error will be high
Name the two main types of regularisation techniques
Ridge and Lasso regularisation
What is Ridge regularisation?
Modifies over- or underfitted models by adding a penalty equivalent to the sum of the squares of the magnitudes of the coefficients (L2 penalty)
What is Lasso regularisation?
Modifies over- or underfitted models by adding a penalty equal to the sum of the absolute values of the coefficients (L1 penalty)
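The two penalties can be compared side by side. A sketch assuming scikit-learn is available, with made-up data where only the first feature matters; the `alpha` values are illustrative:

```python
# Both penalties shrink the coefficients relative to plain least
# squares; Lasso (L1) can drive some coefficients exactly to zero.
import numpy as np
from sklearn.linear_model import LinearRegression, Ridge, Lasso

rng = np.random.default_rng(0)
X = rng.normal(size=(100, 5))
y = 3.0 * X[:, 0] + rng.normal(scale=0.1, size=100)  # only feature 0 matters

ols = LinearRegression().fit(X, y)
ridge = Ridge(alpha=10.0).fit(X, y)   # L2: sum of squared coefficients
lasso = Lasso(alpha=0.5).fit(X, y)    # L1: sum of absolute coefficients

print(ols.coef_)
print(ridge.coef_)   # all shrunk, none exactly zero
print(lasso.coef_)   # irrelevant features driven to exactly zero
```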
What is dropout in regularisation?
Randomly selected neurons are ignored (dropped out) during training; their contribution is temporarily removed
What happens as a neural network learns?
Weights settle into their context within the network. Weights are tuned for specific features, providing some specialisation. Neighbouring neurons come to rely on these specialisations, which can result in a fragile model too specialised to the training data.
How does dropout help with overfitting?
- Neurons cannot rely on one input, as it may drop out at random; this reduces the bias caused by over-relying on one input
- Neurons will not learn redundant details of the inputs
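The mechanism above can be sketched as an inverted-dropout forward pass. A NumPy-only sketch; the function name and drop rate are illustrative:

```python
# Inverted dropout: each activation is zeroed with probability `rate`,
# and the survivors are scaled up so the expected activation is the
# same at training and inference time.
import numpy as np

def dropout(activations, rate, rng):
    if rate == 0.0:
        return activations
    keep = rng.random(activations.shape) >= rate   # random keep mask
    return activations * keep / (1.0 - rate)       # scale the survivors

rng = np.random.default_rng(0)
a = np.ones(1000)
out = dropout(a, 0.5, rng)
print((out == 0).mean())   # roughly half the neurons were dropped
```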
Concept attainment involves the following five categories:
- identify task
- nature of examples used
- validation procedure
- consequences of categorizations
- nature of imposed restriction
What is a decision tree?
A supervised learning algorithm (for regression and classification) with a tree structure of a root, nodes and branches, like a flowchart
Advantages of decision trees
- Easy to interpret
- No data preparation required
- More flexible
Disadvantages of decision trees
- Prone to overfitting
- High variance
- More costly
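A minimal decision-tree classifier, assuming scikit-learn; the features `[weight_g, length_cm]` and the data points are made up, reusing the fruit classes from the earlier example:

```python
# Train a flowchart-like tree that splits on the made-up fruit features.
from sklearn.tree import DecisionTreeClassifier

X = [[150, 7], [160, 8], [120, 20], [130, 19], [8, 2], [9, 2]]
y = ["apple", "apple", "banana", "banana", "cherry", "cherry"]

tree = DecisionTreeClassifier().fit(X, y)
print(tree.predict([[125, 18]]))   # a banana-sized, banana-shaped fruit
```

The unconstrained tree fits this tiny training set perfectly, which also illustrates the overfitting risk listed above.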