4404 Flashcards
What is machine learning
A machine learns from experience E with respect to a class of tasks T and a performance measure P if its performance at tasks in T, as measured by P, improves with experience E.
Machine learning branches
Supervised | Unsupervised
Regression, Classification | –
Regression is for continuous data and predictions; classification is for discrete data and predictions.
Vectorization
Using vectors and matrices to represent data and feed it into an algorithm, so computations run as array operations over whole datasets rather than in explicit loops; this is called vectorization.
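A minimal sketch of the idea in NumPy, assuming a hypothetical toy dataset and weight vector; the loop and the single matrix-vector product compute the same predictions.

import numpy as np

# Hypothetical toy data: 3 examples, 2 features each.
X = np.array([[1.0, 2.0],
              [3.0, 4.0],
              [5.0, 6.0]])
w = np.array([0.5, -1.0])
b = 0.1

# Non-vectorized: one prediction at a time in a Python loop.
preds_loop = [float(x @ w + b) for x in X]

# Vectorized: one matrix-vector product covers every example at once.
preds_vec = X @ w + b

print(preds_loop)  # same three values...
print(preds_vec)   # ...computed without an explicit loop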
Cause of and solutions for underfitting
Cause:
The model is too simple to learn the underlying structure of the data; this leads to high bias.
Solutions (see the sketch after this card):
Select a more complex model
Feed better features to the learning algorithm
Reduce regularization
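A minimal sketch of the first solution, assuming hypothetical quadratic data: adding a squared feature lets an otherwise too-simple linear model capture the curve.

import numpy as np
from sklearn.linear_model import LinearRegression
from sklearn.preprocessing import PolynomialFeatures

# Hypothetical quadratic data that a straight line cannot capture.
rng = np.random.default_rng(0)
X = np.linspace(-3, 3, 100).reshape(-1, 1)
y = X.ravel() ** 2 + rng.normal(scale=0.5, size=100)

# Too-simple model: straight line, high bias.
simple = LinearRegression().fit(X, y)

# More complex model: add a squared feature so the fit can bend.
X_poly = PolynomialFeatures(degree=2, include_bias=False).fit_transform(X)
better = LinearRegression().fit(X_poly, y)

print(simple.score(X, y))        # low R^2: underfitting
print(better.score(X_poly, y))   # much higher R^2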
Cause of and solutions for overfitting
Cause:
The model is too complex relative to the size and noisiness of the training data.
Solutions (a regularization sketch follows this card):
Use a model with fewer parameters (e.g., linear instead of a high-degree polynomial)
Constrain the model through regularization
Use more training data
Fix data errors and remove outliers
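A minimal sketch of the regularization solution, assuming a hypothetical small noisy dataset: the same high-degree polynomial features, constrained by an L2 penalty, typically generalize better than the unconstrained fit.

import numpy as np
from sklearn.linear_model import LinearRegression, Ridge
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import PolynomialFeatures, StandardScaler

# Hypothetical small, noisy dataset.
rng = np.random.default_rng(1)
X = np.sort(rng.uniform(-3, 3, 30)).reshape(-1, 1)
y = np.sin(X).ravel() + rng.normal(scale=0.3, size=30)
X_tr, X_val, y_tr, y_val = train_test_split(X, y, test_size=0.3, random_state=0)

# Degree-12 polynomial with no penalty: tends to chase the noise.
overfit = make_pipeline(
    PolynomialFeatures(12, include_bias=False), StandardScaler(), LinearRegression()
).fit(X_tr, y_tr)

# Same features, but coefficients constrained by an L2 penalty.
constrained = make_pipeline(
    PolynomialFeatures(12, include_bias=False), StandardScaler(), Ridge(alpha=1.0)
).fit(X_tr, y_tr)

# The validation score is usually noticeably better for the constrained model.
print(overfit.score(X_val, y_val), constrained.score(X_val, y_val))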
What the cost (y-axis) means for Jtrain and Jval
Lower cost means fewer errors in the model's predictions; higher cost means more errors.
The relationship between Jtrain and Jval shows how well a model is generalizing to the data (a sketch computing both follows this card).
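A minimal sketch, assuming hypothetical data and using mean squared error as the cost: Jtrain is the cost on the training split and Jval is the cost on a held-out validation split.

import numpy as np
from sklearn.linear_model import LinearRegression
from sklearn.metrics import mean_squared_error
from sklearn.model_selection import train_test_split

# Hypothetical data with a known linear relationship plus noise.
rng = np.random.default_rng(2)
X = rng.normal(size=(200, 5))
y = X @ np.array([1.0, -2.0, 0.5, 0.0, 3.0]) + rng.normal(scale=0.5, size=200)

X_tr, X_val, y_tr, y_val = train_test_split(X, y, test_size=0.25, random_state=0)
model = LinearRegression().fit(X_tr, y_tr)

J_train = mean_squared_error(y_tr, model.predict(X_tr))
J_val = mean_squared_error(y_val, model.predict(X_val))

# Small gap between the two: the model generalizes well.
# J_val much larger than J_train: overfitting. Both high: underfitting.
print(J_train, J_val)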
Generalization
The ability of a model to make accurate predictions on unseen data.
How to mitigate overfitting
Increase the regularization parameter
Collect more training data
Use fewer features
Stop training early / use fewer epochs (see the early-stopping sketch after this card)
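A minimal early-stopping sketch, assuming a hypothetical dataset and scikit-learn's SGDRegressor trained one epoch at a time; training halts once the validation error stops improving for a few epochs.

import numpy as np
from sklearn.linear_model import SGDRegressor
from sklearn.metrics import mean_squared_error
from sklearn.model_selection import train_test_split

# Hypothetical data split into training and validation sets.
rng = np.random.default_rng(3)
X = rng.normal(size=(500, 10))
y = X @ rng.normal(size=10) + rng.normal(scale=0.5, size=500)
X_tr, X_val, y_tr, y_val = train_test_split(X, y, test_size=0.2, random_state=0)

model = SGDRegressor(learning_rate="constant", eta0=0.01, random_state=0)

best_val, patience, bad_epochs = np.inf, 5, 0
for epoch in range(200):
    model.partial_fit(X_tr, y_tr)  # one pass over the training data
    val_err = mean_squared_error(y_val, model.predict(X_val))
    if val_err < best_val:
        best_val, bad_epochs = val_err, 0
    else:
        bad_epochs += 1
    if bad_epochs >= patience:  # validation error stopped improving: stop training
        print(f"stopped early at epoch {epoch}, best val MSE {best_val:.3f}")
        break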
How to mitigate underfitting
Decrease the regularization parameter
Use a more complex model / more features
Regularization
Prevents overfitting by adding a penalty for model complexity.
Discourages the model from fitting the training data too closely, which would include fitting its noise and outliers.
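One common concrete form (a linear-regression cost with an L2 penalty, assuming m training examples, n parameters θ_j, and regularization strength λ):

J(\theta) = \frac{1}{2m} \sum_{i=1}^{m} \left( h_\theta(x^{(i)}) - y^{(i)} \right)^2 + \frac{\lambda}{2m} \sum_{j=1}^{n} \theta_j^2

The first term rewards fitting the training data; the second term penalizes large parameter values.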
Regularization parameter
A term (often written λ) added to the cost function used to train the model; it controls the trade-off between fitting the training data and keeping the parameters small.
A larger value penalizes large coefficients more strongly; a smaller value lets the model fit the training data more closely.
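A minimal sketch, assuming hypothetical data and scikit-learn's Ridge, where alpha plays the role of the regularization parameter: larger values shrink the learned coefficients.

import numpy as np
from sklearn.linear_model import Ridge

# Hypothetical data with known coefficients.
rng = np.random.default_rng(4)
X = rng.normal(size=(100, 5))
y = X @ np.array([4.0, -3.0, 2.0, 0.0, 1.0]) + rng.normal(scale=0.5, size=100)

# Sweep the regularization strength: larger alpha -> smaller coefficients.
for alpha in [0.01, 1.0, 100.0]:
    coefs = Ridge(alpha=alpha).fit(X, y).coef_
    print(alpha, np.round(coefs, 2))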
L1 Regularization (Lasso)
Adds a penalty proportional to the sum of the absolute values of the coefficients.
Encourages sparsity: some coefficients are driven exactly to zero, effectively selecting a simpler model with fewer features.
L2 Regularization (Ridge)
Adds a penalty proportional to the sum of the squared coefficients.
Encourages small coefficients, but does not necessarily drive any of them to zero.
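A minimal comparison sketch, assuming hypothetical data in which only two of ten features matter; the alpha values are illustrative.

import numpy as np
from sklearn.linear_model import Lasso, Ridge

# Hypothetical data where only features 0 and 3 carry signal.
rng = np.random.default_rng(5)
X = rng.normal(size=(200, 10))
true_w = np.zeros(10)
true_w[[0, 3]] = [3.0, -2.0]
y = X @ true_w + rng.normal(scale=0.1, size=200)

lasso = Lasso(alpha=0.1).fit(X, y)
ridge = Ridge(alpha=0.1).fit(X, y)

print(np.round(lasso.coef_, 2))  # L1: most coefficients driven exactly to zero
print(np.round(ridge.coef_, 2))  # L2: coefficients small but generally nonzero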
Linear Classification
The perceptron is a supervised linear binary classification algorithm that can be applied when the classes are linearly separable.
Training data are linearly separable if they can be separated by a linear decision rule.
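A minimal perceptron sketch on hypothetical linearly separable toy data; the learning rate, epoch limit, and data are illustrative.

import numpy as np

def perceptron_train(X, y, epochs=50, lr=1.0):
    """Train a perceptron; y must hold labels -1 and +1, and training
    converges only if the classes are linearly separable."""
    w = np.zeros(X.shape[1])
    b = 0.0
    for _ in range(epochs):
        errors = 0
        for x_i, y_i in zip(X, y):
            # Misclassified if the sign of the linear score disagrees with the label.
            if y_i * (x_i @ w + b) <= 0:
                w += lr * y_i * x_i  # nudge the decision boundary toward x_i
                b += lr * y_i
                errors += 1
        if errors == 0:  # every point classified correctly: converged
            break
    return w, b

# Hypothetical linearly separable toy data.
X = np.array([[2.0, 1.0], [1.0, 3.0], [-1.0, -2.0], [-2.0, -1.0]])
y = np.array([1, 1, -1, -1])
w, b = perceptron_train(X, y)
print(np.sign(X @ w + b))  # matches y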