General ML Knowledge Flashcards

1
Q

Linear Model

A

A model that is specified by a linear combination of features. In other words, the outcome is dependent on the sum of inputs (predictors) and parameters (Betas)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Maximum Likelihood Estimation

A

A method of estimating the parameters of a model such that they maximize the likelihood that it produced that result.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

F1 Score

A

Harmonic Mean of precision and recall

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Macro Average

A

Simple average of accuracy/precision/recall or roc/auc

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Micro Average

A

Weighted average of accuracy/precision/recall or roc/auc

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Precision Formula

A

TP/(TP+FP)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Recall

A

TP/(TP+FN)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Bias

A

Refers to model accuracy – given a single input, how off it is from true prediction

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Variance

A

Refers to model robustness – given single input what is the range of outputs that the model could predict.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Boosting

A

Sequential Learning where a series of weak learners are put together to make a strong learner.

Sampling without replacement. Any mis-classified data flows down to subsequent learners to be reclassified with additional weights.

Can be prone to overfitting

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Bagging

A

Learning through sampling with replacement.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Underfitting

A

High bias low variance. The model predicts consistently but incorrectly

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Overfitting

A

Low bias high variance – The model is very sensitive to noise in data and as such predicts inconsistently.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Generative Models

A

P(X|Y) * P(Y) – models that learn the joint probability distribution.

eg. Naive Bayes

as opposed to Discriminative models that look at P(Y|X)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Discriminative Models

A

Models that look at P(Y|X) – model directly from trianing data that looks at the most likely class. Logistic regression is an example of this.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Bias (Formula)

A

Difference between prediction and actuals