Prediction Flashcards

Question 1

Q

Classification

Answer

A

Predicting whether y = 1 or y = 0 (for example, churn vs. no churn). The model outputs the probability that y = 1.

Question 2

Q

Unconditional Probability

Answer

A

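The probability of an event on its own, without conditioning on any previous or other event (for example, the overall share of customers who churn)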

Question 3

Q

Law of Large Numbers

Answer

A

As the number of observations increases, the sample average converges to the true expected value, so predictions become more precise
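
A minimal simulation sketch of this idea. The 25% true churn rate, the seed, and the function name `simulated_churn_rate` are illustrative assumptions, not part of the card.

```python
import random

def simulated_churn_rate(n, true_rate=0.25, seed=0):
    """Average of n simulated 0/1 churn outcomes drawn with the given true rate."""
    rng = random.Random(seed)  # fixed seed so the run is repeatable
    churned = sum(1 for _ in range(n) if rng.random() < true_rate)
    return churned / n

# The estimate tends to settle closer to the assumed 0.25 as n grows.
for n in (10, 1_000, 100_000):
    print(n, simulated_churn_rate(n))
```

With only 10 observations the estimate can be far from 0.25; with 100,000 it should sit very close to it.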

Question 4

Q

Question 5

Q

Conditional Probability

Answer

A

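The probability of an event given that another event is known, which gives a stronger prediction. Rather than only saying that 25% of customers churn, you split the churn rate by senior and non-senior customers.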
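
The split by senior status might be sketched as follows. The twelve customer records are made up for illustration and chosen so the overall churn rate is 25%.

```python
# Hypothetical records: (is_senior, churned). Invented data, not from the card.
customers = [
    (True, True), (True, True), (True, False), (True, False),
    (False, True), (False, False), (False, False), (False, False),
    (False, False), (False, False), (False, False), (False, False),
]

def churn_rate(records):
    """Share of records where churned is True."""
    return sum(1 for _, churned in records if churned) / len(records)

overall = churn_rate(customers)                                 # P(churn)
senior = churn_rate([r for r in customers if r[0]])             # P(churn | senior)
non_senior = churn_rate([r for r in customers if not r[0]])     # P(churn | not senior)

print(overall, senior, non_senior)
```

In this toy data the conditional rates differ sharply from the overall rate, which is what makes conditioning on senior status informative.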

Question 6

Q

Bayes Rule

Answer

A

P(A|B) = P(A&B)/P(B)

P(A|B) = P(B|A)P(A)/P(B)

(P(A&B) = P(A)P(B) holds only when A and B are independent, in which case P(A|B) = P(A).)
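
A small numeric check of Bayes' rule in the form P(A|B) = P(B|A)P(A)/P(B). The probabilities below are made-up values and `bayes_posterior` is a hypothetical helper name.

```python
def bayes_posterior(p_b_given_a, p_a, p_b):
    """Return P(A|B) = P(B|A) * P(A) / P(B)."""
    if p_b <= 0:
        raise ValueError("P(B) must be positive")
    return p_b_given_a * p_a / p_b

# Assumed numbers: A = customer churns, B = customer is a senior.
p_churn = 0.25               # P(A)
p_senior = 0.2               # P(B)
p_senior_given_churn = 0.4   # P(B|A)

p_churn_given_senior = bayes_posterior(p_senior_given_churn, p_churn, p_senior)

# Same answer via the definition P(A|B) = P(A&B) / P(B).
p_joint = p_senior_given_churn * p_churn  # P(A&B)
print(p_churn_given_senior, p_joint / p_senior)
```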

Question 7

Q

Threshold

Answer

A

A value between 0 and 1. An observation is classified as 1 once its predicted probability reaches this value.
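
A short sketch of applying a threshold to predicted probabilities. The default of 0.5 and the choice to count a probability exactly equal to the threshold as a 1 are assumptions; the card does not specify either.

```python
def classify(probabilities, threshold=0.5):
    """Label each probability 1 if it is at or above the threshold, else 0."""
    if not 0.0 <= threshold <= 1.0:
        raise ValueError("threshold must be between 0 and 1")
    return [1 if p >= threshold else 0 for p in probabilities]

probs = [0.10, 0.45, 0.50, 0.80]  # illustrative predicted probabilities
print(classify(probs))        # default threshold
print(classify(probs, 0.4))   # a lower threshold flags more observations as 1
```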

Question 8

Q

Sigmoid Function

Answer

A

H = θ0 + θ1 * tenure, then L(H) = 1/(1 + e^(-H))

Converts the linear score H into a predicted probability between 0 and 1; H can combine multiple variables
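
A minimal sketch of the two steps: a linear score H from tenure, then the sigmoid. The intercept and slope values are invented for illustration, not fitted coefficients.

```python
import math

def sigmoid(h):
    """L(H) = 1 / (1 + e^(-H)), written to avoid overflow for very negative H."""
    if h >= 0:
        return 1.0 / (1.0 + math.exp(-h))
    e = math.exp(h)
    return e / (1.0 + e)

def churn_probability(tenure, intercept=1.0, slope=-0.1):
    """Linear score H = intercept + slope * tenure, mapped to a probability."""
    h = intercept + slope * tenure
    return sigmoid(h)

for tenure in (0, 10, 40):
    print(tenure, churn_probability(tenure))
```

With these assumed coefficients, longer tenure lowers H and therefore lowers the predicted churn probability.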

Question 9

Q

Certainty through |H|

Answer

A

|H| > 2: okay
|H| > 5: quite sure
|H| > 10: super sure
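
To see why these cutoffs are reasonable, the sketch below converts |H| into the probability of the more likely class. The helper name `confidence` is hypothetical.

```python
import math

def confidence(h):
    """Probability of the more likely class for a score h: sigmoid(|h|)."""
    return 1.0 / (1.0 + math.exp(-abs(h)))

for h in (2, 5, 10):
    print(h, confidence(h))
```

This gives roughly 0.88 at |H| = 2, 0.993 at |H| = 5, and 0.99995 at |H| = 10.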

Question 10

Q

Scatter Plotting Continuous Features

Answer

A

A way to see how y relates to two continuous features at once. Put feature 1 on the x-axis and feature 2 on the y-axis, then use two colours to show whether each data point has y = 1 or y = 0.

Question 11

Q

Precision

Answer

A

Of the observations predicted positive, the share that are actually positive. Higher precision means fewer false positives (Fp).

Tp/(Tp + Fp)

Question 12

Q

Accuracy

Answer

A

The share of all predictions that are correct (true positives plus true negatives)

(Tp + Tn)/(all observations)

Question 13

Q

Recall

Answer

A

Of the actual positives, the share the model correctly identifies. Higher recall means fewer false negatives (Fn).

Tp/(Tp+Fn)

Question 14

Q

F1

Answer

A

Balance between precision (P) and recall (R); it is their harmonic mean

2PR/(P + R)
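
The precision, accuracy, recall, and F1 formulas can be computed together from the four confusion-matrix counts. The counts below are invented, and returning 0.0 when a denominator is zero is a convention assumed here.

```python
def safe_divide(numerator, denominator):
    """Return numerator / denominator, or 0.0 if the denominator is zero."""
    return numerator / denominator if denominator else 0.0

def classification_metrics(tp, fp, tn, fn):
    """Compute precision, recall, accuracy, and F1 from confusion counts."""
    precision = safe_divide(tp, tp + fp)
    recall = safe_divide(tp, tp + fn)
    accuracy = safe_divide(tp + tn, tp + fp + tn + fn)
    f1 = safe_divide(2 * precision * recall, precision + recall)
    return {"precision": precision, "recall": recall,
            "accuracy": accuracy, "f1": f1}

# Illustrative counts: 30 Tp, 10 Fp, 40 Tn, 20 Fn.
metrics = classification_metrics(tp=30, fp=10, tn=40, fn=20)
print(metrics)
```

Here precision is 30/40, recall is 30/50, and accuracy is 70/100, so F1 falls between precision and recall, closer to the lower of the two.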

Question 15

Q

Underfitting

Answer

A

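Poor predictions in both training and test data; the model is not flexible enough

High bias

Fix: add features or use a more flexible model (adding observations alone rarely fixes high bias)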

Question 16

Q

Overfitting

Answer

A

Poor predictions on test data despite a good fit on training data; the model is too flexible and too tailored to the training set

High variance

Fix: use fewer features, regularization (a simpler model), or more observations

Use a train/test split (for example, 70/30) to detect it
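
A minimal sketch of a 70/30 split in plain Python. The function name, the seed, and the use of shuffling before splitting are assumptions made for illustration.

```python
import random

def train_test_split(rows, train_fraction=0.7, seed=0):
    """Shuffle a copy of rows and split it into (train, test) parts."""
    shuffled = list(rows)                   # copy so the input is not modified
    random.Random(seed).shuffle(shuffled)   # fixed seed for a repeatable split
    cut = round(len(shuffled) * train_fraction)
    return shuffled[:cut], shuffled[cut:]

rows = list(range(10))  # stand-in for 10 observations
train, test = train_test_split(rows)
print(len(train), len(test))
```

A large gap between performance on `train` and on `test` is the usual sign of overfitting.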

Question 17

Q

What to do when y = 0, 1, or 2

Answer

A

Create an indicator z0 that equals 1 only if y = 0 (and 0 otherwise), then estimate Pr(z0 = 1 | X)

Do the same with z1 and z2
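
Building the indicators z0, z1, and z2 might look like the sketch below. Picking the class with the largest estimated probability is a common final step that is assumed here rather than stated on the card, and both function names are hypothetical.

```python
def one_vs_rest_targets(y, classes=(0, 1, 2)):
    """For each class k, build z_k with 1 where y == k and 0 elsewhere."""
    return {k: [1 if label == k else 0 for label in y] for k in classes}

def predict_class(class_probabilities):
    """Pick the class k whose estimated Pr(z_k = 1 | X) is largest (assumed rule)."""
    return max(class_probabilities, key=class_probabilities.get)

y = [0, 2, 1, 0]
targets = one_vs_rest_targets(y)
print(targets)

# Illustrative probabilities from three separate binary models for one observation.
print(predict_class({0: 0.2, 1: 0.7, 2: 0.4}))
```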

Question 18

Q

Supervised Learning

Answer

A

Predict a labelled attribute y from features

Examples: classification (logistic regression)

Home sale price (regression: the conditional mean of y at each feature value traces out the line)

Question 19

Q

Unsupervised Learning

Answer

A

No labelled target attribute to predict

Group observations into clusters based on similar features

Examples: market segments, similar images, news article types

Question 20

Q

Anomaly Detection

Answer

A

Evaluate whether certain data points are anomalous

Examples: fraud detection, defect detection

Question 21

Q

Reinforcement Learning

Answer

A

Takes actions, generates data from them, and updates the algorithm

Trade-off between exploiting and exploring

Example: recommendation systems

Question 22

Q

Machine Learning Benefits and Types

Answer

A

More precise because it can use more observations, more flexible functional forms, and more variables

Predicts first, then decides based on the prediction

Supervised Learning
Unsupervised Learning
Anomaly Detection
Reinforcement Learning

Question 23

Q

External Validity

Answer

A

A model trained in context A can also be used in context B

Likely higher when the two data contexts are similar