Probability and Stats Flashcards
Law of Large Numbers
As the sample size grows, the sample mean converges to the population mean
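A minimal simulation of this, using fair coin flips (population mean 0.5); the numbers and seed are illustrative:

```python
import random

random.seed(0)

# Fair coin: the population mean of the "heads" indicator is 0.5.
# Larger samples give sample means closer to 0.5.
def sample_mean(n):
    flips = [random.random() < 0.5 for _ in range(n)]
    return sum(flips) / n

small = sample_mean(100)
large = sample_mean(100_000)
```

With a large `n`, `large` sits within a fraction of a percent of 0.5, while `small` wanders further.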
Bayes Rule
P(A|B) = P(B|A) P(A)/P(B)
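A worked example with made-up numbers (a disease-test scenario, using the law of total probability for P(B)):

```python
# All numbers here are illustrative, not real test statistics.
p_disease = 0.01              # P(A): prior probability of disease
p_pos_given_disease = 0.95    # P(B|A): test sensitivity
p_pos_given_healthy = 0.05    # false positive rate

# P(B) via total probability
p_pos = (p_pos_given_disease * p_disease
         + p_pos_given_healthy * (1 - p_disease))

# Bayes rule: P(A|B) = P(B|A) P(A) / P(B)
p_disease_given_pos = p_pos_given_disease * p_disease / p_pos
```

Despite the 95% sensitivity, the posterior is only about 16% because the prior is low.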
Probability vs Likelihood
Probability = chance of an outcome given fixed parameters,
e.g., P(X > 32 | mu, std_dev)
Likelihood = how well candidate params/distributions explain fixed observation(s); used to find the best ones
e.g., L(mu=sth, std=sth|X)
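A sketch of the likelihood view: the data are fixed and we score candidate Gaussian parameters (data values here are made up):

```python
import math

def gaussian_log_likelihood(data, mu, sigma):
    # log L(mu, sigma | data): sum of log-densities of each fixed observation
    return sum(
        -0.5 * math.log(2 * math.pi * sigma ** 2)
        - (x - mu) ** 2 / (2 * sigma ** 2)
        for x in data
    )

data = [4.8, 5.1, 5.0, 5.3, 4.9]   # fixed observations (illustrative)
good = gaussian_log_likelihood(data, mu=5.0, sigma=0.2)
bad = gaussian_log_likelihood(data, mu=3.0, sigma=0.2)
```

Parameters near the data's center score higher, which is exactly what maximum likelihood exploits.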
Bayes Nets
DAG + variables conditioned on others (local conditional probabilities)
e.g., rain -> cricket -> traffic
Bayes Error
Lowest achievable error rate for any classifier on a given problem; irreducible because the class distributions overlap
e.g., if two classes have overlapping feature distributions, even the optimal classifier misclassifies points in the overlap region
Markov Decision Process
States, actions, transition probabilities, and rewards; the next state depends only on the current state and action (Markov property)
Hidden Markov Models
Hidden State (X) = Markov Process
Observable (Y) = only depends on current state of X
Accounts for temporal relations between hidden states and how those states emit observations
e.g., Language modeling, Y = word, X = part of speech
Tall player fell
Hidden state chain (transitions): P(adj, noun, verb) = P(adj) P(noun | adj) P(verb | noun)
Emissions: P(“Tall player fell” | adj, noun, verb) = P(Tall | adj) P(player | noun) P(fell | verb); the joint is the product of the two
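The transition × emission product can be computed directly; all probabilities below are made-up toy values:

```python
# Toy transition probabilities for adj -> noun -> verb ("<s>" marks the start)
transition = {("<s>", "adj"): 0.3, ("adj", "noun"): 0.8, ("noun", "verb"): 0.4}
# Toy emission probabilities of each word given its tag
emission = {("adj", "Tall"): 0.1, ("noun", "player"): 0.05, ("verb", "fell"): 0.02}

tags = ["adj", "noun", "verb"]
words = ["Tall", "player", "fell"]

p = 1.0
prev = "<s>"
for tag, word in zip(tags, words):
    # joint = product of P(tag | prev_tag) * P(word | tag)
    p *= transition[(prev, tag)] * emission[(tag, word)]
    prev = tag
```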
Full joint distribution from Bayes Net
P(x1, x2, … xn) = prod_i P(x_i | parents(x_i))
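Applying this factorization to the rain -> cricket -> traffic chain above, with made-up conditional probability tables:

```python
# Illustrative CPTs for the chain rain -> cricket -> traffic
p_rain = {True: 0.2, False: 0.8}
p_cricket_given_rain = {True: {True: 0.1, False: 0.9},
                        False: {True: 0.6, False: 0.4}}
p_traffic_given_cricket = {True: {True: 0.7, False: 0.3},
                           False: {True: 0.3, False: 0.7}}

def joint(rain, cricket, traffic):
    # P(rain, cricket, traffic) = P(rain) P(cricket | rain) P(traffic | cricket)
    return (p_rain[rain]
            * p_cricket_given_rain[rain][cricket]
            * p_traffic_given_cricket[cricket][traffic])

# Sanity check: the joint over all assignments sums to 1
total = sum(joint(r, c, t)
            for r in (True, False)
            for c in (True, False)
            for t in (True, False))
```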
Decoding in HMM
Find the most probable sequence of hidden states, given a sequence of observations
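This is the Viterbi algorithm; a compact sketch, with toy tagger probabilities (all numbers invented) matching the part-of-speech example above:

```python
def viterbi(observations, states, start_p, trans_p, emit_p):
    """Most probable hidden-state sequence for the given observations."""
    # best[t][s] = (probability of best path ending in state s at time t, backpointer)
    best = [{s: (start_p[s] * emit_p[s][observations[0]], None) for s in states}]
    for obs in observations[1:]:
        row = {}
        for s in states:
            prob, prev = max(
                (best[-1][p][0] * trans_p[p][s] * emit_p[s][obs], p)
                for p in states
            )
            row[s] = (prob, prev)
        best.append(row)
    # Backtrack from the best final state along the stored backpointers
    state = max(states, key=lambda s: best[-1][s][0])
    path = [state]
    for row in reversed(best[1:]):
        state = row[state][1]
        path.append(state)
    return path[::-1]

states = ["adj", "noun", "verb"]
start_p = {"adj": 0.4, "noun": 0.5, "verb": 0.1}
trans_p = {"adj": {"adj": 0.1, "noun": 0.8, "verb": 0.1},
           "noun": {"adj": 0.1, "noun": 0.3, "verb": 0.6},
           "verb": {"adj": 0.3, "noun": 0.5, "verb": 0.2}}
emit_p = {"adj": {"Tall": 0.6, "player": 0.1, "fell": 0.3},
          "noun": {"Tall": 0.1, "player": 0.8, "fell": 0.1},
          "verb": {"Tall": 0.1, "player": 0.1, "fell": 0.8}}
tags = viterbi(["Tall", "player", "fell"], states, start_p, trans_p, emit_p)
```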
Confidence Interval
Collect sample means e.g., using bootstrapping (sampling with replacement)
Take the interval covering the middle 95% (typical value) of those sample means
That’s the CI
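The steps above as a percentile bootstrap, on made-up data:

```python
import random

random.seed(0)

def bootstrap_ci(sample, n_resamples=10_000, level=0.95):
    # Resample with replacement, collect each resample's mean,
    # then take the central `level` fraction of those means.
    means = sorted(
        sum(random.choices(sample, k=len(sample))) / len(sample)
        for _ in range(n_resamples)
    )
    lo_idx = int((1 - level) / 2 * n_resamples)
    hi_idx = int((1 + level) / 2 * n_resamples) - 1
    return means[lo_idx], means[hi_idx]

data = [2.1, 2.5, 1.9, 2.8, 2.3, 2.6, 2.0, 2.4]  # illustrative sample
low, high = bootstrap_ci(data)
```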
p-value
H0 = no difference; H1 = different
If p < alpha, then reject H0
alpha = acceptable False Positive Rate
i.e., chances of saying there is a difference, even though there is not in reality
cons: easy to misinterpret; p is not the probability that H0 is true, and p-values are not well calibrated as a measure of evidence
Student’s t-test
t = (x_bar - mu) / (estimate_of_population_std_dev/sqrt(n))
Test: if the calculated t exceeds the critical t value for the given confidence level (1 - alpha) and degrees of freedom (n - 1), reject H_0
Cons: the two-sample version assumes equal variances of both populations
Pros: works for smaller sample sizes; the t-distribution approaches the normal distribution as the sample size grows
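The one-sample t formula above, computed directly on invented data:

```python
import math

def one_sample_t(sample, mu0):
    n = len(sample)
    x_bar = sum(sample) / n
    # Bessel-corrected sample std dev estimates the population std dev
    s = math.sqrt(sum((x - x_bar) ** 2 for x in sample) / (n - 1))
    # t = (x_bar - mu) / (s / sqrt(n))
    return (x_bar - mu0) / (s / math.sqrt(n))

t = one_sample_t([5.1, 4.9, 5.3, 5.2, 5.0], mu0=5.0)
```

The resulting t (about 1.41) would then be compared against the critical value for n - 1 = 4 degrees of freedom.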
Welch’s t-test
Doesn’t assume equal variances. Assumes normal distributions, just like Student’s t-test
Central Limit Theorem
The distribution of sample means approaches a normal distribution as the sample size grows, regardless of the population’s distribution
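A quick simulation of the theorem: sample means of a decidedly non-normal (uniform) population behave like a normal distribution (sizes and seed are arbitrary):

```python
import random
import statistics

random.seed(0)

# Many samples of size n from Uniform(0, 1); collect each sample's mean.
n, reps = 50, 5_000
means = [statistics.fmean([random.random() for _ in range(n)]) for _ in range(reps)]

center = statistics.fmean(means)
spread = statistics.stdev(means)
# For a normal distribution, ~95% of values fall within 1.96 standard deviations.
within = sum(abs(m - center) < 1.96 * spread for m in means) / reps
```

`center` lands near the population mean 0.5, and `within` lands near 0.95, as the normal approximation predicts.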
Binomial distribution
Pr(x | n, p) = C(n, x) p^x (1 - p) ^ (n - x)
x = # of successes (e.g., prefers orange fanta) out of n, given success probability p
p^x (1 - p)^(n - x) = probability of one specific configuration with exactly x successes,
C(n, x) = total number of configurations in which x successes out of n are observed
e.g., orange vs grape fanta
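The PMF above computed directly, with an invented fanta-preference scenario:

```python
import math

def binomial_pmf(x, n, p):
    # C(n, x) counts the configurations; p^x (1-p)^(n-x) is each one's probability
    return math.comb(n, x) * p ** x * (1 - p) ** (n - x)

# e.g., 10 tasters, each prefers orange fanta with probability 0.6 (made up):
prob_7 = binomial_pmf(7, 10, 0.6)

# Sanity check: probabilities over all possible success counts sum to 1
total = sum(binomial_pmf(x, 10, 0.6) for x in range(11))
```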