Statistics Flashcards

1
Q

Probability vs Likelihood

A

Probability:
What is the chance of something occurring given a sample distribution of data
ex. P(height > 170 | mu = 170, sigma = 3.5)

Likelihood:
What is the best distribution of data given a value
ex. P(mu = 170, sigma = 3.5 | height > 170)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are some examples of non-normal distributions?

A

Uniform: All values equally likely
Exponential: Number of coin flips before heads
Skewed: Long-tailed normal distribution

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is a p-value?

A

P-value:
The probability that the event you observed occurred by random chance

Statistical significance:
When an event did not occur by random chance alone

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

How many ways can you split 10 players in two teams to play 5-a-side football? (Order of teams does not matter)

A

= (10 choose 5) / 2
= 252 / 2
= 126

252 if order matters
126 if order does not matter

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is Bayes theorem? Show formula.

A

Formula for computing the conditional probability.

P(A|B) = P(B|A)P(A) / P(B)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Steps of significance testing

A
  1. Define Null and Alternative Hypothesis
  2. Set threshold (alpha)
  3. Calculate p-value
  4. If p-value < threshold. Reject the null (statistically significant)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Combination vs. Permutation? Show formula’s.

A

Combination. Choose from set of values, order does not matter.
nCm = n! / (n!(n-m)!)

Permutation. Choose from set of values, order matters.
= n! / (n-m)!

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is the central limit theorem?

A

If you take a sufficiently large sample with replacement and take a sampled statistic, and repeat, the distribution of the sample statistic will be normally distributed.

ML relationship:
- Confidence intervals, Comparing models to one another (t-tests, Annova)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What are Z-tests, T-tests and ANOVA tests? When would you use each one?

A

Tests for statistical significance.

T-test (almost always this):
- Normal distribution where we don’t know the standard deviation
- Random sample (<30 points)

Z-test:
- If you know the standard deviation
- Population (>100 points)
- This assumes the variance is identical to any other sample taken from the same population

ANOVA (analysis of variance)
- More than 2 distributions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What are the Harmonic and Geometric means?

A

Harmonic: Average of rates (speeds).
= n / ((1/x1) + (1/x2) + (1/x3))

Geometric: Average of growth rates (bacteria growth, return on investment).
= nthroot(x1x2x3)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is statistical significance? What is a p-value?

A

Statistical Significance:
- When some event happens not due to chance

P-value
- The probability that the observed event was random chance

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is the difference between PDF and CDF?

A

PDF - The probability that a random variable is exactly x

CDF - The probability that a random variable is less than or equal to x

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is entropy? How does this relate to ML?

A

Entropy is the amount of uncertainty in a random process. (Lower the better)

Using log-loss/categorical cross entropy/binary-cross-entropy all measure entropy. We can use these to measure how well our model makes predictions.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

MLE vs MAP

A

Maximum Likelihood Estimate:
- Find the parameters that give us the best distributions for our predictions

Maximum A Posteriori:
- Optimize params, but we have some prior knowledge
- ie. We know the probability in the real world. Given this probability, what are the optimal parameters

How well did you know this?
1
Not at all
2
3
4
5
Perfectly