Final Flashcards

1
Q

Probability

A

The measure of the likelihood that an event will occur

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Sample Space

A

The set of all possible outcomes of an experiment

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Event

A

A subset of the sample space

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Compliment

A

Denoted as A’ or A^c; all outcomes not in A.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Addition Rule

A

P(A∪B)=P(A)+P(B)−P(A∩B)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Multiplication Rule (for Independent Events)

A

P(A∩B)=P(A)⋅P(B)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Conditional Probability

A

P(A|B)= P(A∩B) / P(B)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Probability Mass Function (PMF)

A

Gives the probability of each possible value in a discrete random variable.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Probability Density Function (PDF)

A

Gives the probability density of a continuous random variable.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Expected Value (μ)

A

μ=∑(i=1, n) xi * P(X=xi)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Mean (X^-)

A

(X^-) = [∑(i=1, n) xi] / n

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Variance (Var(x))

A

Var(x) = [∑(i=1, n) (xi-[X^-])^2] / n

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Standard Deviation (σ)

A

σ = SQRT (Var(x))

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Summation Notation

A

∑(i=1, n) xi
“The sum of xi from i=1 to n.”

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Factorial Notation

A

n! = the product of all positive integers up to n
EX: 3! = 321 = 6

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Central Limit Theorem (CLT)

A

States that the distribution of the sum (or average) of a large number of independent, identically distributed random variables (typically n > 30) approaches a normal distribution, regardless of the original distribution

17
Q

Conditions for CLT

A

The random variables must be independent.
The sample size should be sufficiently large.
The original distribution’s shape doesn’t matter.

18
Q

CLT Formula for Sample Means

A

If X is a random variable with mean μ and standard deviation
σ, then the distribution of the sample mean (X^-) approaches a normal distribution with a mean, μ, and standard deviation, σ/SQRT(n)

19
Q

Maximum Likelihood Estimation (MLE)

A

A method for estimating the parameters of a statistical model that maximizes the likelihood function

20
Q

Likelihood Function

A

L(θ|X)=P(X|L) where:
L is the likelihood function
θ is the parameter
X is the data.

21
Q

Log-Likelihood Function

A

ℓ(θ)=ln(L(θ|X)); often used for easier calculations
Set dℓ/dθ =0 and solve for the parameter θ.

22
Q

Confidence Interval (CI)

A

A range of values constructed from sample data so that the population parameter is likely to occur within that range at a certain level of confidence

23
Q

Confidence Level

A

The probability that the interval contains the true parameter
Common choices are 90%, 95%, and 99%

24
Q

CI Formula for a Mean

A

(X^ˉ) ± Z⋅σ/SQRT(n) where:
Z is the Z-score corresponding to the desired confidence level.

25
Q

Hypothesis Testing Steps

A

State the null hypothesis (H0) and alternative hypotheses (H1).
Choose the significance level (α).
Calculate the test statistic.
Make a decision: reject or fail to reject the null hypothesis.

26
Q

Type I Error

A

Rejecting a true null hypothesis (false positive)

27
Q

Type II Error

A

Failing to reject a false null hypothesis (false negative)

28
Q

P-Value for Z TEST

A

1 Sided: P(Z > z) or P(Z < z) as appropriate.
2 Sided: P(|Z| > |z|) where z is the calculated Z-value.

29
Q

P-Value for T TEST

A

1 Sided: P(t > t_observed) or P(t < t_observed) as appropriate.
2 Sided: P(|t| > |t_observed|) where t_observed is the calculated t-value.

30
Q

CLT for Sample Proportions

A

If X is the number of successes in a sample of size, n, from a population with proportion, p, the distribution of
(^p) (sample proportion) approaches a normal distribution with mean, p and standard deviation SQRT [(p*(1-p)) / n].

EX: Z = (^p)-p / (SQRT [(p*(1-p)) / n])

31
Q

MLE for Normal Distribution (Known Variance)

A

If X1, X2,…, Xn are independent and identically distributed (i.i.d.) random variables from a normal distribution with mean, μ, and known variance, σ^2, the MLE for μ is the sample mean, (X̄)

32
Q

CI for Population Proportion

A

(^p) ± Z * SQRT [(^p * (1 - ^p)) / n

33
Q

Test Statistic for Population Mean (Known Variance) (Z TEST)

A

For testing a hypothesis about the population mean (μ) with known variance (σ^2), the test statistic is: Z = (X̄ - μ₀) / (σ / √n), where:
X̄ is the sample mean,
μ₀ is the hypothesized population mean.

34
Q

Z-Score (Z)

A

The Z-score is determined by the desired level of confidence. You can find this value in a Z-table or use statistical software.

EX: If you’re constructing a 95% confidence interval, use a Z-score corresponding to the critical value for a 95% confidence interval.

35
Q

Test Statistic for Population Mean (Unknown Variance) (T TEST)

A

t = (X̄ - μ₀) / (s / √n)

36
Q

Degrees of Freedom (df)

A

df = n - 1 where:
n is the sample size