Statistical Inference Flashcards

1
Q

What are two types of statistical inference?

A

Point estimation and interval estimation
Statistical inference is using estimated sample statistics and distributional properties of the estimator to make statements about unobserved population parameters

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

How do you construct a confidence interval when the mean is unknown?

A

Use the estimator for the mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

How do you construct a confidence interval when the variance is unknown?

A

Use the estimator for the variance or standard deviation (sigma hat) and replace the normal critical values with the values from Student’s t distribution using n - 1 degrees of freedom

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

When is the t distribution used?

A

When the observations come from a normal distribution but the mean and variance are unknown

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is the 95% confidence interval for a normal sample with unknown mean and variance?

A

[(mu hat) - tn-10.025(sigma hat), (mu hat) + tn-10.025(sigma hat)]
Divide sigma hat by root n when dealing with the distribution of the sample mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is the Central Limit Theorem?

A

For sufficiently large samples (n >= 40), the distribution of the sample mean will be approximately normal with mean mu and variance sigma squared over n, meaning estimators for mu and sigma can be used to construct CIs for the mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What assumption must hold to construct confidence intervals for the variance?

A

Population distribution is normal

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is the chi squared distribution?

A

Χn-12 ~ (n-1)Sn-122

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is the 1 - α level confidence interval for variance?

A

[(n-1)Sn-12α/2, n-12, (n-1)Sn-121 - α/2, n-12]

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is a hypothesis test?

A

A test of whether a parameter takes a particular value based on sample statistics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is a statistical test?

A

A rule for whether to reject a certain assumption given the available data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What are the null and alternative hypotheses?

A

The null hypothesis makes a specific assertion about a population parameter, the null is tested against the alternative hypothesis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is the test statistic?

A

The random variable which will determine the outcome of the test

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What are the acceptance and rejection regions?

A

When the test statistic falls in the acceptance region the null hypothesis is not rejected, when it falls in the rejection region the null is rejected
A and R are mutually exclusive and exhaustive

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What are Type I and Type II errors?

A

Type I: rejecting a true null hypothesis
Type II: not rejecting a false null hypothesis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is the significance level (or size) of a test?

A

α = P(Type I error) = P(reject H0|H0 true)

17
Q

What is the relationship between Type I and Type II errors?

A

There is a trade-off: designing a test to minimise one will increase the other
This is the same trade-off as size and powerm we cannot choose both but we want the smallest β for a given alpha

18
Q

What is β?

A

P(Type II error) = P(not rejecting null|null false)

19
Q

What is the power of a test?

A

The probability of rejecting a false null
1 - β = P(reject null|null false)

20
Q

How can size and power be represented on a diagram?

A

If the null hypothesis is that the sample comes from a standard normal distribution and the alternative hypothesis is that the mean is actually two, a vertical line can be drawn at the critical value, then the area under the alternative pdf before the critical line is β = 1 - power and the area under the null pdf after the critical line is the size

21
Q

How do you conduct a hypothesis test for the mean?

A

Compare the test statistic (mu hat - mu)/(standard error) to the critical value (usually the t value with n - 1 degrees of freedom, if n > 40 then the normal value)
Watch out for CLT, one and two tailed tests

22
Q

How do you conduct a hypothesis test for the variance?

A

Assume/know normality
Compare the test statistic (n-1)s2/(null variance) to the critical value from the chi squared distribution

23
Q

How do you conduct a hypothesis test for proportion?

A

Assume/know normality (np, nq > 5, p not very close to 0 or 1)
Compare test statistic (p hat - p0)/sqrt(p0(1 - p0)/n) and compare to the critical value from the z distribution

24
Q

How do you construct a confidence interval for proportion?

A

For a (1 - α) CI of population proportion, use boundaries (p hat) +- zα/2 * sqrt((p hat)(1 - p hat)/n)