Unit 4 Flashcards

1
Q

Why isn’t it good to report a statistic that you have 100% confidence in?

A

It will usually be meaningless. We need to make sure we can make an interval where we are confident it overlaps the parameter while not making it overly wide

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is statistical inference?

A

Using sample data to make inferences about the population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Why do we use confidence intervals instead of points?

A

Mathematically, the parameter and statistic will never be exactly equal

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What does confidence mean in statistics?

A

If I use the same method to make my interval, a target % of the time it will contain the parameter (a statement on how often our method is going to work)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is the form of confidence intervals?

A

estimate +- sampling error

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is an estimate?

A

our best guess at the true value

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is the sampling error?

A

(margin of error): how much inaccuracy we could generally expect due to sampling variability

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is a general interpretation of a confidence interval?

A

If we repeatedly took samples of the same size from the same population and constructed confidence intervals in a similar manner then C% of all such intervals would contain the true mean mu (where C is the level of confidence

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What happens if we multiply the sampling error by 1/k?

A

The sample size is multiplied by k^2

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What does T represent?

A

the number of standard deviations x bar is from mu with n - 1 degrees of freedom (for every additional thing we’re estimating, we lose some precision)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is the standard error?

A

s/sqrt(n): it estimates the standard deviation of xbar
or
sqrt(p(1-p-hat)/n): it estimates the standard deviation of p

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Where do you use the t-statistic instead of z?

A

Where sigma is unknown. We will use s in place of sigma which means we need ta/2 values instead of Za/2 values

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What does changing the sample size do to a t distribution?

A

A larger sample size will make it narrower (centre stays the same)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What are degrees of freedom?

A

For each element in our sample, we can do one more thing with our analysis. The t-distribution requires us to estimate sigma by s so we “use up” one degree of freedom leaving us with n - 1

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What are some differences between the t-distribution and z-distribution?

A
  • the t-distribution is smaller near the center
  • the t-distribution has wider tails
  • as n increases it moves closer to the z-distribution
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Can we find confidence intervals for distributions that are not normally distributed?

A

Yes, but only with large sample sizes and they will be approximate

17
Q

What is p-star?

A

Either an educated guess at the true value (maybe from historical data) or, 0.5