Standard errors and confidence intervals Flashcards

1
Q

Histogram of means vs histogram of individual heights?

A

Histogram of sample means will be much less dispersed. As increase sample sizes (but same NUMBER of samples) this will become even less dispersed.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Quantifying how the distribution of sample means becomes more concentrated as the sample size increases?

A

Standard deviation for the means of samples of size 10 will be 1.54cm, say, and 0.20cm for size 1000. This figure measures how precisely the sample mean estimates the population mean and is called the SEM or SE.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Equation for standard error?

A

SE = σ/√n. In reality, use s not σ. =SD for the distribution of sample means.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Significance of √ in SE?

A

SE of sample size 1000 will be 10-fold less than sample size 10. This is different to SD which does not get bigger or smaller with sample size, just becomes a better estimate (hence why it is used, not range for inference). Also means to half SE must quadruple sample size.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Quoting mean ± SE?

A

Bad practice as suggests that μ must be within m±SE. Much better to use 2SE (95%) as intervals will be wider.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Premise of confidence intervals?

A

Note that 95% of values lie within μ±2σ. As the distribution of sample means (theoretical) has mean μ and SE=SD, there is a 95% chance that the sample mean m is between μ±2SE, which is equivalent to saying 95% chance that μ is within m±2SE. This spread will be wider for smaller sample sizes.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Confidence intervals and interval estimates?

A

95% CIs also called interval estimate of μ. Distinct from m, which is a point estimate. Interval estimate better reflects the uncertainty. Significance clear: if trial result was that A gave BP reduction compared to B of -1mmHg, hard to interpret. If CI turned out to be -3, 5 then no material difference likely; if CI were -30, 32 then either could be considerably better.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

SE and practicalities of sample size?

A

Means can collect large sample and get an SE as small as experimenter chooses. However, only worth getting it to stage that is clinically significant i.e. who cares if BP differs by 1?

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Sample means of normal and non-normal data?

A

As shown, means of samples from normal data have a normal distribution (basis of SE). However, means of samples from variables that are not normal often have a very close-to-normal distribution. Described by Central Limit Theorem.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Central premise of hypothesis testing?

A

That any difference between the two values (in this case treatment means) is due to chance (i.e. that the populations are identical, and therefore should differ only by sampling error. Important to note that just because a test says that difference could be due to chance, does not means that it is due to chance.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What actually is a P value?

A

If a difference between samples is large, measures the probability of the observed difference occurring if the null is true. Result is either that have seen an unlikely event or that the null is false.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Type 1 error rate?

A

Same as the P value (probability of making the error of stating that there is a difference between the two groups when there is not). I.e. FPR? Incorrectly reject the null hypothesis when should not have done.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Student’s T-test?

A

m1-m2/SE. Uses premise that most values of m1-m2 will fall within 2SE if null is true, so ratio of m1-m2/SE bigger or smaller than 2 will be unusual.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Type 2 error?

A

False negative (say there is not a difference between the two groups when there is i.e. failing to reject a false null hypothesis).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Understanding the result of a T test?

A

Get a ratio; plot it on distribution of statistics if null is true; shaded area is % of all t-statistics that are more extreme than the observed value.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

If get P value of say, 0.45?

A

Does not mean that the null is true; just means that does not supply evidence against it. This could be because SE is too big because sample size is too small, for example. The data are therefore compatible with μ1-μ2=0, but also a range of values around 0. This range is given with the confidence interval.

17
Q

Assumptions in unpaired T test?

A

That each sample is drawn from a normal population (determined by μ+σ), and that these populations have a common SD, σ. The null tests usually that the means of these populations are the same.

18
Q

SE equation in unpaired T test?

A

If the two samples have size N and M, then SE (of difference in means) is σ*√1/M+1/N, where σ is a pooled estimate from the SD of each group. Will be between the two SDs.

19
Q

What does paired T test do differently?

A

Looks at differences between the paired values, and assumes that these differences come from a population with zero mean, rather than the means of each group.

20
Q

Equation for paired t-statistics?

A

Technically the same as before i.e. m1-m2/SE (although SE calculated differently). In reality, expressed as d/SE where d is the mean of the differences. This is identical to just having the means of the columns as in unpaired, but the SE is calculated differently.

21
Q

SE in paired and unpaired?

A

Unpaired SE is based on SD which measures the variation between patients; in paired just looking at intrapatient variation which is obviously smaller. So instead = SD (of the differences)/√N. Pairing gives more sensitive experiment but only if reflected in this analysis.

22
Q

Assumptions underlying paired T test?

A

Only one: that the differences come from a normal distribution.