Standard errors and confidence intervals Flashcards

Question 1

Q

Histogram of means vs histogram of individual heights?

Answer

A

Histogram of sample means will be much less dispersed. As increase sample sizes (but same NUMBER of samples) this will become even less dispersed.

Question 2

Q

Quantifying how the distribution of sample means becomes more concentrated as the sample size increases?

Answer

A

Standard deviation for the means of samples of size 10 will be 1.54cm, say, and 0.20cm for size 1000. This figure measures how precisely the sample mean estimates the population mean and is called the SEM or SE.

Question 3

Q

Equation for standard error?

Answer

A

SE = σ/√n. In reality, use s not σ. =SD for the distribution of sample means.

Question 4

Q

Significance of √ in SE?

Answer

A

SE of sample size 1000 will be 10-fold less than sample size 10. This is different to SD which does not get bigger or smaller with sample size, just becomes a better estimate (hence why it is used, not range for inference). Also means to half SE must quadruple sample size.

Question 5

Q

Quoting mean ± SE?

Answer

A

Bad practice as suggests that μ must be within m±SE. Much better to use 2SE (95%) as intervals will be wider.

Question 6

Q

Premise of confidence intervals?

Answer

A

Note that 95% of values lie within μ±2σ. As the distribution of sample means (theoretical) has mean μ and SE=SD, there is a 95% chance that the sample mean m is between μ±2SE, which is equivalent to saying 95% chance that μ is within m±2SE. This spread will be wider for smaller sample sizes.

Question 7

Q

Confidence intervals and interval estimates?

Answer

A

95% CIs also called interval estimate of μ. Distinct from m, which is a point estimate. Interval estimate better reflects the uncertainty. Significance clear: if trial result was that A gave BP reduction compared to B of -1mmHg, hard to interpret. If CI turned out to be -3, 5 then no material difference likely; if CI were -30, 32 then either could be considerably better.

Question 8

Q

SE and practicalities of sample size?

Answer

A

Means can collect large sample and get an SE as small as experimenter chooses. However, only worth getting it to stage that is clinically significant i.e. who cares if BP differs by 1?

Question 9

Q

Sample means of normal and non-normal data?

Answer

A

As shown, means of samples from normal data have a normal distribution (basis of SE). However, means of samples from variables that are not normal often have a very close-to-normal distribution. Described by Central Limit Theorem.

Question 10

Q

Central premise of hypothesis testing?

Answer

A

That any difference between the two values (in this case treatment means) is due to chance (i.e. that the populations are identical, and therefore should differ only by sampling error. Important to note that just because a test says that difference could be due to chance, does not means that it is due to chance.

Question 11

Q

What actually is a P value?

Answer

A

If a difference between samples is large, measures the probability of the observed difference occurring if the null is true. Result is either that have seen an unlikely event or that the null is false.

Question 12

Q

Type 1 error rate?

Answer

A

Same as the P value (probability of making the error of stating that there is a difference between the two groups when there is not). I.e. FPR? Incorrectly reject the null hypothesis when should not have done.

Question 13

Q

Student’s T-test?

Answer

A

m1-m2/SE. Uses premise that most values of m1-m2 will fall within 2SE if null is true, so ratio of m1-m2/SE bigger or smaller than 2 will be unusual.

Question 14

Q

Type 2 error?

Answer

A

False negative (say there is not a difference between the two groups when there is i.e. failing to reject a false null hypothesis).

Question 15

Q

Understanding the result of a T test?

Answer

A

Get a ratio; plot it on distribution of statistics if null is true; shaded area is % of all t-statistics that are more extreme than the observed value.

Question 16

Q

If get P value of say, 0.45?

Answer

Study These Flashcards

A

Does not mean that the null is true; just means that does not supply evidence against it. This could be because SE is too big because sample size is too small, for example. The data are therefore compatible with μ1-μ2=0, but also a range of values around 0. This range is given with the confidence interval.

Question 17

Q

Assumptions in unpaired T test?

Answer

Study These Flashcards

A

That each sample is drawn from a normal population (determined by μ+σ), and that these populations have a common SD, σ. The null tests usually that the means of these populations are the same.

Question 18

Q

SE equation in unpaired T test?

Answer

Study These Flashcards

A

If the two samples have size N and M, then SE (of difference in means) is σ*√1/M+1/N, where σ is a pooled estimate from the SD of each group. Will be between the two SDs.

Question 19

Q

What does paired T test do differently?

Answer

Study These Flashcards

A

Looks at differences between the paired values, and assumes that these differences come from a population with zero mean, rather than the means of each group.

Question 20

Q

Equation for paired t-statistics?

Answer

Study These Flashcards

A

Technically the same as before i.e. m1-m2/SE (although SE calculated differently). In reality, expressed as d/SE where d is the mean of the differences. This is identical to just having the means of the columns as in unpaired, but the SE is calculated differently.

Question 21

Q

SE in paired and unpaired?

Answer

Study These Flashcards

A

Unpaired SE is based on SD which measures the variation between patients; in paired just looking at intrapatient variation which is obviously smaller. So instead = SD (of the differences)/√N. Pairing gives more sensitive experiment but only if reflected in this analysis.

Question 22

Q

Assumptions underlying paired T test?

Answer

Study These Flashcards

A

Only one: that the differences come from a normal distribution.

Standard errors and confidence intervals Flashcards

(22 cards)