HaDPop S4 - Sources of Variation Flashcards

Question 1

Q

Explain the concept of random variation

Answer

A

Coins tend to produce equal numbers of heads and tails, but what we observe may depart from this by random variation

Question 2

Q

Give an example of the difference between tendency and observed values

Answer

A

Tendency: the “true” or “underlying” tendency is for 4 cases per month of meningitis in Leicestershire
Observation: in January, February and March this year, we observed 2, 5 and 4 cases respectively
N.B. Our “observed” value is our best estimate of the “true” or “underlying” tendency

Question 3

Q

Define hypothesis

Answer

A

A hypothesis is a statement that an underlying tendency of scientific interest takes a particular quantitative value e.g. true prevalence of tuberculosis in a given population is 2 per 10,000 people

Question 4

Q

What is formal hypothesis testing?

Answer

A

Calculate the probability of getting an observation as extreme as, or more extreme than, the one observed assuming that the stated hypothesis is true. If the probability is very small, it is reasonable to conclude that the data and the stated hypothesis are incompatible. Therefore with a small probability, EITHER something very unlikely has occurred (and hypothesis true) or the stated hypothesis is wrong

Question 5

Q

What is the p-value?

Answer

A

Calculated probability

Question 6

Q

Why do we use p=0.05?

Answer

A

Purely conventional

Question 7

Q

What is the interpretation of different p-values?

Answer

A

P-value 0.05

Cannot reject
Does NOT mean that the hypothesis has been proven

Question 8

Q

Why isn’t rejecting a hypothesis always useful?

Question 9

Q

What is statistical significance?

Answer

A

Results haven’t occurred by chance alone. More confident that we are observing a “true” result

Question 10

Q

What are almost all observed quantities e.g. rate ratio in medical science subject to?

Answer

A

Variation by chance

Question 11

Q

How can we make rational inferences about the real size of a quantity of importance given the variation?

Answer

A

Use the 95% confidence interval

Question 12

Q

Give an example of a confidence interval

Answer

A

0.59—–0.87—–1.14
Point estimate (“best guess”) = 0.87
95% confidence limits are 0.59 and 1.14
Our best guess is that the “true” rate ratio is 0.87 and we can be 95% certain that the “true” rate ratio lies somewhere between 0.59 and 1.14

Question 13

Q

What is a confidence interval (CI)?

Answer

A

CI expresses the precision of an estimate and is often presented alongside the results of a study. The narrower the interval, the more precise the estimate

Question 14

Q

What is the 95% confidence interval?

Answer

A

The range within which we can be 95% certain that the “true” value of the underlying tendency really lies. The range is centred on the observed value because it is always our best guess at the “true” underlying value. Therefore the “observed” value is always within the 95% CI - it cannot be anywhere else

Question 15

Q

What and who does a prevalence survey measure?

Answer

A

What? Case definition

- Who? Sampling frame, sampling proportion, sampling technique, response

Question 16

Q

What is the relationship between the 95% confidence interval and the p-value?

Answer

Study These Flashcards

A

Values in the 95% CI are consistent with the data
1.00 (the null hypothesis value for a rate ratio) is consistent with the data
If the null hypothesis value is consistent with the observed data, then any observed difference from the null hypothesis may be due to chance
Null hypothesis value inside 95% CI -> p>0.05
Null hypothesis value outside 95% CI -> p

Question 17

Q

How do you calculate the 95% confidence interval?

Answer

Study These Flashcards

A

Calculate observed value (i.e. IRR) of whatever you are interested in
Calculate the error factor
Lower 95% Confidence Limit = observed factor / e.f.
Upper 95% Confidence Limit = observed factor x e.f.
95% CI is the range from Lower 95% CL to Upper 95% CL

Question 18

Q

What happens as we get more data?

Answer

Study These Flashcards

A

We get more and more sure about the “true” underlying value, the e.f. gets smaller and the 95% CI narrower

Question 19

Q

What can CI estimation incorporate?

Answer

Study These Flashcards

A

Hypothesis
Data
Incidence rates
Incidence rate ratio

Question 20

Q

Show how to calculate CI using a rate

Answer

Study These Flashcards

A

Population = d cases in P person-years
Rate = d/P
Error factor = e^{2 x rt(1/d)}
95% CI = rate/e.f. to rate x e.f.

Question 21

Q

Show how to calculate 95% CI from a rate ratio

Answer

Study These Flashcards

A

Population 1 = d1 cases in P1 person-years
Population 2 = d2 cases in P2 person-years
Rate ratio = (d1/P1) / (d2/P2)
Error factor = e^{2 x rt(1/d1 + 1/d2)}
95% CI = RR/e.f. to RR x e.f.

Question 22

Q

Show how to calculate 95% CI using SMR

Answer

Study These Flashcards

A

Observe O deaths
Expect E deaths (based on standard population age-sex specific rates applied to study population age-sex groups)
SMR = O/E x 100
Error factor = e^{2 x rt(1/O)}
95% CI = SMR/e.f. to SMR x e.f.

HaDPop S4 - Sources of Variation Flashcards

(22 cards)