HaDPop S4 - Sources of Variation Flashcards

1
Q

Explain the concept of random variation

A

Coins tend to produce equal numbers of heads and tails, but what we observe may depart from this by random variation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Give an example of the difference between tendency and observed values

A
  • Tendency: the “true” or “underlying” tendency is for 4 cases per month of meningitis in Leicestershire
  • Observation: in January, February and March this year, we observed 2, 5 and 4 cases respectively
  • N.B. Our “observed” value is our best estimate of the “true” or “underlying” tendency
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Define hypothesis

A

A hypothesis is a statement that an underlying tendency of scientific interest takes a particular quantitative value e.g. true prevalence of tuberculosis in a given population is 2 per 10,000 people

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is formal hypothesis testing?

A

Calculate the probability of getting an observation as extreme as, or more extreme than, the one observed assuming that the stated hypothesis is true. If the probability is very small, it is reasonable to conclude that the data and the stated hypothesis are incompatible. Therefore with a small probability, EITHER something very unlikely has occurred (and hypothesis true) or the stated hypothesis is wrong

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is the p-value?

A

Calculated probability

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Why do we use p=0.05?

A

Purely conventional

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is the interpretation of different p-values?

A

P-value 0.05

  • Cannot reject
  • Does NOT mean that the hypothesis has been proven
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Why isn’t rejecting a hypothesis always useful?

A
  • p
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is statistical significance?

A

Results haven’t occurred by chance alone. More confident that we are observing a “true” result

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What are almost all observed quantities e.g. rate ratio in medical science subject to?

A

Variation by chance

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

How can we make rational inferences about the real size of a quantity of importance given the variation?

A

Use the 95% confidence interval

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Give an example of a confidence interval

A

0.59—–0.87—–1.14
Point estimate (“best guess”) = 0.87
95% confidence limits are 0.59 and 1.14
Our best guess is that the “true” rate ratio is 0.87 and we can be 95% certain that the “true” rate ratio lies somewhere between 0.59 and 1.14

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is a confidence interval (CI)?

A

CI expresses the precision of an estimate and is often presented alongside the results of a study. The narrower the interval, the more precise the estimate

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is the 95% confidence interval?

A

The range within which we can be 95% certain that the “true” value of the underlying tendency really lies. The range is centred on the observed value because it is always our best guess at the “true” underlying value. Therefore the “observed” value is always within the 95% CI - it cannot be anywhere else

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What and who does a prevalence survey measure?

A
  • What? Case definition

- Who? Sampling frame, sampling proportion, sampling technique, response

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is the relationship between the 95% confidence interval and the p-value?

A
  • Values in the 95% CI are consistent with the data
  • 1.00 (the null hypothesis value for a rate ratio) is consistent with the data
  • If the null hypothesis value is consistent with the observed data, then any observed difference from the null hypothesis may be due to chance
  • Null hypothesis value inside 95% CI -> p>0.05
  • Null hypothesis value outside 95% CI -> p
17
Q

How do you calculate the 95% confidence interval?

A
  • Calculate observed value (i.e. IRR) of whatever you are interested in
  • Calculate the error factor
  • Lower 95% Confidence Limit = observed factor / e.f.
  • Upper 95% Confidence Limit = observed factor x e.f.
  • 95% CI is the range from Lower 95% CL to Upper 95% CL
18
Q

What happens as we get more data?

A

We get more and more sure about the “true” underlying value, the e.f. gets smaller and the 95% CI narrower

19
Q

What can CI estimation incorporate?

A
  • Hypothesis
  • Data
  • Incidence rates
  • Incidence rate ratio
20
Q

Show how to calculate CI using a rate

A
  • Population = d cases in P person-years
  • Rate = d/P
  • Error factor = e^{2 x rt(1/d)}
  • 95% CI = rate/e.f. to rate x e.f.
21
Q

Show how to calculate 95% CI from a rate ratio

A
  • Population 1 = d1 cases in P1 person-years
  • Population 2 = d2 cases in P2 person-years
  • Rate ratio = (d1/P1) / (d2/P2)
  • Error factor = e^{2 x rt(1/d1 + 1/d2)}
  • 95% CI = RR/e.f. to RR x e.f.
22
Q

Show how to calculate 95% CI using SMR

A
  • Observe O deaths
  • Expect E deaths (based on standard population age-sex specific rates applied to study population age-sex groups)
  • SMR = O/E x 100
  • Error factor = e^{2 x rt(1/O)}
  • 95% CI = SMR/e.f. to SMR x e.f.