statistics Flashcards

1
Q

How do you calculate incidence percentage? How do you calculate incidence per N

A

Usually reported over a time period
– E.g. 1% per year or 1 in 100 per year
– 2 per 100,000 per year = 0.002% per year
* (new cases / population)*100 = %age
* (new cases / population) *n = rate per n
– E.g. (new cases / population) *100,000 = rate
per 100,000

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Prevalence
-What is this
-How do you calculate prevalence at one point in time
-How do you calculate prevalence over an interval of time

A

Proportion of population with a disease:
– at a point in time (point prevalence)
– over an interval of time (period prevalence)
* Calculation of point prevalence
– (all cases/population)100 = %age
* Calculation of period prevalence
– (all cases during a give time / population over
that period)
100 = %age

Relationship

prevalence = incidence * duration of condition
in chronic diseases the prevalence is much greater than the incidence
in acute diseases the prevalence and incidence are similar. For conditions such as the common cold the incidence may be greater than the prevalence
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

P values
-What does a P value of 0.1 mean?
-What does a p value of 0.01 mean?

A
  • P = 0.1 = 1 in 10 probability due to chance
  • P = 0.01 = 1 in 100 due to chance
    – “highly statistically significant”
  • P = 0.05 = 1 in 20 due to chance
    – often arbitrarily set as statistically significant
  • P = 0.001 = 1 in 1000 due to chance
    – “very highly statistically significant”
  • P value does not tell us ANYTHING about
    CLINICAL significance
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is null hypothesis? when is this rejected? when is this accepted?

A

Null Hypothesis – H0
* Null hypothesis – that there is no
difference between groups
* If p value statistically significant, we
REJECT the null hypothesis and accept
the finding as genuine
* If p value is NOT statistically significant,
we ACCEPT the null hypothesis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is type 1 error vs type 2 error

A

Type 1 error is a false positive
– Positive result actually due to chance
– False rejection of null hypothesis
* Type 2 error is a false negative
– Negative result actually due to error
– False acceptance of null hypothesis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is a confidence interval?

A

Give us an idea where the TRUE
population value lies rather than the
observed value
* Expressed as a range
* Usually interested in 95% CI
* This means that there is a 95% chance the
TRUE value falls within this range
* As sample size increases, 95% CI range
decreases

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What are properties of normal distribution?

A

Properties of the Normal distribution

symmetrical i.e. Mean = mode = median
68.3% of values lie within 1 SD of the mean
95.4% of values lie within 2 SD of the mean
99.7% of values lie within 3 SD of the mean
this is often reversed, so that within 1.96 SD of the mean lie 95% of the sample values
the range of the mean - (1.96 *SD) to the mean + (1.96 * SD) is called the 95% confidence interval, i.e. If a repeat sample of 100 observations are taken from the same group 95 of them would be expected to lie in that range

Standard deviation

the standard deviation (SD) is a measure of how much dispersion exists from the mean
SD = square root (variance)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

How do you calculate sensitivity?

A

Sensitivity
TP / (TP + FN ) Proportion of patients with the condition who have a positive test result

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

How do you calculate specificity?

A

Specificity
TN / (TN + FP) Proportion of patients without the condition who have a negative test result

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

what is a positive predictive value?

A

Positive predictive value
TP / (TP + FP)
The chance that the patient has the condition if the diagnostic test is positive

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is a negative predictive value?

A

Negative predictive value
TN / (TN + FN)
The chance that the patient does not have the condition if the diagnostic test is negative

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

How do you calculate likelihood ratio for positive test result?

A

Likelihood ratio for a positive test result sensitivity / (1 - specificity)
How much the odds of the disease increase when a test is positive

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

How do you calculate Likelihood ratio for a negative test result?

A

Likelihood ratio for a negative test result (1 - sensitivity) / specificity How much the odds of the disease decrease when a test is negative

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Describe the 4 phases of a study

A

Phase 1

small studies (e.g. 100) on healthy volunteers
used to assess pharmacodynamics and pharmacokinetics

Phase 2

small studies (e.g. 100-300) on actual patients
examines efficacy, adverse effects

Phase 3

larger studies (e.g. 500-5,000 patients)
examines efficacy, adverse effects
may compare drug with existing treatments
studies of special groups e.g. renal, elderly

If drug shown to be safe and effective then drugs may be approved for marketing

Phase 4

post-marketing surveillance
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is the hawthorne effect?

A

Hawthorne effect - describes a group changing it’s behaviour due to the knowledge that it is being studied

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is selection bias?

A

Error in assigning individuals to groups leading to differences which may influence the outcome. Subtypes include sampling bias where the subjects are not representative of the population. This may be due to volunteer bias. An example of volunteer bias would be a study looking at the prevalence of Chlamydia in the student population. Students who are at risk of Chlamydia may be more, or less, likely to participate in the study. A similar concept is non-responder bias. If a survey on dietary habits was sent out in the post to random households it is likely that the people who didn’t respond would have poorer diets than those who did.

Other examples include

loss to follow up bias
prevalence/incidence bias (Neyman bias): when a study is investigating a condition that is characterised by early fatalities or silent cases. It results from missed cases being omitted from calculations
admission bias (Berkson's bias): cases and controls in a hospital case control study are systematically different from one another because the combination of exposure to risk and occurrence of disease increases the likelihood of being admitted to the hospital
healthy worker effect
17
Q

What is recall bias?

A

Difference in the accuracy of the recollections retrieved by study participants, possibly due to whether they have disorder or not. E.g. a patient with lung cancer may search their memories more thoroughly for a history of asbestos exposure than someone in the control group. A particular problem in case-control studies.

18
Q

What is publication bias?

A

Failure to publish results from valid studies, often as they showed a negative or uninteresting result. Important in meta-analyses where studies showing negative results may be excluded.

19
Q

What is work-up / verification bias?

A

In studies which compare new diagnostic tests with gold standard tests, work-up bias can be an issue. Sometimes clinicians may be reluctant to order the gold standard test unless the new test is positive, as the gold standard test may be invasive (e.g. tissue biopsy). This approach can seriously distort the results of a study, and alter values such as specificity and sensitivity. Sometimes work-up bias cannot be avoided, in these cases it must be adjusted for by the researchers.

20
Q

What is expectation bias? pygmalion bias?

A

Only a problem in non-blinded trials. Observers may subconsciously measure or report data in a way that favours the expected study outcome.

21
Q

What is late look bias?

A

Gathering information at an inappropriate time e.g. studying a fatal disease many years later when some of the patients may have died already

22
Q

What is procedure bias?

A

Occurs when subjects in different groups receive different treatment

23
Q

Lead-time bias

A

Occurs when two tests for a disease are compared, the new test diagnoses the disease earlier, but there is no effect on the outcome of the disease

24
Q

How do you calculate standard error of the mean? how do you calculate upper limit and lower limit of a 95% confidence interval?

A

The standard error of the mean (SEM) is a measure of the spread expected for the mean of the observations - i.e. how ‘accurate’ the calculated sample mean is from the true population mean

Key point

SEM = SD / square root (n)
where SD = standard deviation and n = sample size
therefore the SEM gets smaller as the sample size (n) increases

A 95% confidence interval:

lower limit = mean - (1.96 * SEM)
upper limit = mean + (1.96 * SEM)
25
Q

What is the power of a study?

A

The power of a study is the probability of (correctly) rejecting the null hypothesis when it is false, i.e. the probability of detecting a statistically significant difference

power = 1 - the probability of a type II error
power can be increased by increasing the sample size
26
Q

what may limit use of a randomised control trial?

A

practical or ethical problems

27
Q

what is the usual outcome of a cohort study?

A

relative risk

28
Q

what is the usual outcome of a case-control study?

A

odds ratio

29
Q

Cross-sectional surveys - AKA, what does this provide?

A

Provide a ‘snapshot’, sometimes called prevalence studies

Provide weak evidence of cause and effect