General Flashcards

1
Q

examples of interventional study designs

A

N-of-1 randomised trials
systematic reviews
randomised controlled trials

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

examples of observational designs

A

cohort studies
case-control studies
cross-sectional studies
ecological studies

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

examples of descriptive designs

A

case series

case reports

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

functions of good research design

A
  • enable a comparison
  • allow the comparison to be quantified
  • determine the temporal sequence of the risk
  • identify the risk factor for the disease
  • minimise third variable effects: bias, confounding
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

features of cohort studies

A
  • exposure measurement occurs before outcome ascertainment; subjects are recruited based on their exposure
  • controls do not have the exposure
  • exposure is measured at or near the beginning of the study
  • all groups are followed through time
  • the outcome is measured when it occurs; cases arise during the study
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

in a cohort study, how might exposure be measured?

A

questionnaire, blood and tissue samples, pre-existing records

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

in a cohort study, how might outcomes be ascertained?

A
  • follow-up: is time consuming and expensive, can lead to false diagnoses
  • pre-existing records: e.g. registries, medical records
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

can risk be directly calculated in cohort studies?

A

Yes: absolute risk, attributable risk (risk difference), relative risk (risk ratio), survival curves

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

advantages of cohort study designs?

A
  • for rare exposures and common outcomes
  • rigorous epidemiological design (able to directly measure incidence)
  • provide temporal sequence
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

disadvantages of cohort study design?

A
  • expensive
  • time consuming
  • case and controls may differ on important outcome predictors “need to have good case and control selections so that confounding isn’t coming into the sample selection: once you select your sample, you can’t go back and repeat because the timeframes are long”
  • can be susceptible to bias and confounding
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What are the types of bias present in cohort studies?

A
  • confounding
  • sampling bias (selection bias: a sample is collected in such a way that some members of the population are less likely to be included than others)
  • migration bias (type of sampling, and selection bias: excluding subjects who have recently moved into or out of a study area)
  • measurement bias (measurement of exposure or outcome is not similar between groups of patients studied)
  • misclassification bias (measurement error)
    (effect modification)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

How do prospective and retrospective cohort study designs differ?

A

Re when the cohort is assembled, in the past for retro, present for prospective

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Advantages of prospective cohort studies

A
  • able to collect life and demographic data not available on medical records
  • able to set up a standardised way of measuring exposure and degree of exposure to risk factors
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

disadvantages of prospective cohort studies

A
  • long duration
  • expensive
  • loss to follow up
  • cannot be used for rare diseases
  • inefficient because many more subjects need to be enrolled
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

advantages of retrospective cohort studies

A
  • more efficient than prospective cohort study because data is already collected
  • cheaper than prospective cohort study because there is no need for long follow up
  • faster because patient outcomes have already been collected
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

disadvantages of retrospective cohort studies

A
  • long duration
  • expensive (less expensive than prospective)
  • loss to follow up
  • reliance on records; cannot examine a patient characteristic not already recorded
  • measurement of exposure and degree of exposure may not be standardised; may not get a standard exposure measurement across different sites, or from patient to patient
  • problem with time-dependent exposures; difficult to find temporal sequence
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Define relative risk/risk ratio

A
  • ratio of risk in exposed to risk in unexposed persons
  • “how many times more likely are exposed persons to get the disease relative to non-exposed persons?”
  • relative risk = Risk in exposed / risk in unexposed where a risk ratio of >1 means the exposure increases the risk of disease while, =1 doesn’t change the risk of disease
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

What are attributable and relative risk calculated from?

A

the incidence/absolute risk of an outcome in an exposed and unexposed group

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

4 components of reporting risk

A

exposed group, unexposed group, the relative risk and the outcome e.g. “Oestrogen users have 1.27 times the risk of developing breast cancer compared to oestrogen non-users

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

Calculate Population-attributable risk (PAR)

A

PAR = attributable risk x prevalence of the exposure in the population
or PAR = incidence rate in population - incidence rate in unexposed

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

Define Population-Attributable fraction (PAF)

A
PAF = (population attributable risk / incidence in total population) x 100 or what fraction of disease in a population is attributable to exposure to a risk factor
PAF = (incidence in total population - incidence in unexposed)/ incidence in total population x 100
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

Prognosis

A

prediction of the course of disease following its onset

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

Define prognostic factors

A

patient characteristics that are associated with the outcome of the disease

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

Differences between risk and prognosis

A
  • risk factor relates risk factor to disease, prognostic factor relates disease to outcome
  • risk factors deal with healthy people, prognosis deals with sick people
  • risk factors the outcome is usually disease onset, prognosis the outcome is the consequence of disease e.g. suffering, death, disability, complications
  • risk factors usually for low probability events, prognosis factors are for relatively frequent events
  • factors may be different; variable associated with an increased risk are not necessarily the same as those marking the worse prognosis
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

What is the difference between clinical course and natural history?

A

clinical course = evolution or prognosis of a disease that has come under medical care and has been treated in a variety of ways that affect subsequent course of events
natural history = prognosis of a disease without medical intervention

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
26
Q

important features of prognosis studies

A

patient selection, time of entry into the study, follow-up, outcomes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
27
Q

What are the ways prognosis can be reported?

A

absolute risk, relative risk, odds ratio, 5-year survival, case-fatality, disease-specific mortality, response, remission, reoccurrence

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
28
Q

How can you be sure a finding isn’t due to chance / or is causal?

A

When confounding variables are absent, where there are no selection and measurement biases, with p values and hypothesis testing

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
29
Q

How do you tell if a finding isn’t due to chance/ or is causal?

A

hypothesis testing and p values

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
30
Q

What is random error?

A

poor precision by chance alone

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
31
Q

What is systematic error?

A

measurements differ from the truth in a systematic way - a degree of error is always inevitable e.g. blood pressure

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
32
Q

When does selection bias occur? What does it result in?

A

occurs when comparisons are made between two groups that are different in ways other than the main factor under study
the difference between the two groups affects the outcomes of the study

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
33
Q

What is the problem with using volunteers in studies?

A

Volunteers are generally more health conscious compared to the rest of the population (selection bias)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
34
Q

Types of selection biases?

A
  • volunteers
  • low response rates - control group, might not be as interested
  • ascertainment or detection bias - an individual’s chance of being diagnosed with a particular disease is relating to whether they’ve been exposed (not detecting the exposure correctly)
  • healthy worker effect; those able to work are healthier than the broader population (excluding disability, elderly, severely ill)
  • loss to follow up
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
35
Q

How to control selection bias in case-control studies?

A

strong case definition, objective ways to assess this
appropriate control selection
high participation rate in both groups
clearly defined inclusion and exclusion criteria

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
36
Q

How to control for selection bias in cohort studies?

A

maximise retention of the cohort (exposed and non-exposed)
where/if possible follow up those who dropped out
clearly defined inclusion and exclusion criteria

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
37
Q

When does measurement bias occur?

A

if the methods of measurement are not similar among the groups of patients; either measurement of the exposure or outcome

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
38
Q

What are types of measurement bias?

A
  • recall bias - differences in accuracy of recall of memory e.g. someone inaccurately memory, mothers of children with leukaemia (was mum exposed to X-ray when pregnant mother might be more likely to recall that event than a mother with children without leukaemia)
  • reporting bias - incomplete medical notes, some information considered unimportant
  • interviewer or observer bias; errors due to individuals taking the measurements
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
39
Q

Ways to control measurement bias?

A
  • clear, precise and objective definition
  • appropriate choice of measurement definition
  • quality control
  • is the measurement accurate and precise
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
40
Q

What are special biases associated with screening tests?

A
  • Lead-time bias: (Huntington’s disease) overestimation of the survival time due to the backward shift in the starting point for measuring survival that arises from early detection procedures
  • length-time bias: (slow vs fast growing cancer) a systematic error due to the inappropriate selection of long-duration cases
  • compliance bias: because screening requires a patient to present and be regularly screening the type of patient presenting for screening is generally more compliant than those who are not presenting at screening
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
41
Q

What is channelling bias?

A

type of selection bias. Drugs with similar therapeutic benefits are prescribed to groups with different prognoses: e.g. a new drug may be prescribed to a group with pre-existing morbidity, so a disease state may be incorrectly attributed to use of the drug.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
42
Q

What is data completeness bias?

A

type of measurement/information bias. Missing data can lead to different results recorded for the same finding.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
43
Q

What is surveillance bias?

A

type of measurement bias. “the more you look, the more you find” some patients are followed up more or have more diagnostic tests performed on them than others

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
44
Q

What is partial verification bias?

A

measurement bias. In the 1980’s CT scans were obtained when there was a headache present because clinicians assumed that headache increased the likelihood of haemorrhage as a cause of the neurologic deficit. If CTs were more likely to be obtained in the evaluation of acute neurologic deficits when headache was present, then a study of headache as a predictor of haemorrhage would result in higher sensitivity and lower specificity than if headache played no role in obtaining the gold standard CT

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
45
Q

What is publication bias?

A

(Selection bias). When not only the quality, but the results and hypotheses tested influence whether or not a paper is published. Can bias systematic reviews. e.g. research with significant results are more likely to be published.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
46
Q

What is referral or verification bias?

A

Measurement bias. The results of a diagnostic test affect whether the gold standard procedure is used to verify the test result.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
47
Q

What is reporting bias?

A

Selection bias. Selective revealing or suppression of information by subjects

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
48
Q

What is social desirability bias?

A

Measurement bias. The tendency of survey respondents to answer questions in a manner that will be viewed favourably by others.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
49
Q

What is spectrum bias?

A

measurement bias. The performance of a diagnostic test may differ between groups due to the different mix of people in each group.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
50
Q

Cumulative incidence equation?

A

number of people who develop disease in a specified period/ number of people at risk of getting the disease (at the start of the period)
- measures the proportion of (a group of) people who develop disease during a specified time period

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
51
Q

Incidence rate / incidence density equation?

A

number of people who develop disease/ number of person-years when people are at risk of getting the disease
- measured how quickly people are developing a disease

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
52
Q

equation for attributable risk?

A

incidence in exposed - incidence in unexposed

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
53
Q

equation for risk ratio?

A

cumulative incidence in exposed / cumulative incidence in unexposed

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
54
Q

sampling

A

process of selecting a representative part of the whole population for the purposes of determining parameters or characteristics of the whole population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
55
Q

3 characteristics to think about when determining a population

A
  1. members of the population should be susceptible to the outcome
  2. population relevant to questions begin asked
  3. sufficiently described so that you can decide to which people the results of the study applies
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
56
Q

purpose of statistics

A

draw inferences from samples about the underlying population or show how representative a sample is of a population - sometimes the degree to which the sample is representative of the population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
57
Q

Sampling error

A

differences between the sample results and the population characteristic being measured, needs to be kept small

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
58
Q

factors contributing to sample error

A

bias and random sampling error

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
59
Q

what is random sampling error

A

variations within the sample due to the natural variations seen between individuals - occur purely by chance

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
60
Q

ways to handle random sampling error

A
  • appropriate sampling techniques (e.g. random selection)
  • appropriate sample size (power = statistical terminology): the larger the sample size, the greater variation is measured, the closer the sample value is to the true population variable (the more precise the measurement)
  • appropriate statistical calculations measuring error occurring by chance (confidence intervals and p-values) - how much the association is there by chance
  • can’t just say the sample isn’t big enough - re study critique - didn’t do a para calculation, what about the techniques to select the sample? ways of measuring cause and effects?
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
61
Q

What is a target population?

A

the collection of individuals about which inferences are desired

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
62
Q

What is the study population?

A

the collection of individuals from which the sample is drawn

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
63
Q

What is the sample?

A

the collection of individuals on which the investigation is conducted

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
64
Q

What are the two main sampling methods

A

non-probability methods and probability methods

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
65
Q

What are non-probability methods

A
  • where the probability of choosing someone is unknown
  • cheaper, but unable to generalise, potential for biases
    e. g.
  • availability sampling (people are easy to find),
  • quota sampling
  • convenience sampling: made up of people who are easy to reach e.g. pollster interviews shoppers at a local mall
  • deviant case sampling (those who are extremes of that population are chosen)
  • typical case sampling (averages are selected)
  • snowball sampling (one person tells other people, other people tell other people - e.g. recruiting through Facebook, particularly good in communities hard to break into like drug addicts)
  • expert sampling (people who know the most about it are selected - or point to various populations)
  • critical case sampling (e.g. using small island nations to analyse the effects of climate change and apply it to larger landmasses)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
66
Q

What are probability methods

A

the probability of being sampled is known e.g. random sampling methods in RCTs

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
67
Q

Importantly, when can statistical inferences be made?

A

Only when random samples are used.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
68
Q

What is simple random sampling (SRS)

A

akin to pulling names out of a hat

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
69
Q

What is systematic sampling

A

taking the ith number of a list - less efficient than SRS

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
70
Q

What is stratified sampling

A

taking SRS from mutually exclusive and exhaustive subdivisions of the population: more efficient than SRS

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
71
Q

What is cluster sampling

A

taking SRS from defined clusters; less efficient than SRS

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
72
Q

What is a type I (alpha) error

A

is the probability of finding a difference with the sample compared to the population, when there isn’t really one.
- usually set at 0.05 or 5%
- OR the probability of rejecting the Ho when it is true
=false-positive conclusion

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
73
Q

What is a type II (beta) error

A

the probability of not finding a difference that actually exists between our sample compared to the population, when there really is one. usually set at 0.2 or 20% depending on the study design, again an arbitrary allocation; or 20% chance of missing a true difference is considered reasonably acceptable
- OR probability of accepting Ho when it is false (power; power = 1 - type II error (beta)
=false-negative conclusion

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
74
Q

what are the four factors sampling size is dependent on

A
  1. level of significance required (p values, CI intervals)
  2. difference between groups you wish to detect (large or small difference)
  3. the variability of the estimate
  4. cost
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
75
Q

as the sample size increases… (sample size is dependent on four factors: significance, difference, variability, cost)

A
  • the level of significance becomes smaller
  • the difference becomes smaller
  • the variability increases
  • the cost per unit is reduced
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
76
Q

related information generated and sample size

A

in general, the information generated is a squared function of sample size:

  • to get twice the information, you need four times the sample
  • to get three times the information, you need nine times the sample
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
77
Q

clinical considerations about sampling and samples

A
  • numbers are subordinate to method: a study with rigorous design and a small sample size is superior to a study with poor study design and a larger sample size
  • methods and numbers are subordinate to cost: cost is always a factor
  • all issues are subordinate to ethical considerations: need to make sure the sample is big enough that an effect can be detected if it is there or prove that it is not there + small enough that we are not asking more participants than is necessary to undergo the inconvenience of being a part of the study
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
78
Q

What are the two approaches to chance?

A
  • hypothesis testing (null, alternate hypothesis) - dichotomous outcome, either statistically significant or not e.g. uses the p value
  • estimation: estimate a range of values in which the true value is likely to occur e.g. uses confidence intervals
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
79
Q

If an RCT is testing whether Drug A is better, what are the four possible conclusions?

A
  • Drug A is better than usual care and this is the conclusion of the study
  • *Drug A is the same and has similar effects to usual care and the study concludes that a difference is unlikely
  • *Drug A is the same and has similar effects to usual care, but it is concluded that drug A is more effective
  • Drug A is more effective than usual care, but the study concludes that there is no difference between drug A and usual care
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
80
Q

What is used to estimate the effects of random variation?

A

statistical testing

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
81
Q

When bias is absent, what is responsible for statistical uncertainty?

A

random error

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
82
Q

steps in how to approach a clinical question?

A
  • construct an answerable question
    e. g. ‘what is the relationship between coeliac disease and the development of neuropathy’
  • set decision threshold (comes from clinical experience)
    e. g. a 20-30% increased risk of neuropathy is clinically important (5% level of significance)
  • gather information
    e. g. data collected on small intestine biopsies and matched to neuropathies between 1969-2008
  • process information (mechanical act of dealing with large amounts of information - select the right graphical representation of the data, correct selection of statistical tests, cleaned the data)
    e. g. risk of neuropathy HR = 2.5 with 95% CI (2.1-3.0), so the results are statistically significant because a HR of 1 = no effect and the CI doesn’t include 1: p value of
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
83
Q

What is inference?

A

drawing conclusions from data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
84
Q

What is statistical inference?

A

drawing conclusions from quantitative or qualitative information using statistical processes to describe and arrange the data and, in some cases, to test the suitable hypotheses

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
85
Q

What are the two major forms of inferences?

A

estimates (point and interval - ways to describe the data) and hypotheses (use statistics to test the veracity of a hypothesis)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
86
Q

what is a point estimate?

A

use of sample data to estimate population parameters
- or the best estimate about the population from the data arising within the sample e.g. mean, SD, proportion
= 1 representative figure *the true effect size is unlikely to be exactly that observed in the study (random variation) so for any point estimate, need to have a summary measure to give the statistical precision of that point estimate

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
87
Q

What is interval estimate?

A
  • range of reasonable values containing the parameters with a certain degree of confidence
    = estimate + and - [confidence x variability]
    use of sample data to provide a range of reasonable values intended to contain population parameters with certain degrees of confidence
  • the more narrow a CI, the more confident one can be about the true size of the effect
    = the true value is more likely to be closer to the point estimate than the outer limits of the interval estimate
  • 95% CI, 5 times out of 100, the true effect size could fall outside the limits or CI; gives statistical precision of the estimate
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
88
Q

For data that is individual observations, what measure of spread is used?

A

Standard deviation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
89
Q

If measurements have been taken from different sample populations, and you want to graph the distribution of the grouped means, what measure of spreads are appropriate and inappropriate?

A

appropriate: standard error of the mean
inappropriate: SD

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
90
Q

When is SD used?

A

to describe the spread in the frequency distribution
- spread of the individual data; but doesn’t tell what is happening in the population, doesn’t extrapolate data from a sample to a population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
91
Q

What is SEM?

A

standard error of the mean = standard deviation for the means

  • helps to estimate the probable error of the sample mean’s estimate of the true population mean
  • standard error = SD / [(square root of N) - 1] where N is sample size; the larger the sample size, the smaller the standard error
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
92
Q

Normal distribution values?

A

1 SD = 68%
1.96 SD = 95%
2 SD = 95.4%
3 SD = 99.7%

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
93
Q

Equation for confidence intervals?

A

CI = mean (-1.96 x SEM), (+1.96 x SEM) = 95% CI, 95% sure the interval will contain the true population mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
94
Q

When might confidence intervals be used?

A

to characterise the statistical precision of:
- any rate: incidence or prevalence
- diagnostic test performance
- comparisons of rates: RR, OR, HR
- other summary statistics
to get statistical significance at 0.05: if the point corresponding to no effect (e.g. RR =1 or treatment effect = 0) falls outside the 95% CI for the observed effect, the results are statistically significant

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
95
Q

How should a confidence interval be interpreted?

A
  • there is a 95% chance that the true population effect lies within the stated confidence limits
  • if the study was repeated 100 times, 95 times out of the 100 the true effect would lie within the expressed confidence limits
  • we are 95% sure that what we have measured in the sample is true for the population
  • not there is a 95% probability that the parameter lies within the CI - obvious, we are 100% the sample lies within the 95% - not about accuracy of data, but that the data in the sample will be representative of the population
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
96
Q

benefits of interval estimates?

A

more useful in clinical studies than hypothesis testing

  • gives how much uncertainty is inherent in the sample statistic because it gives the interval on either side
  • formula captures the uncertainty in a sample statistic by defining an interval likely to capture the population parameter with a specific degree of confidence
  • the width of the interval estimate depends on:
    • level of confidence (higher level, wider interval) e.g. 99% CI?
    • variability (sample size: smaller sample, wider interval)
    • standard error (larger standard error, wider interval)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
97
Q

Does statistical significance also mean clinical significance?

A

Statistical significance isn’t always clinically significant; clinically, the effect might be really small, the range might be too wide to have much meaning; depends on the question asked, the patient themselves, clinical experience

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
98
Q

What is hypothesis testing?

A
  • drawing of inferences between competing hypotheses as a prediction of the examination of the data is going to show
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
99
Q

What are the two types of hypotheses?

A
  • null hypothesis (Ho): the hypothesis to be tested
  • alternative hypothesis (H1): the hypothesis contradicting Ho - usually stated in the hypothesis
  • all testing is done against the null hypothesis: we reject or fail to reject Ho
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
100
Q

what are the steps in hypothesis testing?

A
  • Construct Ho
  • construct H1
  • determine level of significance
  • collect data
  • calculate test statistic
  • calculate p-value
  • compare p-value with level of significance
  • reach decision
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
101
Q

What should be considered in choosing a test statistic (test statistic is step 5 of hypothesis testing)?

A

use the table in lecture 21. Predictor = exposure (was the data binary, categorical, ordinal or continuous)
+ outcome (binary, categorical, ordinal or continuous)
- what type of data is it, and is the data predictor or outcome data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
102
Q

What is the p-value? (step 6 of hypothesis testing)

A
  • the p value is a quantitative estimate of the probability that a difference in the treatment effect in a particular study could have happened by random error assuming that there is no difference between the two groups
  • the calculated level of significance at which we would be indifferent between accepting and rejecting the null hypothesis given the sample at hand: the smaller the p value, the less likely there is no difference between the two groups
  • the probability that the test statistic or relationship would be as extreme or more extreme as the observed value if the null hypothesis were true
  • “if there was no difference between treatment groups upon repeated studies, what proportion of the trials would conclude that the difference between the two treatments was at least as large as that found in this study” a large p value supports the null hypothesis that there is no difference between treatment groups
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
103
Q

What is statistical significance in terms of p value?

A
  • p value of
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
104
Q

Comparing CI and p values, what are their advantages?

A

CI: emphasise size of the effect, allows the reader to see the range of plausible values and determine the clinically meaningful results, provide information about statistical power
p values: tradition, convenience, sometimes CI aren’t feasible

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
105
Q

what is statistical power?

A

the probability of finding a difference when a difference really exists
- or the probability that a study will find a statistically significant difference when a difference really exists
- where power = 1 - beta
= a determinant of sample size

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
106
Q

What does a t-test do?

A

compares means of a continuous variable in two research samples e.g. treatment and control groups

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
107
Q

What are the two types of t-tests?

A
  • A student t-test: used if the samples come from two different groups
  • a paired t-test: used if the samples come from the same group (e.g. systolic BP measured in a group of women before and after exercise)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
108
Q

what is a z-test?

A

works like a t-test but compares the proportion of two groups

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
109
Q

When will an ANOVA test be used (analysis of variance)?

A

dependent variable is continuous, independent variable is categorical (nominal, dichotomous, or ordinal)

110
Q

What is an N-way ANOVA test?

A

if more than 1 independent variable is used, N is the number of independent variables

111
Q

What test for continuous predictor (age) and a binary outcome (cancer)?

A

logistic regression

112
Q

What test for continuous predictor (birth weight) and continuous outcome (systolic blood pressure)?

A

Pearson correlation; linear regression

113
Q

What is correlation?

A

a measurement of the strength of the association between two study variables

  • strength of a linear relationship
  • often referring to the Pearson’s product moment correlation coefficient, denoted by ‘r’
114
Q

What is regression?

A
  • ability to get line of best fit: a predictive model
    derives a prediction equation for estimating the value of one variable given the value of the second
  • determines line of best fit; slope and intercept
  • data occurs in pairs of measurements (X, Y)
115
Q

What does the correlation coefficient ‘r’ do?

A

defines both the strength and direction of the linear relationship between two variables

  • correlation coefficient is an index between -1 and +1
  • when r = -1, there is a perfect negative linear association
  • when r = +1, there if a perfect positive linear association
  • when r = 0 there is no linear relationship
  • r doesn’t always indicate causality or clinical significance
  • r is affected by extreme values - normal curve is useful
116
Q

What is the general equation for the sample and population regression line:

A

Y with a hat = b0 + b1X where X is the predictor, b1 is the slope and b0 is the y-intercept
Y = Beta0 + betaiX

117
Q

What is the method of least squares?

A

the statistical technique for finding the regression line

118
Q

How does a regression line test statistical significance?

A

does the slope beta 1 differ from zero?

“does the slope of the population regression line beta 1 differ from zero”

119
Q

What is the coefficient of determination (r^2)?

A

defines the proportion of the total variation in values of the dependent variable (Y) that can be attributed to its linear relationship with the independent variable (X)?

120
Q

Equation for r^2 or the coefficient of determination?

A

r^2 = sum of squares due to regression / total variation in Y

121
Q

Compare the number of variables dealt with in correlation vs regression

A

Correlation deals with 2 variables, regression is limitless

122
Q

Other types of regression not covered in Year A but in journal articles…

A
y = binary: logistic regression 
y = count: link function called Poisson Regression 
y = time to event: Proportional Hazards Regression
123
Q

What is a common statistical test used in clinical research when hazards ratios are used?

A

Cox proportional hazards: time-to-event outcome

124
Q

What is survival analysis?

A

statistical picture of the survival of a group of patients in the form of a graph showing the percentage survival vs time

125
Q

Purpose of survival analysis?

A

report prognosis in such a way that information can be inferred about the likelihood that patients with a certain prognostic factor will experience an outcome at any point in time

126
Q

When is a Kaplan-Meier analysis used?

A

with any dichotomous outcome that occurs only once during follow-up

127
Q

What are the best estimates, for a given set of data, for the probability of survival for members of the cohort?

A

points of the survival curve

128
Q

In a survival curve, the rate of outcomes (decreases/increases/remains unchanged)

A

Can be any: can remain unchanged because the rates of survival are being applied to a diminishing group of people

129
Q

When might a hazard ratio be used?

A

Hazard ratio essentially analogous to relative risk with a time component - the cumulative relative risk over a time period across all groups. E.g. might compare different prognostic factors and the chances of survival for each (multiple survival curves graphed to compare prognostic factors)
- the chance of an event occurring in the treatment arm/ the chance of an event occurring in the control arm

130
Q

What are the features of a case-control design?

A
  • subjects are recruited based on disease: outcome ascertainment occurs before exposure measurement
  • exposure for both cases and controls has occurred at the time of recruitment.
  • controls = similar individuals without the disease
  • the frequency of past exposure is compared between cases and controls
  • relative risk is estimated.
131
Q

What are some important design elements of a case-control study?

A
  • Case selection 1. ideally, cases (those with the disease) are going to be incident cases. Presence and duration of the disease. This is because the risk factor for a disease is whether or not you have it and duration of disease (how long you have the disease before you recover/die). It’s difficult to separate what the effect of the duration of the disease is. 2. Can enrol prevalent cases: used for rare diseases, looks like a cross-sectional study. Problem of determining temporal sequence for exposure-disease relationship. Can also get survival bias (missed severe cases who died before recruitment). 3. Case definition: crucial to case selection. Must only select people who are at risk of exposure (e.g. only war veterans in studying depression on war veterans).
  • control selection. 1. Must be as similar as possible to cases except for absence of disease.
132
Q

Equation for prevalence when prevalence of the disease is low (

A

= incidence x duration

  • or the number of people with disease at a given point in time / total number of people in the population
  • assumes a stationary population
133
Q

What are the approaches to control selection?

A
  • population approach: randomly drawn from population
  • cohort approach: nested case-control
  • hospital and community controls: hospital controls taken from the same hospital but with a different outcome. Community controls are selected from the community that surrounds the hospital. Weakness of community controls; anyone could go to a hospital.
  • multiple controls per case. Improves the ability to determine a difference in risk profile. Get an improvement for every additional control per case up to 3 or 4, then there’s no improvement.
  • matching: cases and controls are matches according to a particular characteristic that may be (strongly) related to the outcome. To reduce variation between groups except for risk factor.
  • negative controls: no-comorbidities.
134
Q

How might exposure be measured in case-controls?

A
  • thoroughly completed medical records or stored samples from lab analysis. Questioning of cases and controls or appropriate proxies (good interviewer and questions; recall and record bias).
135
Q

Advantages of case-controls? (similar to retrospective cohort study)

A
  • can look at multiple exposures. e.g. people with lung cancer. Can look at exposure to smoking, bacteria, air pollution, work exposure.
  • rare outcomes, common exposures.
  • relatively fast
  • relatively inexpensive
136
Q

Disadvantages of case-controls? (similar to retrospective cohort study)

A
  • dependence on appropriate control selection.
  • prone to bias.
  • not good for measurement of rare exposures.
137
Q

What is a nested case-control study?

A

A case-control study/ a group of cases and controls that have been drawn from a cohort population that has been followed for a period of time.
- Might look back at past medical records for additional information about exposure (on top of the findings of the cohort study).

138
Q

How might risk be estimated in case-control?

A

Can’t get a risk as subjects are recruited based on outcome.
Can compare relative exposure frequency in cases and controls.
Can use odds ratio

139
Q

Define odds ratio?

A

The odds that a case is exposed/ the odds that a control is exposed
or the proportion that the event does happen/ the proportion that the event does not happen

140
Q

What does an OR > 1 mean?

A

the frequency of exposure is higher among cases than controls: the greater the odds ratio, the stronger the association

141
Q

What does an OR

A

the frequency of exposure is lower among cases than controls: indicates protection

142
Q

The is OR approximately equal to RR?

A

when the incidence of disease is low (typically less than 10%). It depends on the size of the RR to actually determine how low the incidence must be before the OR is approximately equal to the RR.

143
Q

Which studies rely particularly on accurate diagnosis of disease?

A

Both cohort and case-control studies rely on accurate diagnosis of disease.

  • Assess diagnostic test.
  • Ability to correctly interpret a diagnostic test.
  • Ability to conduct a sequence of tests in order to rule in and rule out possible diagnoses until the correct diagnosis is reached.
144
Q

What are the two ways the effect of an intervention can be measured?

A
  • Observational: in a cohort study, intervention = exposure

- Experimental/Interventional: clinical trials

145
Q

What is a clinical trial?

A

prospective investigation comparing the effects of an intervention with a control
gives more information about causation than observational studies

146
Q

Define intervention re RCTs?

A

factors introduced into the lives of the research subjects and expected to change their outcomes

147
Q

Define controls re RCTs?

A

the status quo for research subjects that is not expected to change their outcome

148
Q

Describe RCTs

A
  • Select a sample from the population
  • Individuals are randomly assigned to groups
  • One group is given the treatment, the other is control
  • Check for outcomes in both groups.
149
Q

Advantages of RCTs

A
  • most scientifically rigorous design
  • provides the best evidence of causation
  • can give data when observational studies cannot
  • clear cut aims
  • clear cut endpoints
  • clean and precise (well matched groups, clear inclusion and exclusion criteria)
  • deals with incidental outcome-related factors, many other sources of bias
150
Q

Disadvantages of RCTs

A
  • the setting tends to be artificial (trade-off between closely matched groups and being able to apply that to a broader community)
  • adherence to protocols may be difficult
  • inclusion and exclusion criteria are still relevant
  • costly and resource-intensive
  • may require many subjects
  • ethical concerns preclude many experiments 1) patient sacrifice, 2) no patient can be exposed to significant risk
151
Q

What are some limitations of RCTs

A
  • the existence of several RCTs does not mean that a clinical question is settled
  • practical limitations may reduce feasibility
  • rare conditions
  • reluctance to allocate treatments randomly
  • long time before results are known
  • practice may be well-established
152
Q

What are some important design elements of RCTs

A
  • Ethics. Under what circumstances is it ethical to assign treatment at random, rather than as decided by patient and physician. Beneficence, non-maleficence, respect for autonomy (informed consent), justice (equity, impartiality, fairness). Equipoise: the randomisation is ethical if there is no compelling reason to think that one of the allocations is better than the other. They must know that they can withdraw from the study at any time. The trial must be stopped if there is enough evidence that it is effective, if there is significant harm, or if there is futility in continuing.
  • subject eligibility. Subjects are required to meet rigorous criteria, promotes homogeneity and strengthens the internal validity at the expense of external validity. Inclusions involve applying strict diagnostic criteria. Exclusions usually include co-morbidities, survival, contraindications, refusals and non-compliers
  • intervention. 1) generalisability. Is this intervention likely to be implemented in usual clinical practice. 2) complexity. does the intervention represent the normal complexity of the real-world treatment? 3) strength. Is the intervention sufficiently different from currently available alternative management strategies?
  • comparison groups. 1) no intervention group, 2) comparison group that is part of the group (Hawthorne effect - patients adjust their behaviour while being observed, patients who volunteer often work hard to make sure good results are obtained), 3) usual care group, 4) placebo treatment, 5) an alternative intervention - comparison may be patients who receive the current best treatment
  • outcome measurements. objective measurement?, If there are composite outcomes, is the effect size of each component assessed separately?, are the outcomes clinically important (not just statistically)?
153
Q

Hawthorne effect

A

cumulative change in the outcomes for a number of things. The total effects of treatment are the use of the contributions made by the natural history, the Hawthorne effect (effect that people in the study put in effort to get a good outcome), usual care, placebo effect and a new intervention *RCTs must be able to filter out all the above effects except for the new intervention

154
Q

How is treatment allocated in RCTs?

A
  • randomisation: provides the best opportunity to create truly comparable groups (baseline characteristics in Table 1 - ideally prognostic factors)
  • measures differences in the groups that may occur after randomisation: patients disease status, compliance, cross-over (patients move from one intervention group to another), co-interventions (patients receive a variety of interventions other than the one being studied - patient decides to take up an exercise regime)
  • blinding: can occur at various different levels in the study design
155
Q

What is blinding?

A

Strategy used to ensure participants are unaware of the treatment group they are in. Ensures behaviour and reporting does not systematically affect the results. Blinding can occur at 4 different places in an RCT: 1) after randomisation and before the treatment: investigators allocating participants to group aren’t aware which treatment is being assigned next, 2) patients or participants are unaware of the treatment group they are in, 3) physicians caring for people in the study may be unaware of the treatment the patients are receiving - patients are treated equally regardless of their intervention 4) assessment of the outcome: outcome measurements are assessed by people unaware of the treatment group the participants are in; preventing bias in measurement or analysis

156
Q

Treatment effects can be measured in 3 ways?

A
  1. Relative risk reduction
  2. Absolute risk reduction
  3. Number needed to treat
157
Q

What is relative risk reduction? What is the equation?

A

the amount by which the treatment has reduced the relative risk.
RRR = (control event rate - treated event rate / control event rate)
or 1 - RR; where RR > 1 = relative risk increase (RRI)

158
Q

What is the absolute risk reduction? What is the equation? (same as attributable risk - different contexts)

A
  • the absolute difference in rates in the control group compared to the treated (experimental) group
    = control event - treated event rate (benefit)
  • absolute risk increase = treated event rate - control event rate (harm)
159
Q

Define number needed to treat? What is the equation?

A
  • number of patients who would have to be given the experimental therapy in order to prevent one adverse event (e.g. death or other complication) from occurring, in a specified period.
    NNT = 1/ (control event rate - treatment event rate)
    NNT = 1/absolute risk reduction
    *number to treat should be rounded up, for whole people
160
Q

Define number needed to harm? What is the equation?

A

number of patients who, if they received the intervention would lead to 1 additional patient being harmed, during a specified period.
NNH = 1 / (treatment event rate - control event rate)
NNH = 1 / absolute risk increase

161
Q

Define intention to treat analysis

A
  • Principle of analysing outcomes of the RCTs according to the groups that participants were originally assigned in regardless of whether they underwent the intervention.
  • Advantages: provides information to the clinician about what treatment choice is best at the time the decision is made? - do you offer the treatment? Important that groups are analysed according to how they were originally randomised in order to retain full benefit of randomisation.
  • Disadvantage: can mask the effectiveness of the treatment.
162
Q

What are some types of RCTs?

A

parallel group studies, clustered studies, cross-over studies, factorial design, N-of-1 trials, sequential trials

163
Q

What occurs in a parallel group study (RCT)

A

two or more groups are treated separately or concurrently

164
Q

What is a clustered RCT?

A

naturally occurring groups are randomised and outcomes are counted in patients according to the treatment their group was assigned e.g. randomising patients according to the ICUs that they were admitted into, might be different infection rates in different hospitals.

165
Q

What is a cross-over design?

A

Patients are divided into 2 or more groups. Each group starts with a different treatment. After a time (washout period), treatment assignments change. Effectively, each patient is exposed to the intervention and the control. Requires fewer patients.
e.g. drugs with a short half-life.

166
Q

what is a factorial design?

A

E.g. Group 1 = surgery A drug A, group 2 = Surgery A, drug B, group 3 = surgery B drug A, group 4 = surgery B, drug B
Allows the evaluation of many combinations of treatments simultaneously. Complex, but cost effective.

167
Q

What is a N of 1 trial?

A

= random collection of periods of active and placebo therapy for a single patient. (washouts, random order)

  • according to what is best for the patient
  • must be blinded (patient and physician)
  • evaluates the efficacy of the treatment for long term use
  • treatment must be reversible to allow for repeated treatment
  • outcome is often patient preference or symptom score
  • useful for determining treatment where the activity of disease is unpredictable e.g. migraines, asthma, fibromyalgia
168
Q

What is a sequential trial?

A
  • patients are allocated to whichever treatment has been most successful in previous patients, not commonly used
  • usually reserved for dangerous therapies, vulnerable patients, or deadly or rare conditions
  • random allocation to group A or B; A has the better outcome so then the randomisation is repeated but weighted so that more participants are in group A then the outcome is measured
169
Q

What is a systematic review?

A
  • the identification, selection, appraisal and summary of primary studies addressing a focussed clinical question using methods to reduce the likelihood of bias
  • or an unbiased summary of available evidence of a given topic
  • reproducible
170
Q

What are the methods used in systematic reviews? (6)

A
  • Define the question
  • conduct the literature search
  • apply inclusion and exclusion criteria
  • assess the quality of the studies
  • conduct analysis
  • summarise the evidence
171
Q

What is important in ‘define the question’

A
  • Use the PICO method
  • specify inclusion and exclusion criteria
  • state the methodology: time period of the review, language (might include only english written articles - expenses associated with translation), publication restrictions (using peer-reviewed journals, using popular news sources?, databases used?)
172
Q

What does PICO stand for?

A
fundamental principle in evidence based medicine. An answerable question. Careful inquiry into precise healthcare problems. 
P: patient/population
I: intervention / exposure
C: comparison / control group 
O: outcome
173
Q

What is important in ‘conduct a literature search’

A
  • Decide on information sources: databases accessed, experts consulted, funding agencies, pharmaceutical companies, hand-searching, personal files, Cochrane database, citation list or retrieved articles
  • identify titles and abstracts
174
Q

What is important in ‘applying inclusion and exclusion criteria’

A
  • usually done by a minimum of 2 or 3 researchers
  • pre-set inclusion and exclusion criteria
  • apply inclusion/exclusion criteria to titles and abstracts: obtain full articles for eligible titles and abstracts, apply inclusion/exclusion criteria to full articles, select final eligible articles - assess agreement on study selection
175
Q

What is important in ‘assessing the quality of the study’? (critical appraisal)

A
  • data abstraction: participants, interventions, comparison intervention, study design - make sure they all fit with the methodology
  • results
  • assessment of methodological quality
  • assess agreement on validity assessment
176
Q

What is important in ‘conduct analysis’?

A
  • determine methods for generating the pooled estimates
  • generate the pooled estimates (where appropriate)
  • explore heterogeneity, conduct subgroup analysis
  • explore the possibility of publication bias (publications that only publish positive results)
177
Q

What is important in ‘summarise the evidence’

A
  • Forest plots: shows the point estimate of effectiveness and CI for each study in the review
  • boxes represent the point estimates with the size of the box being proportional to the size of the study (the bigger the box the greater the participants in the study)
  • vertical line marks the point where neither the intervention nor the placebo was more effective
  • horizontal line marks the CI
178
Q

What are some of the features of a Forest plot?

A

summarises

  • number of studies: the rows show the number of studies met the inclusion and validity criteria
  • what studies and when: the first column represents the study and the year of publication
  • pattern of effect sizes: the point estimates taken as a whole, show what the various studies reported for effect sizes.
  • precision of estimates: the vertical line through the boxes.
  • the effects for the “big” studies: the large statistically precise studies (seen by narrow CI and large boxes) deserve more weight than the small ones
179
Q

What is a meta-analysis?

A

statistical technique for quantitatively combining the results of numerous studies measuring the same outcome (- studies must be similar-) into a single pooled or summary estimate
- combines the results from individual studies to increase power
- must have homogeneity of subjects, interventions and outcome measures
(-often in SR)

180
Q

What are other types of systematic reviews?

A
  • literature review: an overview (not reproducible, can be different), not comprehensive
  • a narrative/unsystematic review: discussion piece, doesn’t look at quality of study or do quantitative analysis
181
Q

What is the Cochrane collaboration?

A

worldwide collection of systematic reviews for medical interventions using RCTs

  • set up by Archies Cochrane (doctor/epidemiologist); by not having a repository where summaries of these studies were, the profession was not taking advantage of the relevant RCTs, had no systematic process to put them into practice
  • study resources are always limited, if there were properly designed evaluations in one repository, then the studies could be applicable and used in the medical profession
182
Q

What does public health have to offer?

A
  • descriptions of the health status of populations
  • causation
  • evaluation of interventions
  • natural history
183
Q

define epidemiology

A

study of distribution and determinants of health-related states or events in specified populations, and the application of this study to control health problems

184
Q

define clinical epidemiology

A
  • defined as the science of making predictions about individual patients by counting clinical events in groups of similar patients and using strong scientific methods to ensure that the predictions are accurate
    the application of epidemiological knowledge, reasoning and methods to study clinical issues and improve clinical care
185
Q

What is meant by frequency?

A

How often does the disease occur?

186
Q

What is meant by abnormality?

A

is the patient sick or well?

187
Q

What is meant by risk?

A

What factors are associated with an increased risk of disease?

188
Q

What is meant by prognosis?

A

what are the consequences of having the disease?

189
Q

What is meant by diagnosis?

A

How accurate are tests used to diagnose disease?

190
Q

What is meant by treatment?

A

How does treatment change the course of disease?

191
Q

What is meant by prevention?

A

Does an intervention on well people keep the disease from arising? Does early detection and treatment improve the course of disease?

192
Q

What is meant by cause?

A

What conditions lead to disease? What are the origins of disease?

193
Q

What are the 5D’s or important events in clinical medicine?

A

death - bad outcome if untimely
disease - a set of symptoms, physical signs and laboratory abnormalities
- discomfort - symptoms such as pain, nausea, dyspnea, itching and tinnitus
- disability - impaired ability to go about usual activities at home, work or recreation
- dissatisfaction - emotional reaction to disease and its cause, such as sadness or anger

194
Q

What is evidence based medicine?

A
  • conscientious, explicit and judicious use of current best evidence in making decisions about the care of individual patients
    the application of clinical epidemiology to the care of patients
195
Q

What are the three key parts to making a clinical decision?

A

best available evidence, clinical expertise and patient preferences

196
Q

What are the two fundamental principles of Evidence Based Medicine?

A
  1. EBM builds a hierarchy of evidence
  2. Evidence alone is not sufficient to make a clinical decision. Also needs to be considered:
    - benefits vs. risks
    - inconvenience
    - costs
    - alternative management strategies
    - patients values and preferences
197
Q

Skills needed to practice EBM?

A

diagnostic expertise, in-depth background knowledge, effective searching skills, effective critical appraisal skills, ability to define and understand benefits and risks of different treatments, in-depth physiologic understanding allowing application of evidence to the individual patient, sensitivity and communication skills required for full understanding of the patient context, ability to elicit and understand patient values and preferences and apply them to management decisions

198
Q

Basic principles of EBM?

A

variables, numbers and probabilities, populations and samples, bias and confounding, validity and chance

199
Q

How can you describe a disease?

A

Person - Who is becoming ill
Time - when are they becoming ill
Place - where are they becoming ill

200
Q

Characteristics of time?

A
  • changing or stable?
  • clustered (epidemic) or evenly distributed (endemic)?
  • time-trends: point source, propagated, seasonal, secular, combinations
201
Q

Characteristics of place?

A
  • geographically distributed or widespread (outbreak, epidemic, pandemic) offshore (tsunami), travel
  • climate effects (temperature, humidity, combined effects) e.g. probably not zika virus in Australia because of different mosquitoes
  • urban / sub-urban squatter/ rural
  • relation to environmental exposure (water, food supply)
  • multiple clusters or one?
202
Q

Characteristics of person?

A

age, gender, socioeconomic status, marital status, ethinicity/race/genetic profile, behaviour/habits

203
Q

Define case definition?

A

a set of diagnosis criteria that must be fulfilled in order to identify a person as a case of a particular disease

  • critical component of descriptive epidemiological study
  • provide background, clinical criteria, laboratory criteria - do these fit with the patient
    • there are layers in case classification: probable and confirmed
204
Q

Define variables?

A

measurements of characteristics which have the potential of varying among observations or in the same observation over time.

205
Q

Define data?

A

collection of observations and variables on which inferences are drawn

206
Q

What is nominal/categorical data?

A

Groups are mutually exclusive.
Neither rank nor magnitude.
e.g. religion, race, sex, blood type

207
Q

What is ordinal data?

A

at least 3 options.
rank, not magnitude.
e.g. small, medium, large; pain rating; heart murmurs grades I-IV

208
Q

What is interval data?

A
  • ratio (interval) value has a true zero e.g. temperature, time, length, volume
    both rank and magnitude.
    e.g. age, body weight, degrees celsius
209
Q

What is validity/accuracy?

A

degree to which the data measures what it is intended to measure.

e. g. physical examination, standard protocol is supported with another finding (blood test) to see if they are both in agreement
- accuracy of measuring pain, depression difficult
- results are grouped around the true value

210
Q

What is reliability? (or reproducibility or precision)

A

extent to which repeated measurement of a stable phenomenon (measured by different people, instruments, times) get similar results
- results are grouped around a narrow range

211
Q

When might variation occur?

A
  • biological variation within and between individuals

- measurement variation between instrument and observer

212
Q

Define, adv and disadv of mean?

A

= sum of values of observations / number of observations

  • well suited for mathematical manipulation
  • affected by extreme values
213
Q

Define, adv and disadv of median?

A

= the point where the number of observations above equals the number below

  • not easily influenced by extreme values
  • not well suited for mathematical manipulation
214
Q

Define, adv and disadv of mode?

A

Most frequently occurring value

  • simplicity of meaning
  • sometimes there are no, or many, most frequent values
215
Q

What are mean, median and mode measures of?

A

central tendency

216
Q

Define, adv and disadv of range?

A

from lowest to highest values in a distribution

  • includes all values
  • greatly affected by extreme values
217
Q

Define, adv and disadv of SD?

A

the absolute values of the average difference of individual means from the mean

  • well suited for mathematical manipulation
  • For non-Gaussian (“normal”) distributions, does not describe a known portion of observations
218
Q

Define percentile, decile, quartile

A

the proportion of all observation falling between specified values

  • describes “unusualness” of a value. Does not make assumptions about the shape of the distribution
  • not well suited for statistical manipulation.
219
Q

2 ways of describing data? (reproducibility and accuracy)

A

Central tendency (how accurate) and dispersion (how reproducible)

220
Q

What does a positively skewed curve look like?

A

the tail is towards the positive end

221
Q

What does a Kurtic distribution look like?

A

widespread with big peak, seen in >20,000 patients

222
Q

What does a Platykurtic distribution look like?

A

widespread, flat top, seen in

223
Q

Define normal distribution?

A

describes the frequency distribution of a repeated measurement of the same physical object measured with the same object
*clinical distributions are not normal therefore 2SDs doesn’t collect 95% of variables

224
Q

How is the distinction between normal and abnormal?

A

not distinct, there may be overlap between normal distribution and abnormal distribution

225
Q

How to classify ‘abnormal’?

A

unusual, associated with the disease, treating the condition leads to a better outcome

  • functional decline: ability to do everyday activity vs. U-shaped BMI against mortality rate (lower BMI, higher mortality due to muscle wasting from sickness)
  • depends on whether BMI relating to mortality or BMI relating to functional decline?
226
Q

Difference between linear regression and correlation?

A

Linear regression finds the best line that predicts Y from X. Correlation is almost always used when you measure both variables. It rarely is appropriate when one variable is something you experimentally manipulate. Linear regression is usually used when X is a variable you manipulate (time, concentration, etc.)

227
Q

Difference between primary, secondary and tertiary prevention?

A

Primary prevention aims to prevent disease or injury before it ever occurs e.g. health promotion re drinking, smoking.
Secondary prevention aims to reduce the impact of a disease or injury that has already occurred i.e. have early disease e.g. screening detects presence of disease in healthy people.
Tertiary prevention aims to soften the impact of an ongoing illness or injury that has lasting effects - targets those with fully symptomatic disease e.g.surgery on people with coronary heart disease to prevent future events.

228
Q

What is the general proportion used in expressing clinically relevant frequencies?

A
  • numerator is the number of patients experiencing the event

- denominator is the number of people in whom the event could have occurred (population)

229
Q

What is point prevalence?

A

prevalence is measured at a single point in time for each person

230
Q

What is period prevalence?

A

prevalence present at any time during a specified time period

231
Q

equation for incidence?

A

number of people who develop disease in one year / average number of people in the population

232
Q

What are other ways of measuring frequency?

A
mortality (death rate per 10^n)
morbidity
case-fatality rate
survival rate
cause-specific
attack rate
233
Q

What are ways burden of disease might be expressed?

A
life expectancy 
potential years of life lost (PYLL) 
quality-adjusted life years (QALY)
health-adjusted life expectancy (HALE)
disability-adjusted life years (DALY)
234
Q

Define risk?

A

the likelihood or probability of a positive or negative event occurring

235
Q

Define risk factors?

A

characteristics associated with an increased risk of an event occurring

236
Q

Define exposure?

A

exposure to a risk factor means that a person, before becoming unwell has either come into contact or manifested a certain characteristic that clearly increases the probability of a particular outcome (disease), that for the purposes of the study is the characteristic of interest

237
Q

How might exposure be measured?

A

duration of exposure, current dose, largest dose taken, total cumulative dose, years of exposure, years since first exposure

238
Q

Define latency period?

A

time between exposure and first manifestation of disease

239
Q

Often, the causes of disease are obscured in poor recollection or complex interaction between physiology, behaviour, society and exposure

A
  • common exposure to risk factors: cigarette smoking, high salt high fat diets, alcohol exposure
  • low incidence of disease: e.g. lung cancer most common cancer but still a very low incidence - average GP clinic will see relatively few cases of lung cancer, difficult to draw inferences from infrequent events
  • to detect a small risk, the number of people observed needs to be large enough to see the risk and see difference in exposed and unexposed group
  • for common risks, difficult to disentangle the independent effects of risks, hard to correlate between exposure and disease
  • multiple causes and effects: usually not a close 1 to 1 ratio between a risk and disease *one risk factor might contribute to multiple diseases e.g. stroke
240
Q

Define cause (of a disease or event)?

A

is an event, condition or characteristic (or a combination of these factors) which plays an important role in producing the health outcome (disease)

241
Q

What is a sufficient cause?

A
the factor (or combination of factors) that if present, will inevitably result in disease 
*each sufficient cause has a necessary cause as a component
242
Q

What is a component cause?

A

a factor that contributes to disease causation, but is not sufficient to cause disease on its own

243
Q

What is a necessary cause?

A
any factor (or combination of factors) that is required for the development of disease. Disease can never be present when the factor(s) is absent. 
*have to have the factor, but they may not have the disease
244
Q

Types of factors in causation?

A

Predisposing: age, sex
Enabling/disabling: low income or nutritional status
Precipitating: exposure to risk factor
Reinforcing: repeated exposures

245
Q

What is descriptive epidemiology?

A

branch of epidemiology that describes the amount and distribution of diseases within a population

246
Q

Features of ecological studies?

A
  • special because of time or geography
  • examine relationship between groups of individuals with exposure to a putative risk factor and an outcome
  • exposures are measured at the population, community or group rather than the individual level
  • provide information about an association: may generate or support existing hypotheses
  • prone to bias: ecological fallacy - relationships observed in groups won’t necessarily hold true for the individual
  • can be quickly done, difficult to interpret, not good for causation, may establish links
247
Q

Features of cross-sectional study design?

A
  • the observation of a defined population at a single point in time or during a specific interval. **Exposure and outcome are determined simultaneously.
  • quick and relatively easy
  • useful for determining: prevalence of risk and frequency of prevalent cases
  • prone to bias
  • not useful for rare outcomes or exposures, events of short duration
  • no information on temporal relationship between exposure and outcome
248
Q

Features of case-control study design?

A
  • patients are sampled by outcome; subjects are recruited on the basis of disease (outcome)
  • compared to a group (control) without the outcome with respect to the specific exposure
249
Q

Advantages of case-control study design?

A
  • good for rare outcomes and common exposures
  • relatively fast
  • relatively inexpensive
  • assess multiple exposures
250
Q

Disadvantages of case-control study design?

A
  • depend upon appropriate controls selection
  • prone to bias
  • not good for rare exposures
251
Q

Define controlled clinical trial?

A

participants are assigned to groups and the investigator specifies the exposure for each participant group. Followed over time and outcome is measured and compared between the two groups.

252
Q

Define randomised controlled trial?

A

A clinical trial where the assignment of participants to a group is at random

253
Q

What are measures of association?

A

a wide variety of coefficients that measure the statistical strength of the relationship on the variable of interest
e.g. absolute risk, attributable risks, relative risks, odd ratios, population attributable risks, population attributable fraction

254
Q

What are the four ways to measure risk?

A
  1. Have been exposed and have the outcome
  2. Have been exposed and don’t have the outcome
  3. Haven’t been exposed and have the outcome
  4. Haven’t been exposed and don’t have the outcome
255
Q

What is absolute risk?

A

the probability of an event in a population under study

256
Q

Define attributable risk? What are its synonyms?

A
  • the additional risk (incidence) of disease following exposure, over and above that experienced by people who are not exposed
  • the risk of disease in exposed individuals that can be attributed to the exposure
  • attributable risk = incidence in exposed - incidence in unexposed
    = risk difference / absolute risk reduction (ARR) or absolute risk increase (ARI)
257
Q

Define epidemic

A

the occurrence in a community or region of cases of an illness, specific health related behaviour or other health-related events, clearly in excess of normal expectancy

258
Q

Define endemic

A

the constant presence of a disease or infectious agent within a given geographic area or population group

259
Q

Define pandemic

A

the epidemic affects a large number of people and crossed international boundaries

260
Q

Define cluster

A

an aggregation of relatively uncommon events (or disease) in a space and/or time in amounts that are believed or perceived to be greater than could be expected by chance

261
Q

How does an epidemic come about?

A
  • new appearance of, or sudden increase in, an agent from the environment or from an infected source e.g. change in migration patterns of birds
  • arrival of new susceptibles e.g. decreased immunisation
  • introduction of effective route of transmission e.g. climate change bringing new mosquito species
262
Q

steps in outbreak investigation

A
  • prepare for fieldwork: prepare for the unexpected, immediate response, limited investigation window, research risk factors and previous outbreaks of the disease, make administrative arrangements like supplies, equipment and personnel
  • establish the existence of an outbreak (provisional case definition) e.g. is there a common cause, are there more cases than expected, are there relationships among cases, or are these sporadic, unrelated cases of the same disease, are these cases of similar but unrelated diseases, what constitutes occurrences clearly in excess of normal expectancy - depends on lots of factors inc. temporal variation; - causes of false alarms, change in surveillance or reporting policy, change in case definition, improved diagnosis, improved diagnosis, errors in diagnosis (false positives), increased public awareness and demand for tests, increased reporting
  • verify the diagnosis: take history for clues, establish clear picture of events, speak directly with people who are affected, confirmation using other diagnostic tests
  • define and identify cases: establish/refine a case definition - start broadly and simply, clinical criteria, epidemiological components, laboratory investigations, refine with multi-stage definitions as investigation progresses, identify and count cases
  • characterise the outbreak: undertake descriptive epidemiological techniques to orient the outbreak in terms of person (demographic characteristics and risk factors - denominator problems: who is at risk), place (maps) and time (epidemic curves - provides a clear temporal picture, can characterise the type of outbreak and help identify putative agents, graph frequency by time): 5 W’s What (diagnosis: symptoms, microbiology), who (person: age, sex), When (time) - background rate, time of onset, temporal trends, Where (place) - geographic extent, clustering, Why (causes) - exposures and risk factors
  • develop hypotheses: data collection through interviews and questionnaires, is the hypothesis feasible
  • evaluate hypotheses
  • refine hypotheses: analytic techniques - descriptive statistics, simple measures of associations (RR, OR, RD) with CI, stratified and multivariate analysis, appropriate interpretation of causal relationships, consideration of new hypotheses if needed
  • implement control and prevention measures: often happens concurrently, should occur as soon as information is available, actions to prevent further spread are implemented based on suspected or verified hypotheses, often can’t wait until all facts are established, media and information outlets are important vehicles: 4 broad control measures - 1) sanitation, 2) prophylaxis, 3) diagnosis and treatment, 4) control of disease vectors
  • communicate findings: keep information flowing during an investigation, clearly state the facts, beware of legal implications, aim to improve practice, have a single spokesperson, effective public relations strategy at the time of an outbreak is critically important
263
Q

What does an epidemic curve look like when there is a point source?

A

epidemic curve has a steep upslope and a gradual downslope

264
Q

What does an epidemic curve look like when there is a continuous extended source?

A

Epidemic curve has a sharp upswing, followed by a plateau, then a right tail

265
Q

What does an epidemic curve look like when there is a propagated/contagious source?

A

Serial transmission leads to an epidemic curve with progressively taller peaks

266
Q

Use of cohort studies in epidemics?

A
  • follow a large group of exposed and unexposed people, assume a strong suspicion of the source
  • calculate RR from the attack rates in exposed and unexposed people
  • attack rate = terms for CI in outbreaks
267
Q

Use of case-control studies in epidemics?

A
  • compare exposure among cases and controls
  • analysis involves calculation of OR
  • ideal in clearly defined single outcome, rare disease; large or unidentified population at risk of disease, multiple exposures suspected
  • disadvantages include choice of controls, recall bias
268
Q

What is confounding?

A
  • when the causal relationship is confused because of the presence of an additional factor of interest that is associated with both the cause and the outcome
  • present when you stratify or adjust for a potential confounder and the effect estimate changes by 10%: if the crude and adjusted estimate effects differ by 10%
  • e.g. exercise to obesity prevention; possible confounders are age, sex, BMI, smoking, family history, diet
269
Q

Which confounders are unlikely to have much effect on the results of the study?

A
  • if the association between the confounder and either the exposure or the outcome is weak
  • if the confounder is rare; if more people in the study will not be exposed or many of the people in the study are exposed
  • those confounders to take of are thus relatively common and strongly related to exposure and outcome
270
Q

What are ways to control confounding?

A

implemented during design phase:
- randomisation: patients randomly assigned into intervention/control
- restriction: limiti the number of patient characteristics
implemented during data analysis:
- matching: match each participant in the intervention group with a participant having the same characteristic in the control group
- stratification: data is analysed and results are presented according to sub-group analysis
- standardisation: mathematically adjust the crude rate for a confounding variable so that the relative weight for that confounding variable is the same in each group.
- multivariable adjustment: mathematically adjust for difference in large number of variables simultaneously e.g. logistic regression or cox proportional hazards
- best-case/worse-case analysis: describe how the results would differ under extreme assumptions of selection bias

271
Q

Features of effect modification (or “synergism or antagonism” or “interaction”)?

A
  • occurs when the presence or absence of an additional variable changes the effect of exposure on disease: the variable has an association between the exposure and disease but it increases or decreases the effect
272
Q

How does effect modification differ from bias and confounding?

A
  • bias needs to be removed or prevented
  • effect modification is therefore a finding to be reported
  • epidemiological analysis is generally aimed at eliminating confounding and discovering and describing effect modification