Epidemiology Flashcards

1
Q

Descriptive study definition including study types

A

Measure the occurrence of outcomes.

Can be split into either populations or individuals.

Individuals - case reports, case studies, case series, surveillance and prevalence cross-sectional studies

Populations - Ecological studies

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Analytical study definition

A

Test the association between exposure and outcome

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

How can you measure the distribution of disease?

A

Time - year, season, day, hour
Place - country, region, district
Person - age, sex, social class, lifestyle

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Four commonly used sources of data

A

Routine statistics
Population censuses
Surveys
Special studies

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

5 routine statistic sources

A

Death certificates
Birth records
Special disease registers - cancer registries
Communicable disease reports
GP records

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is an ecological study? (at least 4)

A

An observational design study (no treatment)
Use routinely collected data
Based on groups - not individuals (group is unit of observation), not possible to link exposure to his/her outcome
Uses correlation coefficient (r)
Useful for generating hypotheses, not useful for true exposure risk at individual level.
Can be useful at looking at group level disease e.g. schools

An example could be air pollution and mean bmi of an area

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Why use an ecological study?

A

to investigate aetiology and risk factors for disease or evaluate changes in health care policy
generate hypotheses
estimate prevalence

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is the definition of ecological fallacy?

A

Associations at population level do not imply association at an individual level

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What are the strengths of ecological studies? 8 marks

A

Quick and cheap
Use available data e.g. routine stats
Some factors operate at population level e.g. air pollution
Some exposure data only available at population level
Differences in exposure between areas may be larger than those between individuals in one area
Ability to map ecological data
Can generate hypotheses
Random errors may be smaller for populations than individual exposures

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What are the weaknesses of ecological studies?

A

Data may be collected or recorded differently in different places
Surrogate measures based on the average of the population
Spatial boundaries are artificial
Confounding (lack of data)
Could use proxy measures
Classification challenges
Ecological fallacy
Uncertainty in temporal relationship
Collinearity in variables (i.e. your variables are too similar)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is a cross-sectional study? (8)

A

An observational study
Carried out at a single point in time
Snapshot of population health
Collects individual data
Can measure prevalence, not incidence
Cannot prove cause and effect
Good at generating hypotheses
Can be descriptive or analytical (descriptive will describe the data, analytical will investigate risks factors and outcomes, collecting data on outcomes and exposures at the same time)
Typically use surveys to gain data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Examples of national CS studies

A

Surveys - census, national survey for Wales, national attitudes and lifestyle survey

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

CS design

A

Population has a representative sample (if not using whole pop).

Descriptive:
Then you need a number with disease or exposure and then a number without disease or exposure.

This will allow calculation of prevalence of disease

Analytical:
Will need number with/without disease and then with/without outcome, so 4 sample groups vs 2 groups.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

CS Study and temporality

A

CS study - difficult to measure temporality, chicken and egg scenario. This temporality issue depends on the exposure e.g. a genetic factor does not vary over time whereas exercise levels will change

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Strengths of CS studies

A

Quick
Cheap and simple
Good for chronic diseases
Data on individuals e.g. questionnaires
Can estimate prevalence
Can assess many outcomes and risk factors
Can generate hypotheses

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Weaknesses of CS studies

A

No good for acute diseases
No good for rare diseases
Prone to bias
Need high participation rates to be valid
Only a snapshot
Cannot infer temporal or causal relationships

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Bias definition

A

Bias is a consequence of defects in the study design or execution of a study and cannot be controlled for by statistical measures and often cannot be mitigated by increasing sample size.

It is any systematic error in an epidemiological study that results in an incorrect estimate of the association between exposure and outcome

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Name two types of bias

A

Selection bias (differences between groups) in how study subjects are chosen or respond

Information bias (difference between groups) in the accuracy of data on exposure/outcome

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

CS studies key metholodigical concerns

A

Validity and repeatability
Response rates (non-response)
Sampling - how representative of the true population is the sample? (sample size calls done?)
Association is NOT the same as causation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

Definition of confounding

A

A spurious relationship between exposure and outcome due to the presence of another variable which is associated with both the exposure and the outcome

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

How to control confounding?

A

By study design methods:
- randomisation (only in intervention designs)
- restriction (limit entrance to those within specific categories (e.g. age group)
- matching e.g. age or sex

By analysis methods
- stratified analysis
- multivariate analysis (can control simultaneously for several factors)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

The goal of statistical analysis in the context of sampling is…

A

If we wish to say something about an attribute of the population, we take a sample so we can make inferences back to the population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

Sampling in observational vs intervention

A

In observational you sample participants to observe

In intervention you allocate participants to observe

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

Why do we sample?

A

Not possible to collect info from all subjects
Sample can provide reliable info on the population by:
- estimating important population parameters (means, proportions etc.)
- to infer toward the populations (using valid distributional assumptions)
- present inferences using estimates, confidence intervals, hypothesis tests (p values)

We do this to potentially minimise bias

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

What is non-random sampling?

A

Non-probabilistic
Judgement (purposive) - selection based on personal (expert) belief about representativeness
Accessibility (convenience) - select the most easily obtained participants
Quota - judgement and accessibility used to achieve specifies group sizes

It does not involve random selection
May or may not represent the population well
Used when researcher lacks a sampling frame for the population
These methods may introduce bias

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
26
Q

What is judgement or purposive sampling?

A

Sample selected based on (subjective) judgement of researcher
This is often used
It is a type of non-probability sampling
You are not attempting to be generalisable
Potential for bias is high

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
27
Q

Examples of non-random sampling

A

Quota sampling - people on the street, widely used in market research, strata corresponding to different characteristics e.g. age, sx, race

No sampling frame
Problem -> stratum may not be random

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
28
Q

Pro and con of non-random sampling

A

Con - selection units up to investigator - high bias

Pro - convenient, less expensive than probability sampling

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
29
Q

Random sampling is…

A

Selection of participants from the population is random. It is any method that utilises some form of random selection.
Different units in the population should have a known probability of being chosen.

AKA probability sampling

Removes the possibility of bias

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
30
Q

Sampling unit definition

A

The elements being sampled (e.g. the postcode rather than the individual)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
31
Q

Sampling frame definition

A

the list of all units able to be sampled (the list of all postcodes included)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
32
Q

Sample definition

A

Collection of sampling units drawn from the sampling frame

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
33
Q

Sources of error in sampling

A

Estimate = true value + random error + systematic error (bias)

Sample size (n)
Sampling design
Non-response (unit or item)

Measurement:
- observer or interviewer
- participant or respondent
- instrument e.g. questionnaire
- mode of administration

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
34
Q

Does bias effect accuracy or precision?

A

Accuracy - accuracy is crudely defined as the difference between the population value and sample estimate

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
35
Q

Precision definition

A

How closely repeated measurements or observations come to duplicating measured or observed values

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
36
Q

Ways of sampling (replacement)

A

With replacement - the chance of selecting the each unit does not change from selection to selection. The same unit could be selected more than once.

Without replacement - the chance of selecting each unit does change from selection. under this strategy the same unit may not be selected more than once.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
37
Q

Simple random sampling

A

gold standard
everyone has equal property
n/N could be 100/100,000 therefore 1/1000 probability to be picked.
This is sampling without replacement
True SRS can be expensive but has low bias, might be inefficient to do as well

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
38
Q

How to do simple random sampling

A
  1. obtain a sampling frame that includes all units
  2. number each unit 1-N
  3. generate n random numbers between 1 and N
  4. if any random number selected is greater than N or already selected ignore it
  5. select each unit corresponding to the random numbers
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
39
Q

Systematic sampling process

How does it compare to simple? (1)

A

may be more convenient than simple

process
1. put numbers in a sequence
2. divide the study population size by required sample size to determine interval size
3. choose a random number between 1 and k
4. start at random number and add part 2 each time

e.g. n/10,000 - 200/10,000
pick random number for k, 200/k
if k = 27, 27+50 = 77, 77+50 = 127

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
40
Q

Stratified random sampling

A

We may want independent results for different sub-population e.g. sex, age or location

Simple random sampling are taken for each strata

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
41
Q

Advantages of stratified sampling

A

increases the chance that sample mean is a precise estimate (e.g. less random error) of the population mean because the sample resembles the study population better

useful if interested in sub-pops

strat samples useful when interested in comparing groups

can use different sample designs if different strata

administrative convenience

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
42
Q

cons of stratified sampling

A

size of the strata in the pop must be known

information on strata characterstics must be known in advance

a weighted analysis is required if not proportionate to the population distribution of strata

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
43
Q

cluster sampling

A

populations may consist of large number of groups e.g. households, schools or hospitals, these can be referred to as clusters

we first take a simple random sample of n clusters and include all sample members of the chosen clusters.

the sampling unit is the cluster not the individual essentially SRS applied to groups

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
44
Q

cons of cluster sampling

A

hard to define cluster
less efficient than SRS
members of the same cluster tend to be more similar

Samples might not be independent:
- tends to be less precise
- requires a large sample than srs

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
45
Q

pros of cluster sampling

A

often cheap and convenient to draw sample

only for admin convenience
often we do not have info of the individuals only a group e.g. address of household

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
46
Q

two-stage sampling process

A
  1. draw up list of all first stage units (called primary sampling units)
  2. select random sample of PSUs
  3. for each PSU selected draw up a sampling frame of second-stage units
  4. select a random sample of the second stage units (SSUs)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
47
Q

Pros and cons of two-stage sampling (4)

A

can reduce cost if collecting data in person

usually less precise, have larger SEs

can be extended to 3 or more stages

req. specially weighted analysis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
48
Q

postal questionnaire as a mode of data collection pros and cons

A

relatively cheap and easy, able to cover wide geography, larger

poor response rate, can be completed by not the respondent, not possible in some settings, may appear probabilistic - but actually not

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
49
Q

by enumerator mode of data collection - pro and con

A

high response rate, often greater details given, able to take other non-recorded data e.g. samples

cons - v costly, enumerators can influence and bias responses, hard to over come geographical differences

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
50
Q

by the internet, mode of data collection - pro and con

A

very cheap and easy to collect

usually biased, non-probabilistic, difficult to analyse

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
51
Q

summary of sampling design

A

important to ensure good estimates of population parameters as they reduce bias (systematic error) and measure precision (standard errors). random sample reduces chance of bias. not always possible to do SRS so stratified, cluster and two stage is req. more complex designs tend to increase sample size

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
52
Q

are standardised death rates adjusted or unadjusted?

A

Adjusted by age

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
53
Q

What methods of standardising are there?

A

Direct and indirect

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
54
Q

What is indirect standardisation?

A

Age-specific death rates from a standard population are applied to the index population - provides a standardised mortality ratio and indirectly standardised rates. It does not require age specific mortality rates of the index population.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
55
Q

what is direct standardisation?

A

age specific rates are taken from the population that we are standardising and applied to a standard population.

Provides directly standardised rates. Age specific-rates are a necessity.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
56
Q

SMR equation

A

Total number of observed deaths / total number of expected deaths x 100 = SMR

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
57
Q

Steps in indirect standardisation

A
  1. apply age-specific rates from the standard population and calculate the expected number of deaths
  2. sum up number of expected deaths
  3. divide the observed deaths in the index population by the number of expected deaths
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
58
Q

How to interpret SMR

A

can only compare the index population with the standard population.

An SMR of 100 = similar expected rates
>100 = larger mortality rate
<100 = lower mortality rate
Should include a 95% CI

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
59
Q

Direct standardisation method

A
  1. Calculate local population death rates for each category
  2. apply the age specific mortality rates from the index population to the standard population to calculate expected number of deaths
  3. sum up expected number of deaths for the index population using the standard population age groups
  4. calculate overall age standardised mortality rates for the index
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
60
Q

Which standardised mortality ratio method is best?

A
  1. the direct method requires availability of age-specific rates for the study population of interest
  2. the indirect method requires the total number of cases in each population
  3. if numbers are small the indirect method is advisable
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
61
Q

demography definition

A

the study of characteristics of human populations

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
62
Q

Infant mortality definitions

A
  • Stillbirth – born after 24 or more weeks completed
    gestation and which did not, at any time, breathe or
    show signs of life
  • Perinatal – stillbirths plus early neonatal deaths
  • Early neonatal: deaths under seven days
  • Neonatal – deaths at under 28 days
  • Postneonatal – deaths between 28 days & one year
  • Infant – deaths under one year
  • Rates – neonatal, postneonatal and infant mortality
    rates are reported per 1,000 live births
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
63
Q

What are case control studies useful for?

A

Outbreak investigations
Studying uncommon diseases
Studying diseases with long latency
Generating and testing hypothesis

Quick and cheap to conduct

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
64
Q

Do case control cases start with people with the disease or without the disease?

A

With the disease

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
65
Q

How do you identify a case for a case control study? (6)

A

Need a precise case definition - must be objective with a validated test e.g. blood test, histology, X-rays or sonograms

It can be classified by diagnostic certainty e.g. confirmed, probable or possible

Whilst precise definitions can limit generalisability, it increases validity

Often req. inclusion/exclusion criteria (who, what, when, where)

Must report on numbers that meet exclusion/inclusion

Must decide whether incident or prevalent cases (i.e. all new cases within fixed time period, or taking all new and old ones)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
66
Q

Why would you use incident cases for a case-control study? (3)

A

Useful if exposure is associated with recovery or survival

Greater representativeness at its timely

Disease behaviour change less likely

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
67
Q

What are the pros and cons of using existing cases for a case control study? 1 and 3

A

Pro - can be used when its difficult to establish date of onset e.g. study of H.pylori infection

Con - may not be representative of all cases, pts with long course disease tend to be over represented and recall bias

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
68
Q

Case control studies - population vs hospital cases

A

Population based studies try and recruit all from a defined population over a fixed period of time. It requires tracing subjects and has issues with completeness and refusal to participate.

Hospital based studies require a clear case definition and protocol adherent to minimise bias as they are more prone to bias (note this won’t score a star in some systems)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
69
Q

How would you recruit controls to a case control study? What characteristics must they have? (5)

A

Controls should be free of disease and representative of the population.

They must have the potential to become diseased.

They must be from the:
- same source population
- same inclusion/exclusion criteria
- identified as cases if they had the disease while under investigation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
70
Q

What options are there for selecting controls? i.e. what types of groups of controls can be used

A

Population controls (random from registry/list/directory)

Neighbourhood controls

Friends/family controls

Hospital controls

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
71
Q

Pros and cons of using population controls

A

Can gain a random sample

Limitations: can be difficult and time consuming, healthy people may not participate, may be low response rate, selection bias (e.g. if using a register with phone numbers some may not have phone)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
72
Q

Neighbourhood controls pros and cons

A

Pro
No need for population register
Controls for social factors e.g. deprivation

Con
Poor cooperation
Can be time inefficient

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
73
Q

Friend and family control group pro and con

A

Pro - no need for pop registers, quick and efficient, easily to control for social factors so may reduce confounding

Con - overmatching, selection bias (i.e. family members may be different from gen pop)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
74
Q

Hospital controls pro and con

A

Pro - easy to identify, relatively cooperative

Cons - overmatching (more likely to be sicker and have higher risk factor exposures), selection bias (catchment population differs for different diseases)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
75
Q

What may affect your case:control ratio?

A

If ample cases can be 1:1 for sufficient power

If rare outcomes can go 1:1 - 1:4 , >1:4 is a waste of resources as minimal increase in statistical power

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
76
Q

How can we control for confounding in case control studies?

A

Study design methods
- randomisation (can’t do this one)
- restriction (can be done)
- matching (can be done, risk of over-matching)

Analysis methods
- stratification
- multivariable analysis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
77
Q

What is overmatching?

A

Matching is too close or elaborate. Controls become difficult to find and may fail to find a true causal association or may underestimate the association.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
78
Q

Odds ratio vs relative risk

A

Odds ratio is simply an association, with no indication of temporality. Incidence of disease is unknown. Relative risk is usually interpreted as the risk of having outcome of interest if you have the exposure i.e. if temporality is indicated. Odds tend to approximate OR. If the prevalence of the disease is <10%, the relative risk and OR can approximate each other.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
79
Q

What types of biases are case control studies known for?

A

measurement bias (observer, responder and limitations of instruments) and selection bias (selection of cases and controls)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
80
Q

Case control strengths

A

Good for study for rare diseases

Can use small sample size

Makes use of available data
Rapid
Low cost
Suitable for diseases with long latency
Can examine multiple exposures for a single disease

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
81
Q

Case control studies limitations

A

Cannot directly measure relative risk
Not suitable for rare exposure
Temporal relationship exposure-disease difficult to establish
Prone to several biases - selection of controls, recall when collecting data
Loss of precision due to sampling

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
82
Q

what is a cohort study?

A

an observational study which follows up two or more groups of people from exposure to outcome. A simple cohort study has an exposed or unexposed group. Participants that develop the outcome are recorded and rates in the two groups compared.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
83
Q

members of a cohort study must be…

A

free of disease at the start of the study and at risk of the outcome being studied.

for common exposures, cohort may be drawn from gen pop. for rare exposures, a specific group may be more suitable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
84
Q

Name two types of cohort studies and what they are

A

prospective cohort studies - start before outcome has occurred. cohort is disease free at the start of follow up. data is captured prospectively

retrospective (historical) cohort studies - start after outcome has occurred. relies on medical records or routinely collected data. data on measurements, exposure and outcome are collected retrospectively after the events have happened

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
85
Q

risk definition

A

the probability that an event will occur

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
86
Q

when is risk usually used as a method of association?

A

in RCTs and cohort studies

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
87
Q

relative risk definition

A

the risk of a particular outcome (Disease) when a particular exposure (risk factor) is present

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
88
Q

what is easier for people to understand, relative risk? or absolute risk?

A

absolute risk

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
89
Q

what is absolute risk denoted as?

A

I(subscore)e

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
90
Q

how is attributable risk measured?

A

Incidence in p - incidence in unexposed / incidence in the whole population

91
Q

What are population measures?

A

To assess the extra disease incidence in the whole study population that can be attributed to the exposure we can use measures of population impact

92
Q

Relative risk measures summary 3 things

A

measures strength of association between exposure and outcome

can be generalised to other populations

also known as risk ratio and rate ratio

93
Q

absolute risk measures summary 3 things

A

measure the impact of an association i.e. number of cases that could be prevented by eliminating the exposure

relies on baseline incidence in the unexposed which vary in different populations, therefore cannot be generalised

also known as excess risk, risk difference, rate difference, attributable risk

94
Q

why us a cohort study?

A

to measure incidence
find aetiology
quantify risk factors
describe prognosis
evaluate treatment outcome

95
Q

strengths of cohort study

A

can directly measure: incidence in exposed and unexposed groups, true relative risk

temporal relationship between exposure and disease is clear

can examine multiple effects for a single exposure

less prone to selection biases (outcome not known - prospective)

96
Q

weaknesses of cohort studies

A

require large sample size
not suitable for rare disease
not suitable for disease with long latency
problems with losses to follow up
can be difficult to measure multiple exposure
exposures may change over time
time-consuming and costly

97
Q

when is hazard ratio used?

A

in survival analysis, usually cancer trials

outcomes can be negative or positive (e.g. relapse vs remission, disability vs disease-free, death vs cure)

98
Q

what is a hazard?

A

an instantaneous event rate

the probability that a person will experience an event at a specific point in time (rather than cumulatively)

99
Q

what a hazard ratio?

A

the effect of a particular intervention on a particular outcome (negative or positive) per unit time

100
Q

what type of curve is a Kaplan Meier curve?

A

time to event curve

101
Q

how can hazard ratios be used?

A

to see whether a treatment shortens an illness duration

relative risk of a complication in treatment vs control

types of individuals more likely to experience an event first

102
Q

what are the five steps of evidence based medicine?

A
  1. ask
  2. acquire
  3. appraise
  4. apply
  5. evaluate
103
Q

Four things to consider when reading papers?

A

Does this study address a clearly focused question? (PICO)

Did the study use valid methods to address the question?

Are the valid results of this study important?

Are these valid and important results applicable to my patient or population?

104
Q

what tools can be used for critical appraisal?

A

Critical appraisal checklists, critical appraisal skills programme (CASP) checklists are one example but widely used

105
Q

what is a clinical trial?

A

a planned experiment on human beings which is designed to evaluate the effectiveness of two or more forms of treatment

106
Q

how many phases are there in a trial?

A

we start with can we give therapy to should we give therapy.

107
Q

what happens in phase 1 of a clinical trial?

A

testing in small group of people (20-80) to determine a safe dosage range and pharmacological effects of a drug. generally first time testing in humans

108
Q

what happens in phase II of a drug?

A

initial study of efficacy, dose or technique based on phase 1, collects adverse events data. some are randomised, other’s aren’t.

109
Q

what happens in phase III clinical trials?

A

full scale evaluation of treatment, study efficacy in large groups (100-1000s) comparing new intervention to a standard intervention. designed to detect a clinically meaningful difference. adverse event monitored. this is only done if 1and2 are okay though.

110
Q

what happens in phase iv of clinical trials?

A

post marketing surveillance, to assess longer term risks assoc. with intervention

111
Q

trials can be controlled or randomised what does this mean

A

controlled - the responses of a group of patients on the new treatment with a control group of similar patients receiving a standard treatment / placebo

randomised - each patient should be randomly assigned to a new treatment group or control group, for unbiased evaluation. there has to be a known, often equal chance of being assigned to each group

112
Q

what should be included a in protocol for an RCT

A

a blueprint for the study, who, what, when, where, why, how, how much

113
Q

what randomisation methods can be used for RCTs? give a small explanation of each method

A

simple randomisation (tossing a coin)
blocked (restricted) randomisation - used to keep number in each group close at all times
stratified randomisation - classed into subgroups (strata), random allocation in each subgroup
minimisation - adaptive design to ensure best possible balance at all times

114
Q

when randomisations fails in an RCT, what could have happened?

A

accidental bias, where a study fails to balance groups, which is more prone in small sample sizes

‘gaming’ the system when allocation is predictable e.g. using DOB for randomisation, small block sizes, non-opaque envelopes

115
Q

how might you conceal allocation in an RCT?

A

sealed, opaque envelopes
tables of random numbers
computer generated randomisation lists
central allocation (inc. telephone, web-based and pharmacy controlled randomisation)
sequentially numbered drug containers of identical appearance

116
Q

blinding definitions. follow up why do we blind?

A

keeps randomisation status secret

from patients (single blinded)
patients and clinicians (double blinded)
patients, clinicians and researchers (triple blinded)

psychological effects in patients, recording bias by clinicians

117
Q

in an RCT, what is primary and secondary outcomes

A

needs a clearly defined, measureable and objective measurement. this is used to determine sample size.

secondary outcomes need to be defined a priori to avoid fishing expeditions

118
Q

what type of analysis is done in RCTs? why?

A

intention to treat analysis.

this is designed to manage non-compliance, non-adherence and losses to follow up.

all randomised participants are analyses whether they completed/received treatment they were randomised to. failure to include these will lead to biases results.

119
Q

what are reasons for non-adherence and loss to follow up in an RCT?

A

side effects
forgetting to take meds
withdrawing consent
choosing alternative treatment

can lead to reduced statistical power and bias

120
Q

what are common sources of bias in RCTs

A

selection bias due to differences in baseline characteristics with respect to diagnosis

performance bias due to differences in case (other than treatment)

attrition bias (due to differences in withdrawal from trial)

detection (ascertainment) bias due to differences in outcome assessments

121
Q

what is a crossover design?

A

randomised to a sequence of treatments
each person serves as their own control.

washout period between the treatments. switching from placebo to treatment for example.

122
Q

what is factorial design?

A

multiple treatments occurring at once. could be placebo + some type of treatment or vice versa or any combination

123
Q

what are the ethical and methodological requirements for an RCT?

A

clinical equipoise (uncertainty in the expert community

informed consent

methodological rigour (statistical and operational)

registration and reporting via registries inc. CONSORT (inc. statement, checklist and flow diagram)

124
Q

what ethical considerations are there for an RCT?

A

is the proposed treatment safe?
can the treatment be ethically withheld?
are all potential participants suitable for randomisation?
is it ethical to use a placebo or use blinding?

may need to stop a trial early

125
Q

what data monitoring standards and agency is there

A

independent data monitoring committee which can recommend ending a trial

126
Q

strengths of RCTs

A

most reliable scientific evidence
high internal validity as exposure to treatment is random, potentially can eradicate bias and isolate the treatment effect

provide true measure of efficacy and allow for meta-analysis

127
Q

weaknesses of RCTs

A

expensive and time consuming
limited external validity (generalisability) strict eligibility or insufficient suitable participants

limited scope (difficult for rare events/distant outcomes)
conflicts of interest
ethical considerations

128
Q

why do we need systematic reviews?

A

> 2million article a year and >20k boomed journals
need for consistency and precision
informed decision making
inform research agenda

129
Q

what are systematic reviews?

A

review of published/unpublished studies
to identify, select, appraise and synthesise all relevant evidence to address a specific question. it is designed to be systematic and reproducible approach to minimise bias. It takes into account quality of evidence and may or may not include a meta-analysis

130
Q

What acronyms can be used to formulate a question for a systematic review?

A

PICO (population, intervention, comparison, outcome)
SPICE (setting, population, intervention, comparison, evaluation)
SPIDER (sample, phenomenon of interest, design, evaluation, research type)

131
Q

what should a search strategy include?

A

databases, study registers, references, key journals, contact with expert in the field need to be looked at.

  1. search general databases e.g. medline
  2. search specialised databases e.g. cochrane
  3. check reference list of key articles
  4. hand search key journals
  5. contact experts in the field
132
Q

how are studies selected for a systematic review?

A

eligibility of studies is checked independently by two reviewers and disputes req. a third reviewer. This is based via exclusion/inclusion criteria. A prisma Flow chart should be used to show excluded papers.

133
Q

what is the quality of individual studies assessed?

A

critically appraised and scored to assess quality and bias. a standard tool to assess quality is used e.g. Jadad scale, Cochrane risk of bias tool or a standard tool to assess the quality of evidence overall e.g. GRADE

134
Q

what sort of things might be included in a quality checking scale for RCTs for a systematic review?

A

whether randomisation was used AND described

was the study double blind AND described how it was double blind

are withdrawals / dropouts acknowledged

135
Q

how is data analysed for a systematic review?

A

are results consistent? how are outcomes defined and measured. is there heterogeneity or subgroup analysis?
can a meta analysis be used?

the data analysed should be defined a priori

136
Q

What statistical measures would you use in a meta-analysis for the following types of data? Continuous, dichotomous, time till event or other

A

Continuous - change from baseline, standard mean differences
Dichotomous data - OR, RR
Time till event - HR
Others - interrupted time series; incidence or prevalence

If not combinable, needs a descriptive or narrative synthesis

137
Q

How do you measure heterogeneity in systematic reviews?

A

Variability between studies can be due to participant characteristics, outcome measures, interventions or methodologies.

Chi2 = variability due to change
I2 = variation in magnitude and direction, describes the amount of variability

138
Q

What % for I2 statistic for heterogeneity is substantial? What might you do if it it high?

A

> 60%
avoid pooled analysis, narrative synthesis only and use sub-group analysis

139
Q

what assumptions are made for a sensitivity analysis? (systematic review)

A

fixed (low heterogeity) or random effects (high heterogeneity) model (fixed assumes effect of intervention is constant in all studies population)
drug dosages vary between studies
missing data

do the results still stand if you use only the best quality studies (lowest risk of bias)

140
Q

how is publication bias assessed in systematic reviews?

A

funnel plots
a plot of each trial’s effect size against some measure of sample size

statistically significant results tend to get published

you will see a gap on the non-statistically significant side of a funnel plot

141
Q

how would you assess if results apply to your population?

A

precision of treatment effect - how wide are 95% CIs
what are implication for clinical/public health
implications for research

142
Q

how would you assess if an association is real? what 4 themes would you look at

A

whether true association i.e. causal vs non-causal
if result is due to either chance, confounding or bias

143
Q

definition of bias (again!)

A

any systematic error in an epidemioglocial study that results in an incorrect estimate of the association between exposure and outcome

144
Q

what is selection bias?

A

a difference in how participants get into the study, this could be an error in identification or preferential selection of partipcants based on case/control status or exposure status

145
Q

name 3 types of selection bias

A

ascertainment bias when members of a population are more likely to be included that others due to surveillance systems or diagnostics or referral/admission to hospital, can skew outcome e.g. wealthier countries might have higher rates of breast cancer because of better detection due to screening rather than higher rates of breast ca

sampling bias means there is an inappopriate comparison group when the control group isn’t appropriately selected or representatiative of general pop

participation bias is when people in the study are different to those that are part of general population this could be due to volunteerism, worried well, non-response, refusal or survival (health worker effect)

146
Q

case-control studies are most associated with what type of bias

A

selection bias

147
Q

what is information bias?

A

difference in how data on participants are collected

differences in accuracy of exposure data/outcome data for cases/controls or study subjects wrongly classified

148
Q

what types of information bias are there?

A

reporting bias (recall bias)

observer bias (interviewer or instrument bias)

misclassification bias (participants wrongly categorised)

149
Q

what is the definition of misclassification?

A

measurement error leads to assigning wrong exposure or outcome category

150
Q

how can misclassification be sub-divided?

A

non-differential - when all individuals have some probability of being wrongly classified. Random error, unrelated to exposure or outcome status, not a bias but weakens measure of association.

differential when errors in exposure or outcome status depends on the outcome or exposure. This is a systematic error, related to the exposure or outcome status, this results in bias and affects the measure of association in any direction

151
Q

how do you minimise information bias?

A

standardise methods of data collection with objective questions/measures and training/blinding of interviewers.

use multiple sources of information

and use prompts to aid recall

152
Q

common bias in case-control and retrospective cohort studies

A

ascertainment bias, participation bias and interviewer bias (exposure and disease have already occurred; differential selection or data gathering in cases and non-cases

recall bias - cases may remember exposures differently from controls

153
Q

common bias in prospective cohort studies

A

loss to follow up (main concern) as major source of bias, do we assume they do or do not develop outcome?

ascertainment and interviewer bias (knowing exposure may influence how outcome determined)

non-response or refusals, little concern bias only arises if related to both exposure and outcome

recall bias is not a problem as exposure is determined at time of enrolment

154
Q

name two types of prevalence

A

point and period prevalence

155
Q

name two types of incidence

A

incidence proportion (cumulative incidence) and incidence rate (incidence density)

156
Q

what is the point prevalence equation?

A

number of cases of disease at a specific time divided by population at risk at the specific time

157
Q

equation for period prevalence

A

number of cases of disease during a time period divided by population at risk midway through the time period

158
Q

what would increase prevalence?

A

increase in new cases
longer duration of the disease
increase in survival (without cure)
in-migration of cases
out-migration of healthy people
improved diagnosis or better reporting

159
Q

what would decrease prevalence?

A

decrease in new cases
shorter duration of disease
high case-fatality rate from the disease
in-migration of healthy people
out-migrations of cases
improved cure rate

160
Q

incidence proportion equation

A

number of new cases of disease during a period divided by the population at risk at the start of the period

161
Q

incidence rate equation

A

number of new cases of disease during a period / total person time at risk

this can be expressed as person-years as long as it is time and person aka force of morbidity or mortality incidence density

162
Q

risk definition

A

the probability than an event will occur

163
Q

odds definition

A

probability than an event will happen / probability that an event will not happen

164
Q

mortality rate definition

A

incidence of death in a population

165
Q

mortality rate equation

A

( total no. of deaths from all causes in 1 year / no. of persons in the population at mid year ) x 1000

166
Q

type of mortality rates

A

crude mortality rates - total number of deaths per year per 1,000 (or 100,000) people

age-specific mortality rate - total number of deaths per year per 1,000 people of a given age

cause-specific rate - number of deaths due to a particular cause per year per 1,000 (or 100,000) population

167
Q

case fatality rate equation

A

no. of people dying during a specified time after disease onset or diagnosis / no. of individuals with the specified disease ) x 100

mortality rates denominator = entire population at risk of dying

CFR denominator = those who already have the disease

168
Q

what is proportional mortality ratio (PMR) and what does it show

A

number of deaths from a specific cause during a specified time period / all deaths during that time period. it shows the relative importance of certain cause of deaths in relation to all deaths in that population.

hard to compare different populations due to varying denominators

169
Q

maternal mortality rate equation

A

no. of pregnancy related deaths by place and time / average number of women of reproductive age in the same population or time frame x 100,000

170
Q

what is the rate for infant mortality rate

A

deaths at age under one year per 1,000 live births

171
Q

stillbirth rate

A

foetal births from 28 (24) week per 1000 total live and still births

172
Q

perinatal mortality rates

A

stillbirths and deaths in first week of life per 1000 live and stillbirth

173
Q

neonatal mortality rate

A

deaths at age under 28 days per 1000 live births

174
Q

child mortality rate

A

deaths age under 5 years per 1000 live births

175
Q
A
176
Q

How do ratios help in comparing disease between populations?

A

Ratios allow comparison by considering the number of cases in relation to the size of different populations, making it easier to make meaningful comparisons regardless of population size.

177
Q

How is proportion different from ratio?

A

A proportion is a fraction where the numerator is a part of the denominator, representing a part of a whole, while a ratio compares two separate quantities, and can be used to compare two different population groups.

178
Q

Compare and contrast point prevalence, period prevalence, and lifetime prevalence with examples.

A

Point Prevalence: Measures the proportion of the population with a specific condition at a single point in time (e.g., the number of people experiencing an asthma attack on a particular day).
Period Prevalence: Measures the proportion of the population that experienced the condition over a defined period (e.g., the number of people who had an asthma attack in the month of January).
Lifetime Prevalence: Measures the proportion of the population that has ever had the condition at any point in their lives (e.g., the percentage of people who have ever had an asthma attack).

179
Q

Risk definition and equation

A

The risk is the probability that a subject within a pop-
ulation will develop a given disease, or other health out-
come, over a specified follow-up period.

Risk =
Number of subjects developing the disease over a time period /
Total number of subjects followed over that time period

180
Q

Incidence rate, definition and equation

A

It can be calculated by
dividing the number of subjects developing a disease by
the total time at risk for all people to get the disease. The
denominator of this formula includes a measure of time
instead of just a number of subjects. The incidence rate
should therefore be interpreted as an instantaneous con-
cept, like speed.

Incident rate = number of subjects developing the disease / total time at risk for the disease of all subjects followed

181
Q

How would you calculate an average waiting time from an incident rate?

A

Under conditions in which rates do not change with
time (a steady state), the incidence rate can be interpreted
as the reciprocal of the average time until an event occurs,
also called the waiting time. For example, in the calcula-
tion of the incidence rate of vascular access infections in
HD patients, the average waiting time for such an episode
to occur would be 1/0.54 = 1.85 years.
When calculated over a short period of time, the risk
and the incidence rate will be rather similar, because the
influence of loss to follow-up and competing risks which
may flaw risk will only be small

182
Q

Key features of ecological and cross-sectional studies (revision slides)

A

Population level information
Snapshot in time
Examine association between amount of risk factor in an area and amount of disease in the area
Uses aggregated data at the population level
Does not use data on individual subjects and therefore subject to ecological fallacy
Often use scatterplots to demonstrate associations

  • Think fish consumption and heart disease.

Ecological studies are useful for generating hypotheses

183
Q

CS studies from revision slides

A

Surveys - Census, Welsh Health Survey, National Attitudes and Lifestyle Survey

Information not available from routine data to answer specific questions:

What is the caesarean section rate in hospitals in England and Wales?

What is the prevalence of psychological distress following the summer 2012 floods in South Yorkshire?

Usually descriptive but can be analytical

184
Q

Definition of population attributable risk (PAR) or population excess risk

A

Population attributable risk (PAR) is the proportion of the incidence of disease in a population (exposed and non exposed) that is due to exposure.

PAR = Incidence rate in total population – incidence rate unexposed.

It is the incidence of a disease in the population that would be eliminated if exposure was eliminated.

185
Q

Population attributable fraction (PAF) definition

A

The proportion of the disease in the population that is due to the exposure, assuming causality

= (Ipop - Iu)/Ipop

also = [Pe x (RR-1)] / [Pe x (RR-1) + 1]
Ipop = incidence rate in the population
Iu = incidence rate in unexposed
Pe = proportion of population exposed aka = prevalence of exposure
RR = relative risk

186
Q

What does PAF (%) answer?

A

What proportion of the new cases of disease observed in the study is attributable to a risk factor?

187
Q

What depends on PAF?

A

PAR & PAF depend on both:
the strength of the association (RR)
the prevalence of exposure in the population (Pe)
to have a large impact on the population the exposure must be common

PAF provides important information about the potential impact of prevention programmes and interventions in public health

188
Q

Compare and contrast case-control vs cohort

A

CS vs cohort
Sample Size small vs large
Cost/Time less/short vs more/long
Rare disease good vs bad
Rare exposure bad vs good
Multiple exposures good vs bad
Multiple outcomes bad vs good
Rates of disease novs yes
Recall/selection bias yes vs no
Loss to follow up no vs yes

189
Q

what is a generalised structure for an RCT?

A
  1. Identify population of interest, select sample i.e. using sampling frame
  2. record baseline measurements on entire sample
  3. randomly allocate subjects to group A (intervention) and group B (control).
  4. ITT analysis of results
190
Q

What 9 questions would you ask for an RCT?

A
  1. Appropriateness of the strategy
  2. Representativeness of sample - can result be generalised
  3. has random allocation worked?
  4. has bias been introduced through losses to the trial?
  5. is the trial clinically relevant? is the outcome a disease, or simply a mechanism in a disease?
  6. is the power of the trial adequate? is the trial large enough and sensitive enough? are the results statistically significant but clinically unimportant?
  7. are the results consistent with evidence from other sources. Hence Bradford criteria for testing an association
  8. is the trial a test of management or of disease i.e. efficiency and effectiveness in Cochrane terms
  9. cost - other strategies and risk/benefit balance
191
Q

Domains of cochrane risk of bias tool used to assess the study methods for:

A

random sequence generation
allocation concealment
blinding
selective reporting
other bias

192
Q

What 9 things are Bradford Hill’s conditions

A
  1. strength of association - the stronger the more causal (OR/RR)
  2. consistency - if relationship is causal we would expect the finding to be consistent with other data
  3. specificity - this suggests one exposure = one disease, harder to use now
  4. temporality - does exposure occur before disease development
  5. biological gradient (dose response) - as dose of exposure increases risk of disease also increases
  6. plausibility - sits within current body of evidence
  7. coherence - replication of finding in different situations, consistent with sub-groups and different population unless plausible cause
  8. analogy - similar findings between observed association and other associations
  9. experimental evidence - similar finding between lab run and observational studies
193
Q

Absolute risk definition

A

The incidence of an event in exposed people

194
Q

Risk difference definition

A

The difference between the incidence in exposed people minus the incidence in
unexposed people (Ie-Iu).

195
Q

Relative risk definition

A

Incidence of event in exposed group/incidence of event in the unexposed group (Ie/Iu)

196
Q

Cumulative incidence definition

A

Total number (or proportion) of a group of people who experience a new event during a
specified time period

197
Q

Cross-sectional study definition

A

Observe a population at a point in time. They are descriptive studies. Useful
in assessing prevalence rates etc. They can show association but not
direction of causality due to lack of longitudinal data.

198
Q

Meta-analysis definition

A

The statistical analysis of the data/results from homogeneous studies with
the same outcome of interest to produce an overall, pooled result of the
treatment/interventional effect. Most commonly used for RCTs but can be
used for observational studies too. Useful when studies are too small to
show a significant difference, combining the results of several studies
increases the sample size and power enabling a more precise measure of
effect

199
Q

If you found an I2 score of 79% what would you do?

A

This implies severe/strong heterogeneity (1 mark)
* Refit the analysis with a random effects model (1 mark)
* Investigate the heterogeneity using subgroups (1 mark)
* Carry out a sensitivity analysis (1 mark)
* If you can’t explain the heterogeneity - do not pool the data (1 mark)

200
Q

Define primary outcome

A

The leading outcomes that determine the success or effectiveness of an intervention

201
Q

Forest plot definition

A

The graphical/visual result of a pooled meta-analysis, it presents the pooled effect
size and confidence interval

202
Q

Funnel plot definition

A

A comparison of study size and effect size, it is a tool to assess publication bia

203
Q

Publication bias definition

A

Publication bias is the term used to describe the tendency for positive and negative
results to be unequally reported in published literature. Negative results are more
likely to not be reported. Reviewers should aim to minimise publication bias. Funnel
plot can be used to assess bias.
8

204
Q

Internal and external validity definitions

A

Internal validity means
that the study measured what it set out to; external validity is the ability to generalise from the study to the reader’s
patients.

205
Q

What might cause variability between studies? aka heterogeneity

A
  • participant characteristics e.g. age group
  • outcome measures e.g. pain-free vs disability
  • interventions e.g. drug dosage, Rx duration
  • methodology e.g. randomisation vs none
206
Q

Temporal ecological studies characteristics

A

Temporal ecological studies measure exposure and the outcome in the same population at different points in time
Or between 2 different populations
They can also be referred to as time-trend analysis or longitudnal ecological studies
This data enables a dynamic view of a population’s health
These studies can be used to examine
rates of disease
mortality
changes in health behaviours
Comparisons can be geographical, different time periods or patterns of change over time
Generate hypothese, identify trends
Do NOT prove causality

207
Q

What is a nested case control study?

A

An alternative approach is to use a design in which a case-control study is nested within the cohort. In this approach, the cohort is identified and followed up until a sufficient number of cases develop. All these cases, and a random sample of controls among those who have not developed disease in the cohort, become the “case-control” study. Additional information is usually collected and the analysis is carried out.

208
Q

When can controls be selected in a nested case-control study?

A

This design is a type of population based case-control study as both cases and controls are drawn from the same population (the cohort).
In this study design, the controls can be selected in three different ways:
- Controls are sampled form the population still at risk at the end of the follw up period i.e. those still disease-free at the end of the study. In this circumstance, a cohort subject can only be a case or a control but not both.
- Controls are sampled from those who are still at risk at the time each case is diagnosed, so that controls are time-matched with cases. In this circumstance, a cohort subject originally selected as a control can later become a case.
- Controls are sampled from those who were at risk at the start of the cohort. This design is called a “case-cohort” study. In this circumstance, a cohort subject can be a control and then become a case and vice versa.

209
Q

What measures are used for nested case control studies?

A

It can be mathematically shown that the odds (of exposure) ratio from a nested (or a population based) case-control study can provide an unbiased estimate of the risk ratio and rate ratio, and the odds (of diasese) ratio depending on how controls were selected.
Controls sampled from those who are disease free at the end of follow up = estimate of odds of disease ratio
Controls selected from those at risk at the time each case develops = estimate of rate ratio (time-matched analysis required).
Controls selected from all those initially at risk = estimate risk ratio

210
Q
A
211
Q

selection bias definition

A

how subjects get into the study or errors in the process of identifying the study population

212
Q

ascertainment dias is

A

differences in surveillance (detection), hospitalisation (referral, admission or diagnostic).

If exposed cases have a difference chance of admission to controls this can lead to an overestimate in size of effect.

213
Q

sampling bias is…

A

inappropriate comparison group. How representative are the controls of the population giving rise to the cases. This can lead to over/under estimation.

214
Q

participation bias is…

A

self-selection or volunteerism, non-response or refusal.

It is how the responders differ from non-responders which lead to an over/under estimation of effect.

215
Q

how might you limit selection bias?

A

you need to improve your choice of study population.

this can be done via clear definition of study population; using a consistent case definition; trying to enrol all cases; ensuring cases/controls are from same population

216
Q

what is information bias?

A

biases in how data is collected

if study subjects are wrongly classified this becomes misclassification bias

217
Q

misclassification bias is…

A

when study subjects are wrongly categories due to inaccurate diagnosis or admin error etc.

218
Q

reporting bias is…

A

recall bias, where cases may more clearly recall exposures than controls

219
Q

observer bias is…

A

interviewer bias - interviewers case/control status and probes cases differently as a result

data collection instruments - calibration errors or measurement in interviews

220
Q

what is misclassification? what the types and how are they different (4 things)

A

measurement error leads to assigning wrong exposure or outcome category

this can be non-differential i.e: random error; unrelated to exposure or outcome status; not a bias but weakens measure of association.

differential is a systematic error, related to exposure or outcome status; it introduces bias and can distort measure of association in any direction

221
Q

type I error and type ii error defined in terms of rejecting null or not.

how do we reduce type I and II error

A

I - reject null when it is actually true
II - accept null when it is incorrect

increase power of study and select appropriate sample size

222
Q

what is a systematic review?

A

A review of published and unpublished studies focussed on a single question
Aims to identify, appraise, select and synthesise all evidence relevant to the question
Take into account quality of the evidence

223
Q

steps in a systematic review

A
  1. Ask a focussed question (PICO)
  2. Define inclusion and exclusion criteria
  3. Locate studies – Database search, grey literature
  4. Select studies
  5. Data extraction – assess study quality – NOS: observational studies/ Jadad – RCTS.
  6. Analyse results:
    Individual study results
    Meta-analysis if appropriate/ forest plot
    Heterogeneity – fixed and random effects model
    Sensitivity analysis
  7. Interpret results
224
Q

Tell me about heterogeneity, the tests, the models and how to interpret them

A

Chi2 test: null hyp = no heterogeneity. Therefore a small p-value = heterogeneity. If there are only a few studies the test is under powered so a cut of 0.10 is used to assess heterogeneity.

I2 test: Depends on magnitude and direction of effects. Should be used in combo with Chi2 statistic i.e. evidence for hetergeneity. Rough guide – 0-40% litte, 40 – 75% Moderate, 75%+ Considerable.

Fixed effects model: Use if heterogeneity is not deemed to be a problem. Underlying assumption that there is a ‘fixed’ or common feature that underlies all of the studies in the analysis. It treats the studies as if there was no heterogeneity.

Random effects model: Use if there is considerable heterogeneity between studies. Underlying assumption is that the studies are estimating different treatment effects. Looks at the distribution of effects across different studies.

If you try both models and they give similar results it is unlikely that heterogeneity is NOT a problem and you can use either.