Basic Epi Flashcards
Descriptive study definition including study types
Measure the occurrence of outcomes.
Can be split into either populations or individuals.
Individuals - case reports, case studies, case series, surveillance and prevalence cross-sectional studies
Populations - Ecological studies
Analytical study definition
Test the association between exposure and outcome
How can you measure the distribution of disease?
Time - year, season, day, hour
Place - country, region, district
Person - age, sex, social class, lifestyle
Four commonly used sources of data
Routine statistics
Population censuses
Surveys
Special studies
5 routine statistic sources
Death certificates
Birth records
Special disease registers - cancer registries
Communicable disease reports
GP records
What is an ecological study? (at least 4)
An observational design study (no treatment)
Use routinely collected data
Based on groups - not individuals (group is unit of observation), not possible to link exposure to his/her outcome
Uses correlation coefficient (r)
Useful for generating hypotheses, not useful for true exposure risk at individual level.
Can be useful at looking at group level disease e.g. schools
An example could be air pollution and mean bmi of an area
Why use an ecological study?
to investigate aetiology and risk factors for disease or evaluate changes in health care policy
generate hypotheses
estimate prevalence
What is the definition of ecological fallacy?
Associations at population level do not imply association at an individual level
What are the strengths of ecological studies? 8 marks
Quick and cheap
Use available data e.g. routine stats
Some factors operate at population level e.g. air pollution
Some exposure data only available at population level
Differences in exposure between areas may be larger than those between individuals in one area
Ability to map ecological data
Can generate hypotheses
Random errors may be smaller for populations than individual exposures
What are the weaknesses of ecological studies?
Data may be collected or recorded differently in different places
Surrogate measures based on the average of the population
Spatial boundaries are artificial
Confounding (lack of data)
Could use proxy measures
Classification challenges
Ecological fallacy
Uncertainty in temporal relationship
Collinearity in variables (i.e. your variables are too similar)
What is a cross-sectional study? (8)
An observational study
Carried out at a single point in time
Snapshot of population health
Collects individual data
Can measure prevalence, not incidence
Cannot prove cause and effect
Good at generating hypotheses
Can be descriptive or analytical (descriptive will describe the data, analytical will investigate risks factors and outcomes, collecting data on outcomes and exposures at the same time)
Typically use surveys to gain data
Examples of national CS studies
Surveys - census, national survey for Wales, national attitudes and lifestyle survey
CS design
Population has a representative sample (if not using whole pop).
Descriptive:
Then you need a number with disease or exposure and then a number without disease or exposure.
This will allow calculation of prevalence of disease
Analytical:
Will need number with/without disease and then with/without outcome, so 4 sample groups vs 2 groups.
CS Study and temporality
CS study - difficult to measure temporality, chicken and egg scenario. This temporality issue depends on the exposure e.g. a genetic factor does not vary over time whereas exercise levels will change
Strengths of CS studies
Quick
Cheap and simple
Good for chronic diseases
Data on individuals e.g. questionnaires
Can estimate prevalence
Can assess many outcomes and risk factors
Can generate hypotheses
Weaknesses of CS studies
No good for acute diseases
No good for rare diseases
Prone to bias
Need high participation rates to be valid
Only a snapshot
Cannot infer temporal or causal relationships
Bias definition
Bias is a consequence of defects in the study design or execution of a study and cannot be controlled for by statistical measures and often cannot be mitigated by increasing sample size.
It is any systematic error in an epidemiological study that results in an incorrect estimate of the association between exposure and outcome
Name two types of bias
Selection bias (differences between groups) in how study subjects are chosen or respond
Information bias (difference between groups) in the accuracy of data on exposure/outcome
CS studies key metholodigical concerns
Validity and repeatability
Response rates (non-response)
Sampling - how representative of the true population is the sample? (sample size calls done?)
Association is NOT the same as causation
Definition of confounding
A spurious relationship between exposure and outcome due to the presence of another variable which is associated with both the exposure and the outcome
How to control confounding?
By study design methods:
- randomisation (only in intervention designs)
- restriction (limit entrance to those within specific categories (e.g. age group)
- matching e.g. age or sex
By analysis methods
- stratified analysis
- multivariate analysis (can control simultaneously for several factors)
The goal of statistical analysis in the context of sampling is…
If we wish to say something about an attribute of the population, we take a sample so we can make inferences back to the population
Sampling in observational vs intervention
In observational you sample participants to observe
In intervention you allocate participants to observe
Why do we sample?
Not possible to collect info from all subjects
Sample can provide reliable info on the population by:
- estimating important population parameters (means, proportions etc.)
- to infer toward the populations (using valid distributional assumptions)
- present inferences using estimates, confidence intervals, hypothesis tests (p values)
We do this to potentially minimise bias