Epidemiological data collection and analysis Flashcards
How is research needed in public health?
Public health is of high societal value, is a collective responsibility and is at the heart of all human activities and progress
Health research relies on processes such as:
- Systematic and timely assessment of disease
- Developing novel, efficient and targeted strategies in order to protect and improve public health
- Evaluating decisions and priorities in promoting public health - that is to:
- Prevent
- Control
- Eradicate disease
Continuous monitoring of population health allows the key policy makers who are involved in the decision-making processes to consider the health impacts in relationship with other societal factors including economics.
What are the questions addressed in public health?
What is the disease and its frequency in the population?
Why is this happening - what is the source/cause of the disease?
- Emerging due to an infectious agent
- Consequence of long term exposures
Where is it happening – locally or widely spread?
What is the dynamics of the disease?
Which groups in the populations are mostly affected?
What are the modifiable factors which could lead to interventions to prevent disease?
Disease occurrence – an intertwined result of
Inherited predisposition
Environmental exposures
Life-style and socio-economic circumstances
Which data types are essential in epidemiology?
- •Understanding population health is based on information collected on individuals or on groups
-
Informative individual level data
- Demographics (age, gender, ethnicity),
- socio-economic circumstances (measures of poverty etc)
- presence of disease
- access to education and health care
- health facilities records
- •Information on aggregated level data (ecological data) – surveillance processes
- Counts on groups in the population
How is data processed?
Data are processed using
epidemiological concepts which provide measures for disease and its associations with various risk factors
–statistical techniques to generate numbers for these measures and associations as well as the uncertainty around these numbers
–on rigorously collected data and thoroughly thought designs - statistics use samples in the population to provide generalizable results at population level
What is epidemiology?
•– the study of the distribution and determinants of health related events in specified populations which serves as to control, prevent and eradicate disease
- Quantitative - not a clinical science, unified umbrella for measuring disease
- Object of study – population not the individual
- Fundamental concepts: disease and exposure (harmful or protective)
- Absolute measures= the disease in the population
- Relative measures the disease - associations of disease with exposures or with groups in the population (relative measures)
What are the differences between absolute measures of the disease and relative measures of the disease?
Absolute measures the disease in the population
- Prevalence, risk , odds – static measures of the disease
- Incidence, rates – dynamic measures of the disease
- Transmission – infectious diseases specific
Relative measures the disease - associations of disease with exposures or with groups in the population (relative measures)
- Relative risk/rates/odds of disease
- exposed vs. non-exposed
- one group vs. another defined by demographic factors such as ethnicity
What is an example of an epidemiological study?
the 1854 London Cholera Outbreak – DR John Snow
- The spatial clustering of cholera deaths around the Broad Street well provided strong evidence in support of his theory that cholera was a water-borne disease.
- Echoes modern GIS analysis
Intersection of Broadwick and
Lexington Streets – W1
What are the essential concepts of epidemiological studies
- Individual = unit of interest in clinical medicine
- Population = concept used in epidemiology in reference to large groups which share similarities –people, animals or even plants
E.g.:
- Children, infants, neonatal population
- Population of a county/country/continent/world
- HIV population, TB population, cancer population
- Tree population, fish population
Exposure = concept used in reference to a harmful or protective element which can potentially influence health status.
- environmental (air pollution, sun’s rays, clean water/air)
- chemical (food, water or treatment)
- biological (genetic inheritance)
- social (poverty or wealth)
- life style – moderate exercise intake
Scope of epidemiology
•Deriving averages in the population which serve public health policy makers
•
•Questions addressed at population level
–What is the problem and its frequency
–Who is affected – what is the population?
–Which groups in the population are mostly affected
–When/Where does the problem manifest
–Why does it occur?
–
•Health status in the population is subjected to dynamic changes in various circumstances
–Hence the need for constant monitoring
•
•Epidemiology = obtain, interpret and use data information to promote health
–Prevent (e. g. screening/vaccination)
–Reduce (interventions/therapies)
–Control, contain (contingency measures) and eventually eradicate
•Understanding disease-exposure patterns are essential for decision making bodies to draw efficient and targeted guidelines at population level. In simple words, that is allocating money where are mostly needed
How are statistics used in epidemiology?
•Statistics in population health research translate epidemiological measures into numbers and associated uncertainties. They can be results of
–Hypothesis testing – for various type of data
–Groups comparisons relative to health outcomes and exposures
–Advanced statistical modelling tailored to the nature of the outcomes to investigate patterns of disease in the population
–
•From a sample in the population (data at hand) to generalizable results and predictions to the population the sample is selected from
•
•Epidemiology uses data and statistics to provide answers to health related questions
What is statistics?
- part of mathematical sciences and have applications in many fields including health, economics, biological science, business, finances, etc.
- Statistics in population health research translate epidemiological measures into numbers and associated uncertainties. They can be results of
–Hypothesis testing – for various type of data
–Groups comparisons relative to health outcomes and exposures
–Advanced statistical modelling tailored to the nature of the outcomes to investigate patterns of disease in the population
–
What are summary statistica?
•Summary statistics
–A succinct and relevant assessment on variables at hand (the sample we work on)
–Recognising variable types is crucial as they are summarized and processed differently
–Variety of statistical software analyses data - users need to be concerned with data types and layers of dependencies
What are the different data types in statistics?
•Data types
–Variables of interest are often called outcomes or response or dependent variables in statistics (medicine/clinical sciences every measurement is usually called an outcome)
•Quantitative
–Continuous (weight, height, age, blood pressure)
–Discrete (number of accidents/week, monthly number of deaths)
•Qualitative
–Ordinal (severity of a diseases, Likert scale of agreement)
–Nominal (gender, ethnicity, presence/absence of disease, blood group)
What are the differences between quantitative and qualitative variabes?
•Quantitative variables
–Variables’ summary
- Means, Medians (Q2) – measures of central tendency, location
- Standard deviation – measures of spread of the data at hand - purely descriptive
- Q1-first and Q3 (IQR=Q3-Q1) - third interquartile further information on location and spread of the data
- Min, Max, Range(Max-Min)
–Visualization: histogram, box plots
•
•Qualitative variables
–Variables’ summary: proportions
–Visualization: bars of frequencies or proportions, pie plots
What are different concepts associated with statistical inference?
–Hypothesis testing
–P-values
–Parameter Estimates
–Standard Errors
–95% Confidence Intervals
All intrinsically linked
What is the null hypothesis, p value and standard errors and 95% CIs
Null Hypothesis usually a statement declaring no difference
- Between a collected sample and the population (no difference, similarity)
- Between two or more samples (no difference, similarity)
- Between a disease occurrence and a prior exposure (no association)
P-value
- A probability which measures the strength of the evidence against the null
- Or how far is the sample data distribution from what the researcher hypothesises
- <0.05 evidence AGAINST the null (evidence of dissimilarity)
- >=0.05 no evidence against the null (NOT FOR THE NULL)
Standard errors and 95% confidence intervals - inferential concepts
- associated with precision and uncertainty of the estimates (such as means, proportions, risk, odds, risk ratios, odds ratios)
Difference between standard deviation and standard error of the mean
Standard deviation: indicates the spread of the data at hand
Standard error of the mean: indicates the precision of the sample mean
What is the prevalence of a particular disease in a population?
-
Prevalence is a risk – measures an existing part/share/proportion of a condition relative to a referenced population at a time point or within a defined time period.
- For a fraction a/b, a is the numerator, b is the denominator
- Numerator IS always part of the denominator.
- Always refer to it as the “prevalence/risk of a condition in a population”
- Unless the context is crystal clear, using solely “prevalence/risk of a disease” is highly objectionable
Infectious diseases examples:
- Prevalence/risk of HIV in UK population in 2017 (0.16% - gender differences)
- Prevalence/risk of HIV in UK gay population in 2017 (5% - regional differences)