Health Data Science Flashcards
Give some categories of health data
Patient data Specific instruments (Questionnaires, rating scales) Data from blood and tissue samples Data from images Health and fitness devices
What is measured in a cross- sectional study?
Measures variables of interest at the same time
eg classically exposures (Risk factors) and outcomes (disease)
Give some examples of cross-sectional study?
- Prevalence studies
- Aetiological studies
List some strengths of cross-sectional studies?
- Relatively easy/cheap to conduct
- Provide distribution/burden of exposure/outcome information
List some weaknesses of cross-sectional studies?
- Only measures prevalence, not incidence
- Can be difficult to establish time-sequence
What is a case-control study?
Starts with cases and controls and look to see who had the exposure (Risk factor) in the past
Often used for diseases with a long latent period.
List some strengths of case-control studies?
- Quick and relatively cheap (compared to cohort)
- Good for studying rare diseases
- Good for diseases with long latent periods
List some weaknesses of case-control studies?
- Prone to selection bias (ie unrepresentative controls)
- Prone to information bias
- Cannot establish the sequence of events
What is a cohort study?
- Aetiological research - people without a disease, risk-factors measures and then follow-up for disease
- Prognostic research - People with a disease, characteristics measured, follow-up for outcomes
List some strengths of cohort studies?
- Exposures/Risk factors measured at start of study before outcome occurs - no measurement bias.
- Can provide data on time course
- Multiple outcomes can be measured
List some weaknesses of cohort studies?
- Slow and potentially expensive
- Inefficient for rare diseases
- Exposure status may change during study
- Differential-loss to follow up may introduce bias
What would be the gold-standard interventional study?
Randomised controlled trial
What are the benefits of proper randomisation?
- Comparison groups should be similar with respect to confounders, both measured and unmeasured
- Prevents bias in the allocation of participants to treatment/control
- Only difference between groups should be if they received the intervention - therefore any difference should be attributable to the intervention
What should be considered in RCT risk of bias?
- Was randomisation sequence unbiased?
- Was allocation concealed until enrolment?
- Were participants/outcome assessors aware of treatment group?
- Have participants deviated from intended interventions?
- Are they missing data which could introduce bias?
- Was measurement of outcome unbiased?
- Was the pre-specified primary outcome reported?
What are some opportunities provided by ‘big data’?
- Wide applications - predictive modelling, clinical decision report, safety monitoring, public health
- Potentially more comprehensive data
- More-detailed data (eg wearable devices)
- Costs/efficiency