Causality, Bias & Confounding Flashcards
What questions are asked in description?
- what happened?
- who was affected?
- people with X had Y
What questions are asked in prediction?
- what will happen?
- who will be affected?
- people with X are more likely to have Y?
What questions are asked in causal inference?
- what will happen if…?
- why were they affected?
- if we changed X, how would it change Y?
What questions would be asked if it was qualitative?
- what matters?
- why does it matter?
- how can we effectively change x?
- should we change x?
The headline “organic food lowers blood and breast cancer risk” is an example of what?
causal nonsense
it is implying that if you eat organic food, you will have a lower risk of contracting cancer
What type of approach to causal inference is shown here?
‘causation’ of infectious disease is fairly simple
this is deterministic
What is meant by a deterministic relationship?
a deterministic relationship involves an exact relationship between two variables
the deterministic model gives the same exact results for a particular set of inputs, no matter how many times you re-calculate
What is an example of a deterministic relationship on a molecular level?
molecular and cellular processes (e.g. laboratory studies) show a deterministic relationship
relaxed myometrial cell + prostaglandin E2 = contracted myometrial cell
Why can a deterministic model not be used for the vast majority of health outcomes?
for the vast majority of health outcomes, there are multiple causes
What is the difference between a deterministic model and a probabilistic model?
probabilistic models incorporate random variables and probability distributions into the model of an event
a deterministic model gives a single possible outcome for an event
a probabilistic model gives a probability distributon as a solution
What type of relationship is shown here?
What problem does this raise with causal inference?
this relationship is NOT probabilistic
how do we identify causes and what works “best” when one thing doesn’t necessarily lead to another?
What is the fundamental problem of causal inference?
you can never know what would have happened if you had done things differently
i.e. we cannot observe the counterfactual
What do we need to do in order to study how most things work?
to study how most things work, we have to come up with an “estimate” of the counterfactual
i.e. a control
What is the problem with estimating the counterfactual?
individual people are very different and have lived very different lives
What is meant by exchangability?
Why is it important?
because everyone is different we have to work with groups of people and find ways to ensure our groups are - on average - comparable
What is the best way to achieve exchangability?
the easiest way to do this is through randomisation
this produces both the intervention group and the comparison
Is randomisation always going to produce exchangability?
NO because randomisation is a blunt tool
the sample needs to be large enough to account for differences
What is meant by random sampling error?
the random error in our population estimate (s) that results from chance fluctuations in the profile of our sample
e.g. want 50% blue and 50% green
the sample contains 83% blue and 17% green
Without randomisation, does a bigger sample size help to acheive exchangability?
without randomisation, the exposure is assigned by the underlying bio-psycho-social determinants
a bigger sample won’t help to achieve exchangability