Statistics V - Correlation, Causation, Regression Flashcards
Regression is a model of what?
A model of the influence of x on y.
What’s a multivariate regression?
Regression with
1 independent and
several dependent variables
(-> several univeriate regressions)
What’s a multiple regression?
Regression with
several independent and
1 dependent variable.
What’s a multivariate multiple regression?
Regression with
several independent and
several dependent variables
What’s a regression with
1 independent and
several dependent variables
(-> several univeriate regressions)?
A multivariate regression.
What’s a regression with
several independent and
1 dependent variable?
A multiple regression.
What’s a regression with
several independent and
several dependent variables?
A multivariate multiple regression.
What is the multivariate regression used for?
It is used for investigating the effects of one independent variable on several dependent variables.
In a multiple regression: What does ßi stand for?
the partial coefficient of regression
Is the partial coefficient of regression equal to the bivariate coefficient of regression?
No!
What can we learn from the following example?
Epidemiological studies showed that women who were taking combined hormone replacement therapy (HRT) also had a lower- than-average incidence of coronary heart disease (CHD), leading doctors to propose that HRT was protective against CHD. But controlled trials showed that HRT caused a small and significant increase in risk of CHD. Re-analysis of the data showed that women undertaking HRT were more likely to be from higher socio-economic groups (ABC1), with better than average diet and exercise regimes.
Correlation DOES NOT imply causation!
What is a confounding variable?
A confounding variable is an extraneous variable in a statistical model that correlates (positively of negatively) with both the dependent and the independent variable.
Other terms for confounding variable include…
… confounding factor, hidden variable, lurking variable, confound or confounder.
What is the Simpson paradox / Simpson reversal (aka. Yule-Simpson effect)?
In probability and statistics, Simpson’s paradox, or the Yule–Simpson effect, is a paradox in which a trend that appears in different groups of data disappears when these groups are combined, and the reverse trend appears for the aggregate data.
Two examples of Simpson’s paradox?
Kidney Stone Treatment
Berkeley gender bias case