8. Medical Statistics 2 Flashcards
what is BIVARIATE ANALYSIS
ANALYSIS of the RELATIONSHIP between 2 Variables:
RESPONSE variable and EXPLANATORY variable
statistical methods analyse how the outcome on the RESPONSE VARIABLE DEPENDS ON or is EXPLAINED BY the value of the EXPLANATORY VARIABLE
- needed to formally investigate whether there are meaningful differences and not a result of random chance
what is the RESPONSE VARIABLE
OUTCOME or DEPENDENT
the one on which COMPARISONS ARE MADE
what is the EXPLANATORY VARIABLE
INDEPENDENT or EXPOSURE
usually DEFINES the 2 GROUPS being COMPARED
STEPS in HYPOTHESIS TESTING
- STATE the NULL and ALTERNATIVE HYPOTHESES
- DECIDE what STATISTICAL TEST is appropriate
- Use the test to CALCULATE the P-VALUE
- WEIGH the EVIDENCE AGAINST the NULL
what type of TEST is used for 2 NUMERICAL VARIABLES
CORRELATION / REGRESSION
- Correlation (2 sided association)
- Simple Linear Regression (one-sided association)
what type of TEST is used for 2 CATEGORICAL VARIABLES
CHI-SQUARED TESTS
- chi-squares test (unpaired)
- McNemar test (paired)
what type of TEST is used for 1 CATEGORICAL and 1 NUMERICAL VARIABLE
if 2 GROUPS: T-TEST (paired and unpaired)
if >2 GROUPS: ANOVA (unpaired) & ANOVA for repeated measures (paired)
what are DEPENDENT SAMPLES
when the research hypothesis involves comparing the SAME PEOPLE who were measured twice or more often
(PAIRED DATA)
eg. a diet study in which subjects’ weights are measured before and after the diet.
the observation in the first (before diet) and second (after diet) samples are related because they refer to the same person
Analysis of DEPENDENT SAMPLES requires use of the paired or unpaired version of the statistical tests
PAIRED VERSION
what are INDEPENDENT SAMPLES
and is the paired or unpaired version of the respective statistical test used
involve comparison of 2 or more groups who are INDEPENDENT from each other
- DIFFERENT INDIVIDUALS
eg. randomised trial that randomly allocates subjects to 2 treatments
eg. observational study that separates subjects into groups according to their value for an explanatory variable (ie smoking status)
the UNPAIRED VERSION of the statistical test is used
CORRELATION / REGRESSION
in SCATTERPLOT what is on the HORIZONTAL and VERTICAL AXIS
Horizontal, x : PREDICTOR VARIABLE
Vertical, y : RESPONSE VARIABLE
CORRELATION / REGRESSION
what to look for in SCATTERPLOTS?
DIRECTION of the relationship
- NEGATIVE: as one goes up other goes down
- POSITIVE: as one goes up other also goes up
FORM of the relationship
- LINEAR? or not
STRENGTH of the relationship
- points appear tightly clustered in a single stream or form a vague cloud?
OUTLIERS
CORRELATION / REGRESSION
what measures the STRENGTH of the LINEAR ASSOCIATION between 2 NUMERICAL VARIABLES
CORRELATION COEFFICIENT (r)
r is always between -1 and +1
r>0 POSITIVE correlation
r<0 NEGATIVE correlation
r=0 NO correlation
CORRELATION / REGRESSION
do OUTLIERS AFFECT CORRELATION
Correlation is VERY SENSITIVE to OUTLIERS
an extreme outlier can cause a dramatic change in r
CORRELATION / REGRESSION
what is COEFFICIENT of DETERMINATION (r^2)
expresses the PROPORTION of the VARIANCE in one variable that is ACCOUNTED FOR or ‘EXPLAINED’ by the variance in the other variable
- square of r
eg. a study finds an r=0.40 between salt intake and blood pressure.
It can be concluded that 0.40^2 = 0.16
or 16% of the variance in blood pressure in this study is accounted for by salt intake