Chapter 9 simple linear regression/correlation slide 13 onwards Flashcards
3 assumptions when doing pearson correlation analysis
- X and Y are quantitative data
- Variables X and Y are simple random variables
- Pairs of X and Y follow the bivariate normal distribution
- individual variables are normally distributed
- r is sensitive to outliers
Strong positive relationship correlation coefficient
Close to +1
Negative linear relationship correlation coefficient
Close to -1
No linear relationship correlation coefficient
Close to zero
Hypothesis testing for correlation
ρ (represents correlation coefficient of populations)
Null hypothesis: ρ =0
Alternative hypothesis: ρ not equals to zero
Often, coefficient of determination is expressed as
Proportion or percentage
What does it mean when coefficient of determination r^2 is 0.85
85% of the change in y is caused by a change in x
The ______ the correlation coefficient, the ______ the coefficient of determination. This implies that _________ in dependent variable is influenced by independent variable
larger, larger, more changes
interpretation when r^2 is zero
no correlation
interpretation when r^2 is up to 49%
low correlation (need consider e sign)
interpretation when r^2 is 50-95%
high correlation, need to consider the sign
interpretation when r^2 is 96-99%
very high correlation, need to consider the sign
interpretation when r^2 is 100
perfect correlation (need to consider the sign)
2 methods for determining correlation for non-normally distributed data
logarithmic transformation and non-parametric correlation analysis
what is non-parametric correlation analysis
ranking data from smallest to largest using Kendall’s or Spearman’s rank correlation to calculate the correlation coefficient