Regression and Correlation Flashcards
What is correlation?
Correlation quantifies the strength of the association between two quantitative variables
What is Pearson’s correlation coefficient a measure of?
The scatter underlying a linear trend between two quantitative variables
What is linear regression?
It studies the linear relationship between two quantitative variables when one (dependent variable) is modelled as depending on the other (independent variable)
A linear regression model allows predictions about the dependent variable to be made among individuals. T/F?
True
Correlation models allows predictions about the dependent variable to be made among individuals. T/F?
False
Correlation is quantified on a scale from -1 to 1. T/F?
True
What are the assumptions in calculating correlation?
Independent observations
Bivariate Normal distribution
Relationship between X and Y is linear
The value and units of measurements of variables is unimportant for measuring correlation. T/F?
True
The value and units of measurements of variables is unimportant in regression models. T/F?
False - this is significant
Which variable is classified as X and which as Y is significant in correlation. T/F?
False
Which variable is classified as X and which as Y is significant in correlation. T/F?
True - Y should be the dependent variable. and X should be the independent variable
Why is the calculation for a CI or hypothesis test for correlation not the same as for tests of associations?
Because the sampling variability for correlation does not follow a Normal distribution
How can a straight line be described mathematically?
Y = alpha + beta X (alpha = y-intercept) (beta = gradient of line)
In a regression model, what is a residual?
The vertical distance of a data point from the line of best fit
How is the best fit line plotted in linear regression models?
The best fit line is taken as the. one which makes the sum of the squares of. the residuals as small as possible. I.e. it minimises the variance of the residuals