10. Correlation and Regression Flashcards
what is the bivariate case?
the case where there is one predictor (IV) and one criterion variable (DV)
what is the abbreviation for pearson’s correlation coefficient?
r
what is the range of pearson’s correlation coefficient?
from -1 to +1
what do negative and positive values of pearson’s correlation coefficient indicate?
the direction and strength of a linear relationship between an X and Y variable
what does a negative pearson’s correlation indicate?
indicates a relationship where increases in one variable are associated with decreases in the other and vice versa
what does a positive pearson’s correlation indicate?
indicates a relationship where increases in one variable are associated with increases in another and decreases with decreases
what is a pearson’s correlation coefficient designed to test?
for a linear relationship
what is pearson’s correlation coefficient based on?
the ability to invisibly draw a straight line through the data points
what must the relationship be for pearson’s correlation coefficient to apply?
a linear relationship
what is the formula for pearson’s correlation coefficient (r)?
r = (ΣZ_XZ_Y) / N
what is the procedure for calculating r?
for each score, calculate its corresponding Z score (Z_X and Z_Y), ensuring that you use the correct mean and SD
multiply each Z_X by its paired Z_Y to get the cross product, ensuring that the sign (+ or -) is correct
add all the cross products (ΣZ_XZ_Y)
divide by the number of pairs of scores (N)
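The procedure above can be sketched in a few lines of Python. This is only an illustration: the function names and the data (hours studied and exam marks) are hypothetical, and the population SD (dividing by N) is assumed so that it matches the division by N in the final step.

```python
from statistics import mean, pstdev

def z_scores(scores):
    """Convert raw scores to Z scores using the scores' own mean and SD."""
    m = mean(scores)
    sd = pstdev(scores)  # population SD (divides by N), an assumption of this sketch
    return [(s - m) / sd for s in scores]

def pearson_r(xs, ys):
    """Pearson's r as the sum of Z score cross products divided by N."""
    zx = z_scores(xs)
    zy = z_scores(ys)
    cross_products = [a * b for a, b in zip(zx, zy)]
    return sum(cross_products) / len(cross_products)

# Hypothetical data: hours studied (X) and exam mark (Y) for five people
hours = [1, 2, 3, 4, 5]
marks = [52, 58, 61, 70, 74]
r = pearson_r(hours, marks)
print(round(r, 3))
```

Because marks rise steadily with hours in this made-up data, r comes out close to +1.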
what do Z scores tell us?
whether a score (X or Y) is above (+Z) or below (-Z) a mean (M)
what do we get if we multiply Z values on X and Y for each person?
cross products
what does it mean when we get mostly positive cross products?
positive correlation
what does it mean when we get mostly negative cross products?
negative correlation
what does it mean when we get an equal number of + and - cross products?
no correlation
what are the characteristics of a positive correlation?
- Above mean on X (+ZX), above mean on Y (+ZY)
- Below mean on X (-ZX), below mean on Y (-ZY)
- ZXZY values are mostly positive (multiply two negatives = positive)
what are the characteristics of a negative correlation?
- Above mean on X (+ZX), below mean on Y (-ZY)
- Below mean on X (-ZX), above mean on Y (+ZY)
- ZXZY values are mostly negative
what are the characteristics of no correlation?
- Position on X (+ZX or -ZX) not linked to position on Y (+ZY or -ZY)
- ZXZY values are equally positive and negative
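The link between the sign of the cross products and the direction of the correlation can be checked with a small sketch. The helper names and the six pairs of scores are hypothetical, and the population SD is assumed when computing Z scores.

```python
from statistics import mean, pstdev

def cross_products(xs, ys):
    """Return Z_X * Z_Y for each pair of scores."""
    mx, my = mean(xs), mean(ys)
    sdx, sdy = pstdev(xs), pstdev(ys)
    return [((x - mx) / sdx) * ((y - my) / sdy) for x, y in zip(xs, ys)]

def sign_counts(products):
    """Count how many cross products are positive and how many are negative."""
    positive = sum(1 for p in products if p > 0)
    negative = sum(1 for p in products if p < 0)
    return positive, negative

# Hypothetical scores for six people
x = [1, 2, 3, 4, 5, 6]
y_rising = [2, 3, 5, 6, 8, 9]    # high X goes with high Y
y_falling = [9, 8, 6, 5, 3, 2]   # high X goes with low Y

print(sign_counts(cross_products(x, y_rising)))   # → (6, 0)
print(sign_counts(cross_products(x, y_falling)))  # → (0, 6)
```

In the rising data every person is on the same side of the mean for X and Y, so every cross product is positive. In the falling data every person is on opposite sides, so every cross product is negative.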
what does it mean if r is close to +1?
there is a strong positive relationship
what does it mean if r is close to -1?
there is a strong negative relationship
what does it mean if r is close to 0?
there is a weak (or no) linear relationship
what can correlation tell us?
the direction and strength of our relationship
what can regression help us with?
helps us actually plot the line that the correlation metaphorically draws through the data
what can the line that regression assists in drawing help with?
helps us to predict scores on our DV
what is the correlation coefficient a measure of?
how close the data points are to the line
why is it called the line of best fit?
because it is drawn in the position that minimises the overall distance between the line and all the data points
what is the line of best fit influenced by?
every data point in the scatterplot
where does the line of best fit carefully place itself?
in the position that is, overall, as close as it can be to all the data points
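A minimal sketch of what "closest to all the data points" can mean, assuming that distance is measured as the squared vertical gap between each point and the line (the usual least squares criterion). The function names and data are hypothetical.

```python
from statistics import mean

def sum_squared_distances(xs, ys, slope, intercept):
    """Total squared vertical distance between each point and the line."""
    return sum((y - (intercept + slope * x)) ** 2 for x, y in zip(xs, ys))

def least_squares_line(xs, ys):
    """Slope and intercept of the line that minimises the squared distances."""
    mx, my = mean(xs), mean(ys)
    sum_xy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sum_xx = sum((x - mx) ** 2 for x in xs)
    slope = sum_xy / sum_xx
    intercept = my - slope * mx
    return slope, intercept

# Hypothetical scatterplot data
xs = [1, 2, 3, 4, 5]
ys = [2, 4, 5, 4, 6]

best_slope, best_intercept = least_squares_line(xs, ys)
best = sum_squared_distances(xs, ys, best_slope, best_intercept)

# Nudging the line away from the best position makes the total distance larger
steeper = sum_squared_distances(xs, ys, best_slope + 0.2, best_intercept)
higher = sum_squared_distances(xs, ys, best_slope, best_intercept + 0.5)
print(best < steeper, best < higher)  # → True True
```

Every data point contributes to the total, which is why moving or adding a single point shifts the line.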
what are we calculating with regression?
the ordinary least squares (OLS) regression line
what are the two pieces of information that we need to calculate the line of best fit based on our x and Y scores?
the slope and the Y-axis intercept
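As a hedged illustration, the slope and intercept can be computed from r, the two SDs, and the two means. The formulas used here (slope = r × SD_Y / SD_X, intercept = M_Y − slope × M_X) are a standard way of writing the least squares line but are an assumption of this sketch rather than something stated on these cards. The function names and data are hypothetical.

```python
from statistics import mean, pstdev

def slope_and_intercept(xs, ys):
    """Slope (b) and Y-axis intercept (a) of the line Y' = a + bX."""
    mx, my = mean(xs), mean(ys)
    sdx, sdy = pstdev(xs), pstdev(ys)
    # r as the sum of Z score cross products divided by N
    r = sum(((x - mx) / sdx) * ((y - my) / sdy) for x, y in zip(xs, ys)) / len(xs)
    slope = r * sdy / sdx          # change in predicted Y per one-unit change in X
    intercept = my - slope * mx    # predicted Y when X is 0
    return slope, intercept

def predict(x, slope, intercept):
    """Predicted score on the DV for a given score on the IV."""
    return intercept + slope * x

# Hypothetical data that follow Y = 1 + 2X exactly
xs = [1, 2, 3, 4, 5]
ys = [3, 5, 7, 9, 11]
b, a = slope_and_intercept(xs, ys)
print(round(b, 2), round(a, 2), round(predict(6, b, a), 2))  # → 2.0 1.0 13.0
```

Once the slope and intercept are known, any X score can be plugged into the line to predict a score on the DV.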
what is the slope?
how steep it is