Chapter 6- Correlation and Regression Flashcards
What is a univariate distribution?
It is a frequency distribution where you’re only working with one variable
Ex. height, time, test scores, errors
What is a bivariate distribution?
A frequency distribution where the scores on the two variables are paired
Bivariate distributions are required in order to use correlation and regression techniques
They also may show positive/negative/zero correlation
What is considered to be a positive correlation and give an example?
It’s when high measurements on one variable tend to lead to high measurements on the other variable.
And low measurements on one variable also tend to lead to low measurements on the other variable.
Example- all fathers tend to have sons who grow up to have sons who grow up to be tall men, and
short fathers tend to have sons who grow up to be short men
What is considered to be perfect correlation?
When the coefficient of correlation, r, is equal to 1.00
When there is perfect correlation all points fall exactly on the regression line
The only requirement for perfect correlation is that the differences between the pairs all be the same
What is a regression line also referred to as?
The line of best fit
What is a scatterplot?
It is a graph of the scores of a bivariate frequency distribution
What do you see graphically in a scatterplot with negative correlation?
Increases in one variable are accompanied by decreases in the other variable (they have an inverse relationship).
The line of regression goes from the upper left corner of the graph to the lower right corner, it has a negative slope
Is there a such thing as a perfect negative correlation?
Yes, when r = -1.00
What is r and what does it tell us?
r is the correlational coefficient
It gives us the degree of relationship of two variables. One value on a plot isn’t more valuable than the other.
We just want to find how correlated the two factors are to find out how they relate.
And you’d rather have a stronger r value towards 1 or -1, because it helps solidify your data rather than a small
correlation
Examples of negative correlation
Daily rain and daily sunshine, grouchiness and friendships, highway driving speed and gas mileage
What does having zero correlation mean?
That the high and low scores on the two variables are not associated in any predictable manner
There is no linear relationship between the two variables
When r = 0.00 the regression line is literally just a horizontal straight line
What are the two computational formulas used to find r?
The blanched formula and the raw-score formula
We’re going to be using the blanched formula, because it uses means and standard deviations to find it (bro, but
I really like just using the raw-score formula
What is N when concerned with the computational formulas?
It isn’t every single data set or point. It is the number of number PAIRS.
What is r used for sample or population?
r is used for a sample statistic. The population parameter is symbolized by the Greek letter rho (weird p, in physics it’s
for density or pressure or somethin’)
What is considered small, medium, and large for the correlational coefficient?
Small- 0.10
Medium- 0.30
Large- 0.50