Midterm 1 - Correlation & Regression (Ch. 3) Flashcards
Correlations serve to determine the extent to which ______
changes in scores for 2 variables are related
Pearson’s correlation coefficient provides a quantitative description of the ___ and ___ of a straight-line relationship between two variables
direction and strength
Pearson’s product-moment correlation coefficient describes relationships between scores from ____
interval and ratio scales
what is the range for correlation coefficients?
-1 to 1
Pearson correlation (r) can be calculated by dividing _________ by _______
degree to which X and Y vary together (covariability)
degree to which X and Y vary separately
(T/F) Pearson correlations can sometimes be used to describe non-linear relationships
FALSE
Pearson correlations are (slightly/strongly) affected by outliers
strongly!
A regression line is a straight line that describes how _____ changes as ____ changes. Essentially, it provides an average statement about the change in __ that is ass w change in __
a dependent (outcome) variable changes as an independent (predictor) variable changes
change in Y ass w change in X (often used to predict value of y for a given value of x)
What is the criterion to satisfy when placing a regression line?
sum of all vertical distances from each point to line should be as small as possible
What is the least-squares regression line?
line of Y on X that makes the sum of the squares of the vertical distances of points from line as small as possible
in a regression line, vertical distance to the line is calculated by subtracting ___ from ___
subtract predicted y(w hat; on line) from observed y (point)
In a regression line, Vertical distance to the line is also called the ____ or ____
error of prediction or residual
What is the regression equation?
ŷ = bx + a
a = intercept (y when x is 0)
b = slope (amount y changes when x increases by 1)
What does the regression line look like when the correlation is 0?
horizontal to x axis
to determine how useful/accurate a regression line is for prediction, we compute the ___ which measures the ______ in one variable that can be determined from the relationship with the other variable
r(squared) - aka coefficient of determination
measures the proportion of variability….. (basically shared variability)