Regression Flashcards
covariance
how 2 variables covary with each other. uses raw score units. unable to tell strength this way. s
correlation
scale-free degree of linear relationship between 2 varibles. r
scatterplot
strength of association, direction of it, shape of it
pearson correlation
magnittude and direction of linear relationship
change in magnitude
outliers, extreme #’s inflating the mean, curvlinear, errori in x or y
correlation does not equal
causality
correlation coefficient
strength of relationship between 2 variables. r
Y=mx + b in regression form
Y= a +bx, Y=bo + b1x
bo
regression constant, intercept
b1
regression coefficient, slope
when does regression line pass through y axis?
when x=0
regression line always passes through
x bar, y bar
means of predicted y
= means of observed y
Error
actual - predicted Y
least squares criterion
slope and intercept to minimal distance between actual and predicted y. decreases sum of square residuals
improves ability to predict Y from using predictors (x)
regression
R
corrleation betwen observed and predicted = absolute value of correlation
Rsquared
proportion of variance in Y that is accounted for in linear relationship in x. biased, overestimates population
adjusted R squared
unbiased estimate of population
stand error orf estimate
error present in predicting y from x. decrased SEE is more accurate
constant unstandardized B
Regression constant, bo, intercept
underneath the constant in unstandardized B
slope, regression coefficient
unstandardized coeff
for every unit increase in x, there is a [ ] increase in y
R2 = .43
approxing 43% of the variance in y is accounted for by its linear relationship with x
for every 1 unit increase in x, y increases by [what factor]
slope
multiple regression
looking at multiple predictors
1st order model
x has linear relationship with y and does not interact but can correlate
holding, controlling, partialling out
studing x1 on y, but x2 can affect that relationship (can be corr with x1, y or both, so we remove the effect of x2
effect size R^2
proportion of y that is accounted for in model
F test tells what is significantly different than zero
R, Rsquared
T test tells what is significantly different than zero
regression coeffecition (slope) when controlling for effects of other variables
APprox [ ] percent of the variance in y is accounted for in its linear relationship with x
adjusted r^2
a 1 unit increase in x1 is associated with a [-] in loans
decrease [slope]
unique contributors
looking at variance over one predictor over and above another