Summa Week 7 Flashcards
regression
What is regression?
a way of predicting the value of one variable from another
Regression is a _____ model of the relationship between ____ variables
hypothetical
two
The regression model is a ____ one.
Linear or curvilinear?
linear
We describe the relationship of a regression using the equation of a ________ _____
straight line
_______ association can be summarized with a line of best fit
bivariate
Bivariate association can be summarized with a ______________________
line of best fit
The _____________ would have the least amount of errors in a regression line
the line of best fit
What do we also call the “line of best fit”?
the regression line
What else do we also call the “line of best fit”?
the prediction line
What is the formula for a best fit line?
Yi = bo + b1X1 + E
or
Yi = B0 +B1X1 + Ei
What is bi in regression?
the regression coefficient for the predictor
what is the predictor?
the horizontal axis of a scatterplot used to find a regression line
what is another name of the gradient of the regression line?
slope
what is another name of the slope of the regression line?
gradient
What is the slope symbolized by?
bi
What does bi suggest regarding the relationship of a regression line?
the direction and/or strength of the relationship
What does b0 mean in a regression line?
the intercept (value of Y when X = 0)
When using b0 in a regression line, the value of Y is determined by X = ?
0
What also is b0?
the point at which the regression line crosses the Y-axis
What is another name of the point at which the regression line crosses the Y-axis?
the ordinate
When the regression line is properly fitted, the error sum of squares is ____ than that which would obtain with any other straight line.
smaller
When the regression line is properly fitted, the error sum of squares is smaller than that which would obtain with any other straight line. What is this describing?
the least squares criterion for determining the line of best fit/regression
What is the least squares approach?
the least squares line has a sum of errors (SE), and sum of squared errors (SSE) which is smallest of all straight line models
What does SE signify?
sum of errors in a least squares line
What does SSE refer to?
the sum of Squared errors in the least squares line approach
How good is the least squares line model?
only as good as the data given
do we need to test how well the least squares model fits the observed data in a regression?
hell yeah
What is another way of understanding regression (and by that token, ANOVA)?
total variation = explained variation + unexplained variation
What is the formula for regression?
Sum(Y-Y_)^2 = Sum(Y’-Y_)^2 + Sum(Y-Y’)^2
What is the sum of squares?
the proportion of variance accounted for by the regression model
the proportion of variance accounted for by the regression model
the sum of squares
What is a symbol for the sum of squares?
r^2
What is r^2?
the Pearson Correlation Coefficient Squared
What is te formula for the Pearson Correlation Coefficient Squared / proportion of variance accounted for by the regression model / r^2?
r^2 =
sum(Y’-Y_)^2/Sum(Y - Y_)^2
= Explained Variation / Total Variation
A regression allows you to predict Y values given a set of X values, however it does not allow you to attribute causality to the relationship. To or F?
True