Regression Flashcards
What does a correlation allow us to do?
allows us to quantify / measure the strength and direction of a relationship.
What does a Pearson’s correlation assume, and what are it’s outputs?
A Pearson’s correlation assumes that a relationship is linear (straight line). It’s reports a t-value, r(cor), and p-value. Only r(cor) and p needed to report
How does a Spearman’s test work? What does it assume?
A Spearman’s test assumes monotonicity. In ascending order it ranks on x- and y-axis in and plots them. Prints s-value, s(rho), and pvalue
What are the main characteristics of a regression line?
outcome v (Y) = slope X + intercept.
What is the slope and what is the intercept.
The slope is what value the regression line begins on the y-axis. The intercept is how much (+ or -) y is predicted to change for every unit increase along the x-axis of the line is affected for every 1 unit along the x-axis. IF x = zero, y = the intercept.
Why do we always square^2 our test equations?
So our postivites and neatives don’t cancel out.
What other name is the slope know as?
Diversity. An lm function will print the intercept and diversity scores
What is a regression line called when it has one outcome variable and two predictor variables?
Regression coefficient
Why does variation allocation change when you add new predictors?
R now has the data to express more accurately which variables account for the variation. Additional varaibles claim variation.
This is the same for ANOVA
What does an interaction variable indicate?
That the affect of variable 1 is going to depend on variable 2
What does a postive and negative interaction do to a reression sheet (3D), and what does it imply for the relationship?
Pos’ interction: bends the sheet up. This means that as V1 increases it is likely to have more of an effect on V2.
Neg’ interaction: bends the sheet down. this means as V1 increases it is likely to have less affect on V2.
What does the printed f-statistic tell us in regression?
This represents how well our whole model fits the varation.
What does a t-statistic tell us in regression?
How likely it is to get a regression line for that variable if the null was true. Null = no difference between regression line and Y- mean or ‘flat line’
What are the two sources of variance used to calculate if a regression model is significant?
Model Sum of Sqaures (SSmod)
Residual Sum of Squares (SSres)
What is the (SSmod) calculating?
The difference between regression line predictions and mean Y. Null: predicts no difference between regression line and mean.
df = k