Regression Flashcards
What is a correlation?
An association or dependency between two independently observed variables
What graph/plot can we use to visualise a correlation?
A Bar chart
B Bar Graph
C Q-Q Plot
D Scatterplot
D Scatterplot
Pearson correlation coefficient scores give an r value ranging between -1 and 1.
A score of 0 indicates what?
What does a score of 1 indicate?
What does a score of -1 indicate?
What do positive and negative scores indicate?
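0 = no linear relationship between the variables
1 = perfect positive linear relationship (the variables need not be identical)
-1 = perfect negative (inverse) linear relationship
Positive = variables are positively correlated (as one increases, so does the other)
Negative = variables are negatively correlated (as one increases, the other decreases)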
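A minimal sketch of how r behaves at the three landmark values, using made-up data. The helper name pearson_r and the numbers are illustrative assumptions, not part of the original cards. Note that y = 2x gives r = 1 even though the values differ, and that r = 0 only rules out a linear relationship.

```python
import math

def pearson_r(xs, ys):
    """Pearson correlation coefficient for two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sum of cross-products of deviations from the means
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    # Sums of squared deviations for each variable
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)

x = [1, 2, 3, 4, 5]                        # made-up data
r_pos = pearson_r(x, [2, 4, 6, 8, 10])     # y = 2x, perfect positive line
r_neg = pearson_r(x, [10, 8, 6, 4, 2])     # perfect negative line
r_zero = pearson_r(x, [1, 2, 3, 2, 1])     # rises then falls, no linear trend

print(r_pos, r_neg, r_zero)
```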
If variables are both interval/ratio, we use a ___________ coefficient test, giving us an ___ value
Pearson’s coefficient
r value
If both variables are ordinal, we use either Spearman’s rank correlation, giving us a ____ value, or Kendall’s rank correlation, giving us a ____ value
Spearman’s = rho (ρ)
Kendall’s = tau (τ)
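Both rank coefficients can be sketched by hand for small samples. The code below assumes there are no tied values, uses the no-ties shortcut formula for Spearman's rho and the tau-a form of Kendall's tau, and runs on made-up rankings from two hypothetical judges. All function names are illustrative.

```python
def ranks(values):
    """Rank values from 1 (smallest) upward; assumes no ties."""
    order = sorted(values)
    return [order.index(v) + 1 for v in values]

def spearman_rho(xs, ys):
    """Spearman's rho via the shortcut formula, valid when there are no ties."""
    n = len(xs)
    d_squared = sum((rx - ry) ** 2 for rx, ry in zip(ranks(xs), ranks(ys)))
    return 1 - 6 * d_squared / (n * (n ** 2 - 1))

def kendall_tau(xs, ys):
    """Kendall's tau-a: (concordant - discordant) / total pairs; assumes no ties."""
    n = len(xs)
    score = 0
    for i in range(n):
        for j in range(i + 1, n):
            # A pair is concordant when both variables order it the same way
            product = (xs[i] - xs[j]) * (ys[i] - ys[j])
            score += 1 if product > 0 else -1
    return score / (n * (n - 1) / 2)

judge_a = [1, 2, 3, 4, 5]   # made-up ordinal ratings
judge_b = [2, 1, 4, 3, 5]
rho = spearman_rho(judge_a, judge_b)
tau = kendall_tau(judge_a, judge_b)
print(rho, tau)
```

On the same data tau usually comes out smaller in magnitude than rho, because the two coefficients count disagreement differently.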
If both variables are dichotomous (binary), we use a ____ coefficient.
phi coefficient
If we have one dichotomous variable, and one interval/ratio variable, we use a _____-______ coefficient, giving us an ___ value.
point-biserial coefficient
rpb value
A partial correlation is used when we have more than __ variables, and we want to test the _________ of a pair, whilst __________ for another variable.
partial correlation = used when we have more than 2 variables
we want to test the correlation/association of one pair whilst accounting (controlling) for a third
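A sketch of a first-order partial correlation, built from the three pairwise Pearson r values using the standard formula r_xy.z = (r_xy - r_xz * r_yz) / sqrt((1 - r_xz^2)(1 - r_yz^2)). The data are made up so that x and y are related only through z. The function names are illustrative assumptions.

```python
import math

def pearson_r(xs, ys):
    """Pearson correlation coefficient for two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)

def partial_r(xs, ys, zs):
    """Correlation between xs and ys whilst accounting for zs."""
    r_xy = pearson_r(xs, ys)
    r_xz = pearson_r(xs, zs)
    r_yz = pearson_r(ys, zs)
    return (r_xy - r_xz * r_yz) / math.sqrt((1 - r_xz ** 2) * (1 - r_yz ** 2))

# Made-up data: x and y each depend on z plus unrelated noise
z = [1, 2, 3, 4]
x = [2, 1, 2, 5]
y = [0, 5, 0, 5]

raw_r = pearson_r(x, y)          # about 0.33 before accounting for z
controlled_r = partial_r(x, y, z)  # about 0 once z is accounted for
print(raw_r, controlled_r)
```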
Multiple linear regressions describe/examine what?
The relationship between one or more predictor variables and a criterion variable
True or false: virtually all statistical models we use (ANOVA, t-test, correlations) are special cases of the regression model
True
The regression line has the equation:
y = ax + b
Where y is the ______, x is the _______, a is the ____/______, and b is the _-_______
y is the height (vertical-axis value), x is the length across (horizontal-axis value)
a = slope/gradient, b = y-intercept
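A minimal sketch of finding the slope and intercept of y = ax + b for made-up data. It assumes the line is fitted by ordinary least squares (the cards do not say how the line is obtained), and fit_line is an illustrative name.

```python
def fit_line(xs, ys):
    """Return (a, b) for the least-squares line y = a*x + b."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    a = sxy / sxx                # slope: change in y per one-unit change in x
    b = mean_y - a * mean_x      # y-intercept: value of y when x is 0
    return a, b

xs = [1, 2, 3, 4, 5]   # made-up predictor values
ys = [2, 4, 5, 4, 5]   # made-up outcome values
a, b = fit_line(xs, ys)

predicted_at_six = a * 6 + b   # use the line to predict y for a new x
print(a, b, predicted_at_six)
```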
A residual error is how far a ____ _____ is from the _____ __ ___.
how far a data point lies from the line of fit
SST = ___ + ____
SST = SSR + SSM (total sum of squares = residual sum of squares + model sum of squares)
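The decomposition can be checked numerically. This sketch assumes SSR means the residual sum of squares and SSM the model sum of squares, fits the line by ordinary least squares (the identity relies on that), and uses made-up data.

```python
xs = [1, 2, 3, 4, 5]   # made-up predictor values
ys = [2, 4, 5, 4, 5]   # made-up outcome values

n = len(xs)
mean_x = sum(xs) / n
mean_y = sum(ys) / n

# Least-squares slope and intercept
a = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys)) / sum(
    (x - mean_x) ** 2 for x in xs
)
b = mean_y - a * mean_x
predicted = [a * x + b for x in xs]

# Total variation of the observed values around their mean
sst = sum((y - mean_y) ** 2 for y in ys)
# Variation left unexplained: observed minus predicted
ssr = sum((y - p) ** 2 for y, p in zip(ys, predicted))
# Variation explained by the model: predicted minus mean
ssm = sum((p - mean_y) ** 2 for p in predicted)

r_squared = ssm / sst   # proportion of variation explained by the model
print(sst, ssr, ssm, r_squared)
```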
Prediction error is the difference between the ______ value and the _____ value
Pe = difference between the predicted value and the actual value
The best fit of the ____ occurs by minimising _____ ______
best fit of the model occurs by minimising prediction error
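One way to see the minimising idea is a brute-force search: try many candidate lines and keep the one with the smallest total squared prediction error. This is only an illustrative sketch on made-up data. It assumes error is totalled as a sum of squares, and real software solves for the best line directly rather than searching a grid.

```python
def sum_squared_error(a, b, xs, ys):
    """Total squared prediction error of the line y = a*x + b."""
    return sum((y - (a * x + b)) ** 2 for x, y in zip(xs, ys))

xs = [1, 2, 3, 4, 5]   # made-up predictor values
ys = [2, 4, 5, 4, 5]   # made-up outcome values

# Candidate slopes 0.0 to 1.0 and intercepts 0.0 to 4.0, in steps of 0.1
candidates = [(i / 10, j / 10) for i in range(0, 11) for j in range(0, 41)]

# Keep the candidate line with the smallest total squared error
best_a, best_b = min(
    candidates, key=lambda ab: sum_squared_error(ab[0], ab[1], xs, ys)
)
best_error = sum_squared_error(best_a, best_b, xs, ys)

# Nudging the slope away from the best value makes the error larger
worse_error = sum_squared_error(best_a + 0.1, best_b, xs, ys)
print(best_a, best_b, best_error, worse_error)
```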