Correlation and Regression Flashcards
The use of most correlation coefficients is based on three assumptions. They are…?
- The relationship between variables is linear. (Nonlinear may underestimate relationship)
- An unrestricted range of scores. (Restrictions underestimate the relationship)
- Homoscedasticity
This type of bivariate correlations coefficient is used when data on both variables is reported as ranks (ordinal).
Spearman rho (also known as Spearman rank coefficient)
This type of bivariate correlations coefficient is used when both variables are measured on a continuous (interval or ratio scale) and the relationship is linear.
Pearson r (also known as Pearson product moment correlation)
eta may be used as an alternate if the relationship isn’t linear.
This type of bivariate correlations coefficient is used when one variable is continuous interval or ratio scale) and the other is a true dichotomy (e.g., pregnant or not).
POINT biserial correlation coefficient
This type of bivariate correlations coefficient is used when one variable is continuous and the other is an artificial dichotomy. An artificial dichotomy occurs when a continuous variable is dichotomized. (e.g., a range of test scores classified as pass/fail)
biserial correlation coefficient
This type of bivariate correlations coefficient is used when both variables are nominal.
Contingency correlation coefficient
The bivariate correlation coefficient can be interpreted directly as the degree of association between the predictor and criterion. Alternatively, it can be squared to derive a measure of shared variability and indicates the amount of variability in one variable that’s explained (accounted for) by variability in the other variable.
This is called _____?
Coefficient of determination
Correlation is often of interest because the goal is to use obtained predictor scores to estimate criterion scores. Prediction is made possible through _____?
Regression analysis.
The accuracy of the prediction increases as the correlation between predictor and criterion increases.
When two or more predictors will be used to estimate status on a single criterion that’s measured on a continuous scale, this is the appropriate technique.
Multiple regression
What are the two forms of multiple regression?
Standard (simultaneous) multiple regression: entering all predictor data at once.
Stepwise multiple regression: adding or subtracting one predictor at a time to identify the fewest number or predictor needed to make accurate predictions.
When two or more continuous predictors will be used to estimate status on two or more continuous criteria, this is the appropriate technique.
Canonical correlations
When two or more predictors will be used to estimate status on a single criterion that’s measured on a nominal scale, this is the appropriate technique.
Discriminant function analysis
When the assumptions for discriminant function analysis are not met (e.g., when scores on the predictors are not normally distributed), this is the alternative technique.
Logistic regression