Correlation and Multiple Regression Flashcards

1
Q

Partial correlation allows you to examine the relationship between ____ _______ while …

A

two variables while statistically controlling for (getting rid of) the effect of another variable that you think might be contaminating or influencing the relationship.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

In a partial correlation output, the word ‘none’ in the left-hand column indicates that…

A

no control variable is in operation.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Multiple regression is a family of techniques that can be used to explore the relationship between ___ _______, dependent _____ and a set of _____ _______. The dependent variable (the thing you are trying to explain or predict) needs to be a ______ variable with reasonably normally distributed scores. The independent (or predictor) variables can be ________ or _______ (___ _______).

A

one continuous
variable
independent variables
continuous
continuous or dichotomous
two categories

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Which is better for real world studies (compared to lab ones?) Correlation, Partial Correlation, or Multiple Regression? Why?

A

Multiple Regression

It allows a more sophisticated exploration of the interrelationship within a set of variables.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

3 of the main type of research question that multiple regression can help answer:

A

➢ how well a set of variables is able to predict a particular outcome
➢ which variable in a set of variables is the best predictor of an outcome
➢whether a predictor variable is still able to predict an outcome when the effects of another variable are controlled for (e.g. socially desirable responding).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Three types of multiple regression

A

➢ standard or simultaneous
➢ hierarchical or sequential
➢ stepwise.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

In hierarchical multiple regression (also called sequential regression), the (1)_______ variables are entered into the model in the order specified by the researcher, based on (2)______ ______. Variables, or sets of variables, are entered in steps (or blocks), with each (3)______ variable being assessed in terms of what it adds to the prediction of the (4)_______variable after the previous variables have been controlled for. For example, if you wanted to know how well optimism predicts life satisfaction, after the effect of age is controlled for, you would enter age in Block ___ and total optimism in Block ___.

A

(1) independent
(2) theoretical grounds
(3) independent
(4) dependent

1
2

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Multiple regression is not the technique to use on ____ samples, where the distribution of scores is very ______

A

small
skewed

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Formula used to find required N for the overall multiple regression

A

N = 50 + 8*m

m = no. of IVs

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Multicollinearity refers to… which you can check by…

Multicollinearity exists when the independent variables are _____ correlated ( r = __ and __).

A

refers to the relationship among the independent (predictor) variables, which you can check by generating a correlation matrix (chap 7).

highly correlated (r= .7 and above)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Singularity occurs when ____ _______variable is a combination of other _______ variables (e.g. when both subscale scores and the total score of a scale are included).

If you include highly intercorrelated variables in the model, multiple regression has difficulty ______ the ____ contribution of each predictor and may report them as not ______ _______.

A

one independent
independent

separating
unique

statistically significant

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

There are two types of outliers that might affect the results of a multiple regression analysis: _____ _____ and ______ ______

A

univariate outliers and multivariate outliers.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Univariate outliers can occur when …

A

a person or case has an unusual score on one of the predictor variables in the model (e.g. a very high income)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

When do you check for univariate outliers? Which variables do you check?
How do you deal with them when found?

A

part of the initial data screening process.

You should do this for all the variables, both dependent and independent, that you will be using in your regression analysis.

Outlying scores or cases can be deleted or, alternatively, assigned a score that is high but not too different from the remaining cluster of scores.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Multivariate outliers occur when the person or case has an unusual _____ of scores on the _____ and _____ variables.

E.g.: with age and income

A

combination
dependent and independent

e.g. a young person who reports a high income.

They may not necessarily be an outlier on the individual age or income variables, but the combination is unexpected and likely to deviate from the model generated by multiple regression.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Outliers are also called

A

‘influential cases’.

17
Q

homoscedasticity df

A

The variance of the residuals around predicted dependent variable scores should be the same for all predicted scores.

18
Q

Residuals are…

A

the differences between the obtained and the predicted dependent variable scores

19
Q

The residuals scatterplots allow you to check:

A

Normality,
Linearity,
Homoscedasticity

20
Q

Tolerance is an indicator of….

A

how much of the variability of the specified independent is not explained by the other independent variables in the model

21
Q

VIF(____ _____ _____), which is just the inverse of the Tolerance value (___ divided by ____). VIF values above __ would be a concern here, indicating _______.

A

variance inflation factor

1 divided by Tolerance

10

multicollinearity

22
Q

Tolerance is calculated using the formula ______ for each variable. If this value is very _____ (__ than .__) it indicates that the multiple correlation with other variables is high, suggesting the possibility of ______.

A

1 – R squared

Very small (less than .10)

multicollinearity