Associations Between Two Continuous Variables Flashcards
Sometimes we are interested in testing if two continuous variables are associated with one another. What is the most common form of association studied?
Linear association
Positive association
People who score high (or low) on the first variable also tend to score high (or low) on the second variable
Negative association
People who score high on the first variable tend to score low on the second (or vice versa)
What is the most common index of a linear association?
Pearson correlation coefficient
Sum of the products of deviations (SP)
Reflects the co-variability (shared variation) of X and Y
What produces big positive SP values?
Lots of above/above pairs (both scores above their means)
AND
Lots of below/below pairs (both scores below their means)
What produces big negative SP values?
Lots of above/below pairs and lots of below/above pairs
What produces near 0 SP values?
Equal mix of above/above, below/below, above/below, and below/above pairs
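The SP patterns above can be verified directly; this is a minimal sketch (the function name `sp` and the toy data are my own, not from the cards):

```python
def sp(xs, ys):
    # Sum of products of deviations: SP = sum((X - mean_X) * (Y - mean_Y))
    mx = sum(xs) / len(xs)
    my = sum(ys) / len(ys)
    return sum((x - mx) * (y - my) for x, y in zip(xs, ys))

# Above/above and below/below pairs dominate -> large positive SP
print(sp([1, 2, 3, 4, 5], [2, 4, 6, 8, 10]))   # 20.0
# Above/below and below/above pairs dominate -> large negative SP
print(sp([1, 2, 3, 4, 5], [10, 8, 6, 4, 2]))   # -20.0
# Even mix of all four pair types -> SP near 0
print(sp([1, 2, 3, 4, 5], [5, 1, 3, 1, 5]))    # 0.0
```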
r squared is referred to as the…
coefficient of determination
r squared reflects…
the proportion of variance that our predictor variable accounts for in our outcome variable (variability explained by linear regression)
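As a worked sketch of r and r squared (illustrative function name and toy data, assuming the standard formula r = SP / sqrt(SS_X * SS_Y)):

```python
import math

def pearson_r(xs, ys):
    # r = SP / sqrt(SS_X * SS_Y)
    mx, my = sum(xs) / len(xs), sum(ys) / len(ys)
    sp = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    ssx = sum((x - mx) ** 2 for x in xs)
    ssy = sum((y - my) ** 2 for y in ys)
    return sp / math.sqrt(ssx * ssy)

r = pearson_r([1, 2, 3, 4, 5], [2, 3, 5, 4, 6])
print(r)       # 0.9
print(r ** 2)  # ~0.81: X accounts for about 81% of the variance in Y
```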
3 factors influencing the size of r:
Distribution of variables
- Perfect correlations only exist if the shapes of the distributions are exactly the same (positive) or exactly opposite (negative)
Reliability of measures
- Perfect correlations only exist with perfect reliability in both measures
Restriction of range
- Restricting the range of scores on one or both variables can weaken correlations
Regression analysis using a single predictor variable is referred to as…
“simple regression”
Regression analysis involving two or more predictors is referred to as…
“multiple regression”
When two variables are linearly associated, this association can be described using a simple equation:
Y = bX + a
What do each of the variables in the regression equation (Y = bX + a) represent?
Y - represents scores on the outcome variable
b - represents slope of best fitting line
X - represents scores on the predictor variable
a - fixed constant representing the Y intercept
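The slope and intercept can be estimated by least squares as b = SP / SS_X and a = mean(Y) - b * mean(X); a minimal sketch with hypothetical data:

```python
def fit_line(xs, ys):
    # Least-squares estimates: b = SP / SS_X, a = mean(Y) - b * mean(X)
    mx, my = sum(xs) / len(xs), sum(ys) / len(ys)
    sp = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    ssx = sum((x - mx) ** 2 for x in xs)
    b = sp / ssx
    a = my - b * mx
    return b, a

b, a = fit_line([1, 2, 3, 4, 5], [2, 3, 5, 4, 6])
print(b, a)  # b ≈ 0.9 (slope), a ≈ 1.3 (Y intercept)
```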
Standard error of estimate
A measure of the standard distance between a regression line and the actual data points
Basically, it tells us how much error variance is in our model
How is SS error related to r?
As r approaches 1, SS error will become smaller
As r approaches 0, SS error will become larger
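The standard error of estimate is sqrt(SS_error / (n - 2)); a sketch, using an illustrative function name and toy data of my own:

```python
import math

def ss_error_and_see(xs, ys):
    # Fit the regression line, then sum squared distances of points from it
    mx, my = sum(xs) / len(xs), sum(ys) / len(ys)
    b = sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / sum((x - mx) ** 2 for x in xs)
    a = my - b * mx
    ss_error = sum((y - (b * x + a)) ** 2 for x, y in zip(xs, ys))
    see = math.sqrt(ss_error / (len(xs) - 2))  # standard error of estimate
    return ss_error, see

print(ss_error_and_see([1, 2, 3, 4, 5], [2, 3, 5, 4, 6]))
```

For these data SS_error works out to (1 - r²) · SS_Y = (1 - 0.81) · 10 = 1.9, which is the link to r: the closer r is to 1, the smaller SS_error.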
What are the null and alternative hypotheses for the b value in a simple regression?
H0: β = 0 (there is no linear association between X and Y; the slope does not differ significantly from 0)
H1: β ≠ 0
To test the null of a simple regression we partition the variance in Y (DV) into two components:
1. Variability in Y predicted from the linear association
2. Variability in Y attributable to error
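This partition can be sketched in code: SS_total splits into SS_regression plus SS_error, and dividing each SS by its df gives the variances that form F (function name and data are illustrative):

```python
def partition_variance(xs, ys):
    # Split SS_total into SS_regression (predicted) + SS_error, then form F
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    b = sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / sum((x - mx) ** 2 for x in xs)
    a = my - b * mx
    preds = [b * x + a for x in xs]
    ss_total = sum((y - my) ** 2 for y in ys)
    ss_reg = sum((p - my) ** 2 for p in preds)
    ss_error = sum((y - p) ** 2 for y, p in zip(ys, preds))
    f = (ss_reg / 1) / (ss_error / (n - 2))  # df_reg = 1, df_error = n - 2
    return ss_total, ss_reg, ss_error, f

print(partition_variance([1, 2, 3, 4, 5], [2, 3, 5, 4, 6]))
```

Note that SS_regression + SS_error adds back up to SS_total, which is the partition the null hypothesis test relies on.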
What are the 4 assumptions that simple regression (and its NHST) is based on?
- Independence of observations
- Linear relationship between X and Y
- Residuals are normally distributed with a mean of 0
- Homoscedasticity of residuals
What makes regression more like t-tests and less like ANOVA?
No real follow-up tests are relevant because there is only one predictor, so there is nothing further to break down.
Regression is not an omnibus test.
Total squared error is also known as…
sum of squared error (SS error)
Sum of squares (SS)
Sum of the squared deviations
A higher SS value indicates a larger degree of variability
A lower SS value indicates the data do not vary considerably from the mean value
Regression degrees of freedom equals…
1
Error degrees of freedom equals…
n - 2
Anytime an SS value is divided by its df value it is…
an index of variance (a mean square, MS)
The Pearson correlation coefficient (r) is…
an index of association that assesses the magnitude and direction of linear relation between two variables
AND
an index of co-variability of X and Y relative to the variability of X and Y separately
z-score represents…
an individual score’s standing within the distribution for that score
Basically, a z-score of 1 is 1 standard deviation above the mean
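A minimal sketch of the z transformation, using the sample SD (n - 1 denominator); the function name and data are my own:

```python
import math

def z_scores(xs):
    # z = (X - mean) / SD; a z of 1 sits one standard deviation above the mean
    m = sum(xs) / len(xs)
    sd = math.sqrt(sum((x - m) ** 2 for x in xs) / (len(xs) - 1))
    return [(x - m) / sd for x in xs]

print(z_scores([2, 4, 6]))  # [-1.0, 0.0, 1.0]
```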
3 special cases of Pearson correlation
Point biserial correlation
- Correlation between dichotomous variable and continuous variable
Phi coefficient
- Correlation between two dichotomous variables
Spearman rank-order correlation
- Correlation between ordinal variables
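The Spearman case can be sketched as Pearson r applied to ranks; this toy implementation assumes no tied scores, and all names are illustrative:

```python
import math

def pearson_r(xs, ys):
    # r = SP / sqrt(SS_X * SS_Y)
    mx, my = sum(xs) / len(xs), sum(ys) / len(ys)
    sp = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    return sp / math.sqrt(sum((x - mx) ** 2 for x in xs) * sum((y - my) ** 2 for y in ys))

def ranks(xs):
    # Rank each score (1 = smallest); assumes no tied scores for simplicity
    order = sorted(range(len(xs)), key=lambda i: xs[i])
    out = [0] * len(xs)
    for rank, i in enumerate(order, start=1):
        out[i] = rank
    return out

def spearman(xs, ys):
    # Spearman rank-order correlation = Pearson r computed on the ranks
    return pearson_r(ranks(xs), ranks(ys))

# A perfectly monotonic (but nonlinear) relation still yields rho = 1
print(spearman([1, 2, 3, 4], [1, 8, 27, 64]))  # 1.0
```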
Why put r into z scores? (think… r formula using z scores)
Because it standardizes r so it can be compared across different studies. Dividing by sample size also standardizes, since n differs from study to study
When we standardize both X and Y (in z form), their means equal zero. Thus, the variability (SS) in each of them…
must be equivalent (SSY = SSX)
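The z-score form of r can be sketched as the average product of paired z-scores, r = Σ(z_X · z_Y) / (n - 1); names and data are illustrative:

```python
import math

def z_scores(xs):
    m = sum(xs) / len(xs)
    sd = math.sqrt(sum((x - m) ** 2 for x in xs) / (len(xs) - 1))
    return [(x - m) / sd for x in xs]

def r_from_z(xs, ys):
    # r = sum(z_X * z_Y) / (n - 1): the average product of paired z-scores
    n = len(xs)
    return sum(zx * zy for zx, zy in zip(z_scores(xs), z_scores(ys))) / (n - 1)

print(r_from_z([1, 2, 3, 4, 5], [2, 3, 5, 4, 6]))  # ≈ 0.9, same as the SP formula
```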
What is the difference between the homogeneity of variance assumption and the homoscedasticity of residuals assumption?
“Homogeneity of variance” is used in the ANOVA context
“Homoscedasticity” is used in the regression context
Both assume that the variance in residuals is the same everywhere