3. Bivariate Correlation and Regression Flashcards

1
Q

What variables can Pearson’s correlation coefficient be used with?

A
  • Binary/categorical variables

- Continuous variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Strength of correlations (Pearson’s r):

Weak =
Moderate =
Strong =

A

Weak = +/- .1

Moderate = +/- .3

Strong= +/- .5

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What does r squared tell us?

A

Amount of shared variance

AKA: The coefficient of determination

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What does the F ratio tell you (SPSS output)?

A

The ratio of how much the prediction of DV has improved by fitting the model, compared to how much error still remains

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is a correlation?

A

It is a way of measuring the extent to which two variables are linearly related

  • It measures the pattern of responses across variables
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What assumptions is the validity of Person correlation based on?

A
  • data is at continuous (scale/interval/ratio) level
  • data values are independent of each other; i.e. only one pair of readings per participant is used
  • a linear relationship is assumed when calculating Pearson’s coefficient of correlation
  • observations are random samples from normal or symmetric distributions
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Non-parametric correlation

Spearman’s p (rho, rs)

A
  • Variables are not normally distributed and the measures are on ordinal scale ( e.g. grades)
  • Works by first ranking the data n(numbers converted into ranks), and then running Pearson’s r on the ranked data
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Kendall’s τ (tau)

A
  • For small datasets, many tied ranks

- Better estimate of correlation in population than Spearman’s ρ

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Biserial correlation

A

When one variable is dichotomous, but there is an underlying continuum (e.g. pass/fail on an exam)

  • A point-biserial and biserial correlation is used to correlate a dichotomy with an interval scaled variable. The difference is that the point-biserial correlation is used when the dichotomous variable is a true or discrete dichotomy and the biserial correlation is used with an artificial dichotomy
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Point-biserial correlation

A

When one variable is dichotomous, and it is a true dichotomy (e.g. gender(?), pregnancy)

  • A point-biserial and biserial correlation is used to correlate a dichotomy with an interval scaled variable. The difference is that the point-biserial correlation is used when the dichotomous variable is a true or discrete dichotomy and the biserial correlation is used with an artificial dichotomy
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Partial correlation

A

In partial correlation the effect of the third variable on BOTH variables is controlled

  • focuses on unique contributions-compares the unique variation of one variable to the unique variation of the other
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Semipartial correlation

A

the effect of the third variable is controlled ONLY FOR ONE of the variables

  • compares the unique variation of one variable with the unfiltered variation of the other
  • focuses on the predictive value of
    all variables combined
  • shows the increment in the correlation of one variable above and beyond another
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is a regression?

A

A way of predicting things that you have not measured

  • Predicting an outcome variable from one predictor variable.

OR

  • Predicting a dependent variable from one independent variable
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is the SSm

A

SSM is the difference between SST and SSR and represents the amount of improvement in predictions when using the Best model over the most basic model (the mean).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Differences between group means can be characterized as a regression (linear) model if:

A

The experimental groups are represented by a binary variable (i.e. coded 0 and 1).

in this case the predictor variables are categorical and can be expressed in a regression linear model if substituted with dummy variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly