Correlation & Partial correlation (W8)✅ Flashcards

1
Q

What is meant by bivariate linear correlation?

A

Examines the relationship between two variables

Relationships vary in:

  1. Form
    * Linear
    * Curvilinear
  2. Direction
    * Positive
    * Negative
  3. Magnitude/strength
    * r = - 1 (perfect negative relationship)
    * r = +1 (perfect positive relationship)
    * r = 0 (no relationship)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is the hypothesis testing in linear correlation?

A
  • Measure the relationship between two variables in a sample
    => use sample statistics to estimate the population parameters
  • Always assume null hypothesis is true: there is no relationship between the population variables.
  • Once we’ve determined the relationship in our sample, inferential analyses allow us to determine: the CHANCE of measuring a relationship of that magnitude when the null hypothesis is true
    -> if p < 0.05 then can reject null hypothesis
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What are the main 4 parametric assumption of correlational studies (and 1 thing to look out for)?

A
  1. Both continuous variables
    -> if one or both is ordinal, use non-parametric alternative
  2. Related pairs: Each participant should have a pair of values (x, y)
  3. Absence of outliers (can skew correlation if present)
  4. Linearity: scatterplot should be best explained with a straight line -> NOT curved line

Note: sensitive to range restrictions

=> If seriously violated these assumptions: use non-parametric equivalent (Spearman’s rho)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is Pearson’s correlation coefficient (r)?

A
  • Long name – Pearson product-moment correlation coefficient (PPMCC)
  • Correlation coefficient (r) is a measure of the strength of
    association between the two quantitative, continuous variables
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is the relationship between covariance and correlation coefficient (r)?

A
  • Covarience (cov): measure of the variance shared between x and y variables
    1. For each datapoint, calculate the difference from the mean of x and y (e.g. xi - xm AND yi - ym)
    2. Multiply the difference
    3. Sum the multiplied differences
    4. Divide by N-1
  • Correlation coefficient (r): the ratio of covariance to separate variance (multiply SDs of x and y)
    -> r = cov(x,y) / (Sx*Sy)
  • r reflects how well a straight line fits the datapoint (strength of correlation)
    -> If datapoints cluster closely around the line, r is FURTHER from 0 (STRONGER)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is sampling error in correlational studies?

A

Difference in r-values if we recruit another sample from the population and measure the correlation strength of it.

r-distribution have a 95% confidence interval
-> there is a 5% chance that the population’s r falls above/below the 95% CI limit

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is meant by shared variance? (r^2)

A

r^2: expresses the proportion of the separate variances that is shared

NOTE! r = .8 ix 4x as strong as r = .4 (power of squared)

r is another useful measure of effect size -> squared to give a measure of shared variance (similar to partial eta squared)
=> Tell how much the variance in y can be ‘explained by’ x

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is meant by partial correlation?

A
  • Allows us to examine the relationship between two variables, while removing the influence of 3rd variable
  • we can control for [3rd variable] by statistical means
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

How to interpret partial correlation: change in r when control for the 3rd variable?

A
  1. If r-value decreases AND is not significant -> suggest the relationship between V1 and V2 may be explained by the influence of V3 on both 1&2.
  2. If the r-value decreased but remained significant -> suggest that the relationship was partially explained by V3
  3. If the r-value not decreased & remained significant -> suggest that the relationship was not influenced by V3.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly