Unit 11 Correlation vs Regression Flashcards

1
Q

Correlation Analysis examines..

A

The strength (intensity) of the relationship between two experimental variables.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Two things correlation analysis assesses

A

If the amount of variation in one variable is explained by another variable
If the explained variation is greater than can be expected by chance alone, ie. is it significant

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Regression Analysis…

A

Measures the form of the relationship and describes
the functional relationship between variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Where does regression usually apply

A

Applies when we have control of the X variable
(independent) and can measure it essentially without error

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

the centroid is…

A

The equivalent of the mean for a double statistical series

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

The covariance is

A

The covariance between X and Y (Sxy) is the equivalent to the variance (S) for a double statistical series
(two continuous variables, X and Y)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What does covariance describe

A

Describes the dispersion of two or more quantitative variables
Is symmetric: The covariance Sxy equals covariance Sxy(Diagonally)
❑ Variances are always positive (squared)
❑ Covariances can be negative: vary from -∞ to + ∞

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Variance is…

A

Covariance matrix or dispersion matrix (dispersion around the mean) (S matrix)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Covariance measures…

A

Covariance measures the joint dispersion of two quantitative variables around their centroid

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Correlation is…

A

Correlation is a statistical technique used to determine the degree to which two variables are related

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Two correlation coefficients are considered:

A

Pearson product moment correlation coefficient (r)
employed with interval or ratio scaled variables (parametric)
Spearman rank order correlation coefficient (r rho) employed with ordinal or ranked data (non-parametric)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Pearson Correlation Coefficient (rxy) measures

A

Only the linear relationship between two quantitative variables
It is the covariance of standardized variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

In correlation a positive relationship means…

A

Means that ‘individuals’ obtaining high scores on one variable tend to obtain high scores on a second variable.
The converse is also true, i.e. individuals scoring low on one variable tend to score low on a second variable.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

In correlation a negative relationship means…

A

Means that ‘individuals’ scoring low on one variable tend to score high on a second variable
Conversely, individuals scoring high on one variable tend to score low on a second variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Residuals is..

A

The distance between points and what you are measuring.
They are the error terms.
If there are small residuals then there is strong a correlation.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

The total variance of y can be partitioned in:

A
  1. explained variance
  2. residual variance
17
Q

Linear regression is:

A

Linear regression is a form of statistical modelling (we are predicting the future)

18
Q

Linear regression can aim to:

A

Describe a functional model linking y and x
Test hypothesis regarding model parameters.
Predict y as a function of y.

19
Q

How to quantify correlation between variables:

A

Pearson’s correlation coefficient

20
Q

What does Pearson’s Correlation Coefficient assess.

A

The sign of r denotes the nature of association (positive or negative)
The value of r denotes the strength of the association between (-1) and (+1)
The further r is away from 0 the larger the sample size. So the less likely the association could have occurred by chance if there was no real association between the variables considered.

21
Q

The closer the r value is to 0…

A

The greater the variation around the line

22
Q

When do we use Pearson r significance test?

A

To test if a relationship exists between two variables, and determine the magnitude and direction.
To test if two sets of paired measurements, neither of which is clearly independent of the other, are linearly associated.

23
Q

Assumptions of Pearson’s r significance test

A

For each subject in the study there must be related pairs of scores:
- If a subject has a score on variable x, then the same subject must have a score on variable y.
- The variables must have a linear relationship. ie. the relationship can be most accurately represented by a straight line.

24
Q

Rules for Pearson’s correlation hypothesis testing

A
  • Significance (two-tailed) or p-value > 0.05, reject H0 There is no correlation between the two variables.
  • Significance (two-tailed) or p-value ≤ 0.05, reject H0
    Assume there’s linear correlation between the variables. There is a significant association between variables.
25
Q

Word of caution for correlation and causation

A

Correlation doesn’t imply causation

26
Q

Correlation analysis examines:

A

the strength (intensity) of a relationship between two variables.

27
Q

Regression analysis measures:

A

the form of the relationship and describes the functional relationship between variables.
regression usually applies when we have control of the x variable (independent) and can measure it without error.

28
Q

Regression:

A

While correlation is concerned with the magnitude and direction of the relationship, regression focuses on using the relationship as a prediction model

29
Q

Linear regression is a form of:

A

Statistical modelling
(we are predicting the future)