Topic 8: Correlation And Linear Regression Flashcards

1
Q

Correlation features (2)

A

Strength of linear relationship between only 2 variables.

Does not imply causation. E.g correlation between ice cream and sunburn, but there is no causation; the weather causes both!

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Regression

A

Estimate one variable on the basis of another

Assumes there IS causal effect from variable to the other

Attempts to describe the dependence of a variable on another variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Coefficient of correlation ranges from…

A

-1 to 1

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Perfect positive vs perfect negative correlation

A

R=1 positive
R=-1 negative
R=0 no association.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

How to test significance of correlation

A

Hypothesis testing
H₀=p=0
H₁=p≉0

Null is that the correlation in population is 0 (no correlation)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Steps

A

Set null/alternate

Choose significance level

Select test statistic

Formulate decision rule

Calculate statistic.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is the usual significance level

A

5%

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Test statistic formula

A

T=r x √n-2
/
√1−r²

Degrees of freedom. ~ t(n-2)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Formulate decision rule meaning

A

2 tailed test: reject null if |T| is>critical value

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Dependent variable

A

The variable being predicted

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Independent variable

A

Predictor variable, provides the basis for estimation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Bivariate regression analysis

A

2 variables, independent variable estimates the dependent

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Two assumptions for bivariate regression analysis

  1. What do we need in bivariate regression analysis
A

Relationship is linear

Both variables are interval or ratio scale

  1. We need slope and y-intercept
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Regression equation

A

Y=a+bx+e

Y=dependent variable (one we predict)
X=independent variable (basis of prediction)
A=intercept term
B=slope of regression line
E=error term (distance between actual point and predicted point, as cannot predict y perfectly)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Principle of least squares

A

Chooses line of best fit that minimises the errors (sum of squared errors)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Other ways to express regression line

A

Actual y (Y)= predicted y (Y hat) +error (e)
Y=Y hat+e

We saw earlier y=a+bx+e

Therefore
Y hat=a+bx

17
Q

How to find a and b in the regression line formula

A

FIND B first (the slope)

B=r(Sy/Sx)

THEN A (the intercept
A=Y hat-b(X bar)

R=correlation coefficient