Correlation and Regression Flashcards

1
Q

Correlation measures (2)

A

1) direction (positive/negative)
2) linear association between 2 variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

range of correlation coefficient

A

-1 to + 1

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

measures co-movement between 2 variables

A

function of covariance

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

whats the difference between covariance and correlation

A

summation of movement between 2 variables within a range of - to + infinity or - to +1

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What doesn’t a scatter plot show?

A

The slope of the correlation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is spurious correlation?

A

correlation that is random chance; correlation is not causation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

linear relationship between dependent and independent variable

A

linear regression

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

dependent variable

A

y

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

independent variable

A

x

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

assumptions of classic normal linear regression model

A
  1. variables are linear
  2. independent variable, x is not random
  3. error is 0
  4. variance is the same for all observations
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

2 measures of regression quality

A
  1. standard error of estimate
  2. coefficient of determination
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

how is standard error of estimate calculated?

A

root sum of differences between y and x components of the regression

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

why is the coefficient of determination preferred?

A

SEE shows the change but CoD reveals how much of the variability of Y is because of x

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

3 things to know for hypothesis testing with regression/correlation

A
  1. estimated parameter value
  2. hypothesized parameter value
  3. CI around the estimated parameter value at a given degree of confidence (probability)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is analysis of variance?

A

measurement of variances across means

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

how is multi-regression different from regression

A

effect multiple independent variables (x) have on a a dependent variable

17
Q

assumptions of multi-regression

A
  1. variable relationships are linear
  2. independent variables are not random
  3. error of independent variable is 0, error terms are not correlated and are part of a normal distribution
  4. variance for all observations is the same
18
Q

multi-regression hypothesis differences

A
  1. null - all coefficients are 0
  2. alternative hypothesis - at least 1 hypothesis is not 0
19
Q

difference between t-test and f-test

A

f-test - overall significance of the model
t-test - significance of each coefficient within the model

20
Q

3 violations of multi-regression assumptions:

A
  1. heteroskedasticity
  2. serial correlation
  3. multico
20
Q

homoskedasticty

A

variation of errors is constant across observations

21
Q

whats the result of heteroskedasticity

A

inflated t-statistics and underestimated errors

22
Q

serial correlation

A

regression error terms are correlated across observations

22
Q

multicollinearity

A

2 or more independent variables are highly correlated

23
Q

model characteristics:

A
  1. based on cogent economic reasoning
  2. variables appropriate for the model with particular roles
  3. as simple as possible
  4. examined for assumption violations
  5. useful out of sample
24
Q
A