Lecture 6: Predictive Data Analysis Flashcards

1
Q

Before doing regression analysis, we can use a ? -graph plot to determine if the two variables have any linear relationship.

A

Before doing regression analysis, we can use a scatter-graph plot to determine if the two variables have any linear relationship.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Ordinary Least Squares (OLS):
Minimise the ?? to find the best fit line.

With assumptions, OLS provides the ? (?), ?, ? Estimates (BLUE) of the model parameters (e.g. the betas)

A

Ordinary Least Squares (OLS):
Minimise the error variance to find the best fit line.

With assumptions, OLS provides the Best (consistent), Linear, Unbiased Estimates (BLUE) of the model parameters (e.g. the betas)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Assumptions of OLS:

  1. ?
  2. E(ε) = ?
  3. ? / Exogeneity: E(ε|X) = ?
  4. No measurement errors in X
  5. Homoskedasticity: Var(ε) = ?
  6. No ?: cov( ε_i, ε_j) = ?
A

Assumptions of OLS:

  1. linearity
  2. E(ε) = 0
  3. Independence/ Exogeneity: E(ε|X) = 0
  4. No measurement errors in X
  5. Homoskedasticity: Var(ε) = σ^2
  6. No autocorrelation: cov( ε_i, ε_j) = 0
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

OLS Method:
Minimise ????
Equation?

A

OLS Method:
Minimise sum of squared deviations:
Equation: https://images.app.goo.gl/ZeCmejWxkk7Jqb8r6

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

OLS Method:
How to minimise sum of squared deviations?
Taking ?? and setting them to ?.

A

OLS Method:
How to minimise sum of squared deviations: Taking partial derivatives and setting them to 0.
https://images.app.goo.gl/Va4P582A3fDjFB3A9

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

To check model fit, use ??? (i.e. ?-?):
proportion of change in ? that can be explained by change in ?
? <= R-squared <= ?

A

To check model fit, use Coefficient of Determination (i.e. R-squared):
proportion of change in Y that can be explained by change in X
0 <= R-squared <= 1 (100%)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

To test parameters’ significance, use ? testing & ?statistics:
Significance if ?? > critical value

A

To test parameters’ significance, use hypothesis testing & t statistics:
Significance if t-value > critical value

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Panel Data: ?? data (e.g. industries, firms,…) + ?-series data

A

Panel Data: cross-sectional data (e.g. industries, firms,…) + time-series data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly