07 Linear Regression Flashcards
σ
Population standard deviation
s
Sample standard deviation:
An estimator of the population standard deviation
s_y
An estimate of the population standard deviation of the random variable Y in the population from which the sample was drawn
SE()
Standard error of an estimator:
An estimator of the standard deviation of the estimator
SE(Ȳ) = σ̂_Ȳ = s_Y / √n
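As a quick numeric check of SE(Ȳ) = s_Y / √n, using toy numbers (illustrative only, not from any real dataset):

```python
import math

# Toy sample (illustrative numbers only)
y = [2, 4, 5, 4, 5]
n = len(y)
y_bar = sum(y) / n  # sample mean

# Sample standard deviation s_y (divides by n - 1)
s_y = math.sqrt(sum((yi - y_bar) ** 2 for yi in y) / (n - 1))

# Standard error of the sample mean: SE(Y_bar) = s_y / sqrt(n)
se_y_bar = s_y / math.sqrt(n)
```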
μ
Population mean
u
All factors other than X that affect Y (the error term)
synonyms for “dependent variable” and “independent variable”
dependent variable vs. independent variable
explained variable vs. explanatory variable
predicted variable vs. predictor variable
response variable vs. control variable
regressand vs. regressor
A normally distributed variable (X) can be made standard normal by:
Z = (X - μ) / σ; for the sample average, Z = (X̄ - μ) / (σ / √n)
The sample average is normally distributed whenever:
- Xi is normally distributed
- n is large (CLT)
T variable
T = (X̄ - μ) / (s_x / √n)
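The T variable can be computed on a toy sample; the hypothesised mean μ0 = 3 below is purely illustrative:

```python
import math

# Toy sample; hypothesised population mean mu0 = 3 (illustrative only)
x = [2, 4, 5, 4, 5]
n = len(x)
x_bar = sum(x) / n
s_x = math.sqrt(sum((xi - x_bar) ** 2 for xi in x) / (n - 1))

mu0 = 3
# T = (X_bar - mu0) / (s_x / sqrt(n))
t = (x_bar - mu0) / (s_x / math.sqrt(n))
```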
SLRM
Simple Linear Regression Model
The sum of squared prediction mistakes over all n observations
Sum[(Yi - b0 - b1 Xi)^2]; the OLS estimators β̂0 and β̂1 are the values of (b0, b1) that minimize this sum
β̂0
β̂0 = avg(Y) - β̂1 * avg(X)
Obtained from the first-order conditions of minimizing Sum[(Yi - b0 - b1 Xi)^2]
β̂1
β̂1 = Sum[(Xi - avg(X)) (Yi - avg(Y))] /
Sum[(Xi - avg(X))^2]
Obtained from the first-order conditions of minimizing Sum[(Yi - b0 - b1 Xi)^2]
β̂1 = r_{XY} * s_Y / s_X
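Both forms of the slope estimator can be checked on toy data (the numbers are illustrative only):

```python
import math

# Toy data (illustrative only)
x = [1, 2, 3, 4, 5]
y = [2, 4, 5, 4, 5]
n = len(x)
x_bar, y_bar = sum(x) / n, sum(y) / n

# beta1_hat = Sum[(Xi - X_bar)(Yi - Y_bar)] / Sum[(Xi - X_bar)^2]
beta1 = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y)) \
        / sum((xi - x_bar) ** 2 for xi in x)
beta0 = y_bar - beta1 * x_bar  # beta0_hat = Y_bar - beta1_hat * X_bar

# Equivalent form: beta1_hat = r_XY * s_Y / s_X
s_x = math.sqrt(sum((xi - x_bar) ** 2 for xi in x) / (n - 1))
s_y = math.sqrt(sum((yi - y_bar) ** 2 for yi in y) / (n - 1))
s_xy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y)) / (n - 1)
r_xy = s_xy / (s_x * s_y)
beta1_alt = r_xy * s_y / s_x
```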
If û_i is positive, the line ____ Yi
If û_i is positive, the line underpredicts Yi
By the definition of û and the first OLS first-order condition, the sum of the prediction errors is …
By the definition of û and the first OLS first-order condition, the sum of the prediction errors is zero:
Sum(û_i) = 0
The sample covariance between the independent variable and the OLS residuals is …
The sample covariance between the independent variable and the OLS residuals is zero.
The point … is always on the regression line (OLS)
The point (X̄, Ȳ) is always on the regression line (OLS)
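The three algebraic OLS facts above (residuals sum to zero, zero sample covariance between X and the residuals, and the line passing through (X̄, Ȳ)) can all be verified on toy data (illustrative numbers only):

```python
# Toy data (illustrative only)
x = [1, 2, 3, 4, 5]
y = [2, 4, 5, 4, 5]
n = len(x)
x_bar, y_bar = sum(x) / n, sum(y) / n

# OLS estimates from the closed-form formulas
beta1 = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y)) \
        / sum((xi - x_bar) ** 2 for xi in x)
beta0 = y_bar - beta1 * x_bar
resid = [yi - (beta0 + beta1 * xi) for xi, yi in zip(x, y)]

sum_resid = sum(resid)                                            # = 0
cov_x_resid = sum((xi - x_bar) * ui for xi, ui in zip(x, resid))  # = 0
on_line = beta0 + beta1 * x_bar                                   # = y_bar
```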
different goals of regression
Among others:
- Describe data set
- Predictions and forecasts
- Estimate causal effect
Causality
Causality is the effect measured in an ideal randomized controlled experiment
The OLS estimator is unbiased, consistent and has asymptotically normal sampling distribution if:
The OLS estimator is unbiased, consistent and has asymptotically normal sampling distribution if:
- Random sampling
- Large outliers are unlikely
- The conditional mean of u_i given X_i is 0:
E(u|X) = 0
Example (wage regression, where ability is part of u): E(abil | educ = 8) = E(abil | educ = 16)
The OLS estimator is ___, ____ and has _____ if:
- Random sampling
- Large outliers are unlikely
- The conditional mean of u_i given X_i is 0
The OLS estimator is unbiased, consistent and has asymptotically normal sampling distribution if:
- Random sampling
- Large outliers are unlikely
- The conditional mean of u_i given X_i is 0
(OLS) When dealing with outliers one may want ______
When dealing with outliers one may want to report the OLS regression both with and without the outliers
OLS is the most efficient (the one with the lowest variance) among all linear unbiased estimators whenever:
OLS is the most efficient (the one with the lowest variance) among all linear unbiased estimators (the Gauss-Markov theorem) whenever:
- The 3 OLS assumptions hold
- The error is homoskedastic
TSS
Total sum of squares:
Sum[(Yi - avg(Y))^2]
TSS = ESS + SSR
ESS
Explained sum of squares:
Sum[(Ŷi - avg(Y))^2]
SSR
Sum of squared residuals:
Sum[û_i^2]
R^2
The regression R^2 is the fraction of the sample variance of Yi explained by Xi:
R^2 = ESS / TSS = 1 - SSR / TSS
R^2 = 0: none of the variation in Yi is explained by Xi
R^2 = 1: all the variation is explained by Xi; all the data points lie on the OLS line.
A high R2 means that the regressor is good at predicting Yi (not necessarily the same as a ”good” regression)
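Both expressions for R^2, and the decomposition TSS = ESS + SSR they rely on, can be checked on toy data (illustrative numbers only):

```python
# Toy data (illustrative only)
x = [1, 2, 3, 4, 5]
y = [2, 4, 5, 4, 5]
n = len(x)
x_bar, y_bar = sum(x) / n, sum(y) / n
beta1 = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y)) \
        / sum((xi - x_bar) ** 2 for xi in x)
beta0 = y_bar - beta1 * x_bar
fitted = [beta0 + beta1 * xi for xi in x]

tss = sum((yi - y_bar) ** 2 for yi in y)                # total sum of squares
ess = sum((fi - y_bar) ** 2 for fi in fitted)           # explained sum of squares
ssr = sum((yi - fi) ** 2 for yi, fi in zip(y, fitted))  # sum of squared residuals

r2_a = ess / tss      # R^2 = ESS / TSS
r2_b = 1 - ssr / tss  # R^2 = 1 - SSR / TSS; equal because TSS = ESS + SSR
```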
SER
The standard error of the regression (SER) is an estimator for the standard deviation of the regression error u_i.
SER = √[SSR / (n - 2)]
It measures the spread of the observations around the regression line.
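A minimal sketch of the SER on the same kind of toy data (illustrative numbers only):

```python
import math

# Toy data (illustrative only)
x = [1, 2, 3, 4, 5]
y = [2, 4, 5, 4, 5]
n = len(x)
x_bar, y_bar = sum(x) / n, sum(y) / n
beta1 = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y)) \
        / sum((xi - x_bar) ** 2 for xi in x)
beta0 = y_bar - beta1 * x_bar
resid = [yi - (beta0 + beta1 * xi) for xi, yi in zip(x, y)]

ssr = sum(ui ** 2 for ui in resid)
ser = math.sqrt(ssr / (n - 2))  # SER = sqrt(SSR / (n - 2))
```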
If the independent variable is multiplied by some nonzero constant c, then the OLS slope coefficient is _____
If the independent variable is multiplied by some nonzero constant c, then the OLS slope coefficient is divided by c.
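The scaling rule can be verified directly; the dataset and the constant c = 10 below are illustrative only:

```python
# Helper computing the OLS slope from the closed-form formula
def ols_slope(x, y):
    n = len(x)
    x_bar, y_bar = sum(x) / n, sum(y) / n
    return sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y)) \
           / sum((xi - x_bar) ** 2 for xi in x)

# Toy data (illustrative only)
x = [1, 2, 3, 4, 5]
y = [2, 4, 5, 4, 5]
c = 10  # arbitrary nonzero constant

slope = ols_slope(x, y)
slope_scaled = ols_slope([c * xi for xi in x], y)  # slope divided by c
```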
Homoskedasticity
The error u has the same variance given any value of the explanatory variable, in other words: Var(u|x) = σ^2
Homoskedasticity is not required for unbiased estimates, but it is an underlying assumption in the standard variance
calculation of the parameters. To keep the variance expression simple, the assumption that the errors are homoskedastic is added.
The larger the variance of X, the ____ the variance of β̂1
The larger the variance of X, the smaller the variance of β̂1
Var(β̂1)
Var(β̂1) = (1/n) * Var[(Xi - μ_X) u_i] / [Var(Xi)]^2 (Appendix 4.3)
s_xy
Sample covariance
1 / (n - 1) * sum{(Xi - avg(X))(Yi - avg(Y))}
s^2_X
Sample variance of X
1 / (n - 1) * sum{(Xi - avg(X))(Xi - avg(X))}
sample correlation coefficient
r_{XY} = s_{XY} / (s_X * s_Y)
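The three sample quantities above fit together as follows on toy data (illustrative numbers only):

```python
import math

# Toy data (illustrative only)
x = [1, 2, 3, 4, 5]
y = [2, 4, 5, 4, 5]
n = len(x)
x_bar, y_bar = sum(x) / n, sum(y) / n

# Sample covariance and sample variances (all divide by n - 1)
s_xy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y)) / (n - 1)
s2_x = sum((xi - x_bar) ** 2 for xi in x) / (n - 1)
s2_y = sum((yi - y_bar) ** 2 for yi in y) / (n - 1)

# Sample correlation coefficient, always in [-1, 1]
r_xy = s_xy / (math.sqrt(s2_x) * math.sqrt(s2_y))
```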
consistency
An estimator is consistent if the spread of its sampling distribution around the true parameter approaches zero as n increases
Normality assumption
The population error u is independent of explanatory variables and is Normal(0, σ^2)
- Whenever y takes on just a few values it cannot have anything close to a normal distribution.
- The exact normality of OLS depends on the normality of the error.
- If β̂ is not normally distributed, the t-statistic does not have a t-distribution.
- The normal distribution of u is the same as the distribution of Y given X.
- In large samples we can invoke the CLT to conclude that the OLS estimators satisfy asymptotic normality.
β̂1 ∼
Normal[β1, Var(β̂1)]
Thus (β̂1 − β1) / sd(β̂1) ∼ Normal(0, 1)
This comes from:
• A random variable which is a linear function of a normally distributed variable is itself normally distributed.
• If we assume that u ∼ N(0, σ^2), then Yi is normally distributed.
• Since the estimators β̂0 and β̂1 are linear functions of the Yi's, the estimators are normally distributed.
In general the t-statistics has the form:
t = (estimator - hypothesised value) / standard error of the estimator
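The general form can be written as a one-liner; the estimate 0.6 and standard error 0.25 below are made-up illustrative values, not results from any regression:

```python
# Generic t-statistic: (estimator - hypothesised value) / SE(estimator)
def t_stat(estimate, hypothesised, se):
    return (estimate - hypothesised) / se

# Illustrative values only: testing H0: beta1 = 0
# with beta1_hat = 0.6 and SE(beta1_hat) = 0.25
t = t_stat(0.6, 0.0, 0.25)
```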
A coefficient can be statistically significant either because ____
A coefficient can be statistically significant either because the coefficient is large, or because the standard error is small.
(OLS) Which standard errors are preferred?
Heteroskedasticity robust standard errors
In econometric applications the errors are rarely homoskedastic and normally distributed, but as long as n is large and we compute heteroskedasticity robust standard errors we can compute t-statistics and hence p-values and confidence intervals as normal.
(OLS) Most often the violated assumption is ___
(OLS) Most often the violated assumption is the zero conditional mean assumption, X is often correlated with the error term.
Sum[ (Xi - avg(X)) (Yi - avg(Y)) ] = Sum[ … ]
Sum[ Xi (Yi - avg(Y)) ]
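The identity holds because Sum[X̄ (Yi - Ȳ)] = X̄ * 0 = 0; a numeric check on toy data (illustrative numbers only):

```python
# Toy data (illustrative only)
x = [1, 2, 3, 4, 5]
y = [2, 4, 5, 4, 5]
n = len(x)
x_bar, y_bar = sum(x) / n, sum(y) / n

lhs = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
rhs = sum(xi * (yi - y_bar) for xi, yi in zip(x, y))
# Equal: the dropped term sums to x_bar * sum(yi - y_bar) = 0
```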