Session 3 Flashcards

1
Q

mean of the probability distribution

A

mu (u) true mean sample mean = x-bar

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

law of large numbers

A

as the number of observations drawn increases, the mean of x-bar (sample mean) eventually approaches the mean mu (true mean of population) as closely as you specified.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

standard deviation =

A

= square root of the variance

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

variance

A

is the avg squared deviation of the values of the variable from their mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

3 assumptions on linear regression model under which OLS gives appropriate estimator

A
  1. conditional distribution of Ui given Xi has a mean of zero (other factors contained in Ui are unrelated to Xi – that is, given a value of Xi, the mean of the distribution of these other factors is 0). IN other words, the error term Ui has a condition mean zero given Xi:E(Ui|Xi) = 0. 2. (Xi, Yi), i = 1,….n. Are independently and identically distributed. 3. large outliers are unlikely in other words 1. OLS estimator is unbiased, b/c error term has conditional mean of 0 2. Xi and Yi are independent and indentically distributed 3. large outliers unlikely.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

if the least squares assumptions hold then the OLS estimators of the slope and intercept are: _____, _______, and have a ______

A

if the least squares assumptions hold then the OLS estimators of the slope and intercept are: unbiased, consistent, and have a sampling distribution with a variance that is inversely proportional to the sample size n. OR, OLS estimators as unbiased, consistent, & normally distributed when the sample is large.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

binary variable is also called ___ or ____

A

indicator or dummy variable example: male or female, urban or rural

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

The two conditions for OVB are:

A

The two conditions for OVB are: 1. X2 (or priGPA) is a determinant of Y 2. X2 (priGPA) and X1 (attend) are correlated

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

The formula for bias

A

The formula for bias α1-1 = γ1 * β2

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

A high R2 means that the regressors…… A high R2 does not mean that you have eliminated …. A high R2 does not mean that you have an unbiased… A high R2 does not mean that the included variables are ….

A

A high R2 means that the regressors explain the variation in Y. A high R2 does not mean that you have eliminated omitted variable bias. A high R2 does not mean that you have an unbiased estimator of a causal effect (1). A high R2 does not mean that the included variables are statistically significant – this must be determined using hypotheses tests.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

linear log log linear log log

A

Linear Log: A 1% change in X is associated with a change in Y of 0.01β1 Log Linear: A change in X by 1 unit is associated with a 100β1% change in Y Log log:A 1% change in X is associated with a β1% change in Y (β1 is the elasticity with respect to X)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

OLS Assumptions

A
  1. E(u|X = x) = 0. The conditional distribution of u given X has mean zero.
  2. (Xi,Yi), i =1,…,n, are i.i.d.
  3. X and u have four finite moments.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q
A
  1. Assumption 1 is violated. for any slice, the average of residuals = 0
  2. Assumption 2 is violated. you can see predictive cycles, there’s a cyclical pattern, likely seasonal data
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q
A
  • Assumption 1 is violated.
  • Assumption 2 is violated - some trending happening
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q
A
  • Assumption 1 is violated.

Correction: Likely large outlier, could take log transformation on x or y axis. and bring down effect of outlier.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

r2 is the….

A

percentage of variance explained by the model

17
Q

key assumptions in multiple linear regression

A
  1. the conditional distribution of u given the X’s has mean zero, that is, E(u|X1 = x1 …. Xk = xk) = 0
  2. X1i ….. Xki, Yi), i=1….n, are independent and identically distributed
  3. X1…Xk and u have four finite moments
  4. There is no strong multicollinearity
18
Q

linear log interpretation

Yi = B0 + B1ln(Xi) + ui

A

A 1% change in X is associated with a change in Y of 0.01β1

19
Q

log linear interpretation

ln(Yi) = Bo + B1Xi + ui

A

A change in X by 1 unit is associated with a 100β1% change in Y

20
Q

log-log interpretation

ln(Yi) = B0 + B1ln(Xi) + ui

A

A 1% change in X is associated with a β1% change in Y (β1 is the elasticity with respect to X)

21
Q

interaction term interpretations

A