lecture 2 Flashcards
SR: how to draw a sample
population -> draw a sample -> gives us our statistic aka estimator -> use this to estimate parameter -> get our parameter of interest (u)
X-bar
sample mean (parameter of interest)
what is X-bar the same thing as
E(X)
what is E(X)
expected value (mean u of a discrete random variable X)
instead of population mean we are interested in
population conditional mean
population conditional mean
mean of conditional distribution: find the average of something conditional on another variable, like wage conditional on gender or expected value of GPA conditional on high school GPA
conditional distribution
The probability distribution of one random variable given that another random variable takes on a particular value
true line
- arguing that this exists
- population regression line
- very few observations lie on it
- Y = Bo + B1X + u
u
the error term; allows for discrepancies
u = Y - Bo - B1X
u on a regression
the vertical distance from the point to the line
in regression of Y on X
Y is on the left side, X is on the right
Terminology when Y is the dependent variable
X = independent variable
u = error term
Terminology when Y is the explained variable
X = explanatory variable
u = unobserved heterogeneity
Terminology when Y is the regressand
X = regressor
u = disturbance
Terminology when Y is the left-hand side variable
X = right-hand side variable
u =
use the equation: Y = Bo + B1X + u to represent
the conditional expectation of Y
the conditional expectation of Y
E(Y|X) = Bo + B1X
what do we need to know to find the expected value of Y given X
need to know Bo and B1
we use sample mean as an estimator of the population mean so we use
estimators for Bo and B1
where do we get estimators for Bo and B1
draw a sample from the population then
B hat o
estimated value of the y-intercept in a linear regression model
- predicted value of Y when all independent variables are zero
B hat 1
estimated slope of a regression model
- change in y for one unit increase in x
adding a subscript to our regression
reflects that we now have data