Week 1 Flashcards
Population
A large group, or repeated process, that we are studying.
Sample space
Collection of all possible samples. Uncertainty about which sample is drawn. The probability function P(x
Random variable
Describes a feature of the random sample. Formally, a rule that assigns a number to each possible sample.
Variance
Measures the spread of the distribution and also how accurate the prediction is.
Covariance
Measures how different variables move together - either in same direction, positive, or in opposite directions, negative. Or not correlated at all.
Exprectation
The expected value of a random variable is always the average of it.
Random sampling
A recipe for picking out points from the population at random and for recording their characteristics.
Realized sample
Recorded characteristics of the sample, a dataset with numbers.
Estimand, estimator, estimate
The estimand is the population feature that we try to infer using an estimator (rule) that gives us an estimate of the estimand.
Method of moments
Strategy for constructing an estimator by replacing population quantities by sample analogues.
Linear regression model (OLS-1):
Functional form. Assumption that the m (the regression curve) takes a functional form of a linear model with k regressors and excluded variables U. This means that it is linear in the coefficients, the betas, but can still have nonlinear variables in the x’s.
Regression curve
Rule that describes how the regressors contribute to the outcome. It is pinned down by k+1 parameters (coefficients)
Marginal effect
The causal effects that we want to compute. If we change on regressor by one (infinitesimal) unit and keep everything else constant (ceteris paribus), we get the marginal effect.
Conditional expectation
A recipe that tells us how to construct, for each state of the world, a “best” prediction of a rv by using the info contained in other rv’s.
Exogeneity assumption (OLS-2)
The regressors are not informative about the level of the U. The m gives the conditional expectation of the Y given the regressors. Cov(Xj, U)=0