cross sectional data and simple regression Flashcards
what does cross sectional data mean ?
Interested in the variables (Y,X) for example the relationship between demand D and price P
What is a population ?
A group of data, e.g. various individuals, firms, supermarkets and countries, we observe this simultaneously for N subjects drawn from this population
What are we traditionally interested in
The single variable Y for example as a return of a financial asset
Can data be regarded as a random sample
NO, it is important to account for trends, seasonality, temporal and serial dependence
So how is cross-sectional data indexed?
Yt, t=1,…,T
Explain simple regression
We are interested in a dependent (LHS, Explained, Response) variable Y, which is supposed to depend on an explanatory (RHS, Independent, Control, Predictor) variable X
Give an example of a simple regression
Demand is a response variable and price is the predictor variable. Another one is wage is the response variable and years of education is the predictor variable
What do we assume dealing with Cross-Sectional data means our observation pairs are?
(Y1, X1),…,(Yn, Xn)
These are stochastically independent and follow the same probability distribution drawn from the same population
What is the conditional mean?
The avg behaviour of a dependent variable Y can be summarised with its mean E(y)
What can we do if the explanatory variable X entails info about about Y ?
we can use it to refine the definition of the mean, we can express the avg behaviour of Y given X as the conditional mean of Y given X, E(Y/X)
Examples of the conditional mean
If wage is the response Y and years of education is the predictor X, then we are interested in the avg wage of a person with X years of education, E(Y/X)
If the outcome of throwing a dice is
E(Y) = (1+2+3+4+5+6)/6
Consider the explanatory variable:
X = 1 if Y is even
x = 0 if Y is odd
Calculate the conditional mean
E(Y/X) = 1: (2+4+6)/3 = 4
E(Y/X) = 0: (1+3+5)/3 = 3
we can summerise this as:
E(Y/X) = 4x+3(1-x)=3+x
This means that it is a function of x,
meaning E(Y/X) = g(x)
What is the alternative way of giving 2 defining properties to the conditional mean
- It is a function of X: E(Y/X) = g(X)
2 E(Y/X∈A) = E(E(Y/X/X∈A) for all sets of A ⊆ R
What this equation essentially says that the expected values of the conditional mean is equal to the unconditional expected values
In this course, we often assume that E(Y/X) is a linear function in X: E(Y/X) = β0 + β1X
What are some properties of the conditional mean?
- If X and Y are independent than E(Y/X) = E(Y)
- If Y = f(x) then E(f(X)/(X)) = f(X)
- if E(f(X)/(X)) = f(X)E(Y/X)
Law of total/iterated expectation: E(E(Y/X)) = E(Y)
Linearity:
E(Y1 + Y2|X ) = E(Y1|X ) + E(Y2|X ) , and E(aY |X ) = aE(Y |X )
I If Y ≥ 0 then: E(Y |X ) ≥ 0