Ch2: Woolridge: The Simple Regression Model Flashcards

1
Q

What is the simple regression model used for?

A

To study the relationship between two variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What does the simple regression model study?

A

the two variables x and y. it studies how y varies with changes in x. example: y is a community crime rate and x is number of police officers.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is the equation for the simple regression model?

A

y = b0 + b1x + u

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What are the different names for the Y variable?

A

dependent variable
explained variable
response variable
predicted variable
regressand

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What are the different names for the X variable?

A

Independent variable
Explanatory variable
Control variable
Predictor variable
Regressor
Covariate

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is u?

A

The error term or disturbance in the relationship. It represents factors other than x that affects y. the model treats all of those factors as being unobserved. so think of u as unobserved

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is b1?

A

the slope parameter

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is b0?

A

the intercept parameter, aka constant term

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is the natural measure of the association between two random variables?

A

the correlation coefficient

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What happens if u and x are uncorrelated?

A

there will not be linearly related

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is the assumptions of the ordinary least squares?

A

Assumptions of OLS:
- Linearity: The relationship between X and Y is linear.
- Independence: Observations are independent of each other.
- Homoscedasticity: Variance of errors is constant across all levels of X and Y.
- Normality: Errors are normally distributed (important for inference).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is the residual?

A

The difference between the actual value and its fitted value. (actual - estimated) y - yhat

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is the sum and the sample average of the OLS residuals?

A

zero

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is the sample covariance between the regressors and the OLS residuals?

A

zero

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is the total sum of squares (SST)?

A

Total candies (SST): All the candies in the jar, including the ones you guessed right and wrong.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is the explained sum of squares (SSE)? (aka sum of squares residuals)

A

Residual candies (SSR): These are the candies that you didn’t guess right. So, SSR is like the mistake or the difference between how many candies you thought there were and how many there actually were. It’s the part your guess didn’t get right!

17
Q

What is the explained sum of squares (SSE)?

A

Explained candies (SSE): The candies that your guess got right—those are the ones you were able to predict correctly.

18
Q

What R-squared of the regression? (sometimes called the coefficient of determination) (goodness-of-fit)

A

R^2 = SSE/SST = 1 - SSR/SST

R-squared is like a score that tells you how well your guesses match the actual number of candies.

If R-squared is 1, it means your guesses are perfect—there’s no difference between what you guessed and what was actually in the jar.

If R-squared is 0, it means your guesses are really far from the truth, and they don’t help explain anything about the candies.

19
Q

What is the constant elasticity model?

A

The constant elasticity model is a way to describe how one thing (like price) affects another thing (like quantity) in a consistent way, no matter how big or small the values are.

Imagine you’re selling candies, and you notice that every time you change the price, the amount of candies sold changes in a consistent percentage.

In a constant elasticity model:
- Elasticity tells us how much one thing changes in percentage when another thing changes by 1%.
- Constant means the percentage change stays the same no matter what.

For example, if the price of candies goes up by 10%, the quantity sold might go down by 5%. If that 5% decrease happens every time you increase the price by 10%, then the elasticity is constant.

In simple terms, the constant elasticity model says, “If I change one thing by a certain percentage, another thing changes by a fixed percentage every time.”

20
Q

What is the zero conditional mean?

A

The zero conditional mean is a rule in econometrics that says the average of the error terms (the “mistakes” in the model, u, the residual) should be zero when you have certain conditions.

Imagine you’re guessing how many candies are in a jar based on some clues. The error is how far your guess is from the actual number of candies. The zero conditional mean means that, on average, your guesses will be correct (you don’t systematically overestimate or underestimate).

In more technical terms, it means that the average value of the errors should be zero when the values of the predictors (like the size of the jar or the color of the candies) are known. This is important because it ensures that your model doesn’t have bias and that your guesses are fair.

21
Q

What is homoskedasticity?

A

this assumption says that the variance of the unobservable, u, conditional on x, is constant. aka constant variance assumption.

Homoskedasticity means that the mistakes you make in your predictions are spread out evenly for all values of your predictor.

Imagine you’re guessing how many candies are in a jar based on the size of the jar: If your mistakes are small for both small and large jars, and they look about the same for every jar size, that’s homoskedasticity.

homoskedasticity just means that your mistakes are consistent no matter what size jar (x) you’re looking at!

22
Q

What is heteroskadasicity?

A

if your mistakes are bigger for large jars and smaller for small jars, that’s heteroskedasticity, which means the spread of the mistakes changes with the size of the jar.

23
Q

What is the difference between errors and residuals?

A

Errors are unknown and represent the true mistakes.

Residuals are what we actually calculate based on our model’s predictions, acting as estimates of the errors.

24
Q

What is the binary variable? (aka dummy variable)

A

can take two variables: 0 and 1

25
Q

What does squared error explain?

A

How much of the total variation is not described by the regression line?

26
Q

What is the coefficient of determination? (also called r squared) (khan)

A

tells what % of total variation is described by the line (the variation in x). tells us how good of a fit the line is in the plot relative to observer points!

27
Q

What is the exogeneity of covariates?

A

The exogeneity of covariates means that the independent variables (or predictors) in a regression model are not related to the error term. It’s a key assumption in econometrics that ensures your model gives reliable and unbiased estimates of the coefficients.