linear regression with one regressor Flashcards

1
Q

what is the population regression line?

A

it is the expected value of Y given X ie E(Y/X)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

what is the slope of the population regression line?

A

the slope is the difference in the expected values of Y, for two values of X that differ by one unit.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

what can the estimated regression be used for?

A

casual inference (learning about the casual effect on Y of a change in X)
prediction ( predicting the value of Y given X for an observation not in the data set)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

what is the problem of statistical inference for linear regression?

A

the problem of statistical inference for linear regression is, at a general level, the same as for estimation of the mean or of the differences between two means

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

what does the statistical or econometric inference about the slope entails?

A

1) estimation - how should we draw a line through the data to estimate the population slope
2) hypothesis testing - how to test whether the slope is zero
3) confidence interval - how to construct a confidence interval for the slope?

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

what is the equation for the population regression line?

A

test score = β0 + β1STR where β1 = slope of population regression line

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Why are β0 and β1 “population” parameters?

A
  • We would like to know the population value of β1.
  • We don’t know β1, so must estimate it using data
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

what is the equation of the population linear regression model?

A

Yi = β0 + β1Xi + ui, i = 1,…, n
* We have n observations, (Xi, Yi), i = 1,.., n.
* X is the independent variable or regressor
* Y is the dependent variable
* β0 = intercept
* β1 = slope* ui = the regression error

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

what does the regression error consist off?

A

The regression error consists of omitted factors. In general, these omitted factors are other factors that influence Y, other than the variable X. The regression error also includes error in the measurement of Y.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

how can you estimate the β0 and β1 from data?

A

we can use the ordinary least squares or OLS estimator of the unknown parameters β0and β1. The OLS estimator solves: min(b0,b1) ∑ [Yi - (b0 + b1Xi)]^2

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

what is the equation for 𝛽1 ?

A

r(xy)*[Sy/Sx] where r(xy) is the sample correlation and Sy and Sx is the sample standard deviations

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

what is the equation for 𝛽0?

A

𝛽0 = mean of Y - 𝛽1*mean of X

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

what does the regression R^2?

A

the regression R^2 measures the fraction of the variance of Y that is explained by X; it is unitless and ranges between zero (no fit) and one (perfect fit)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

what does the standard error of regression(SER) measure?

A

the standard error of the regression measures magnitude of a typical regression residual in the units of Y

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

what is the equation for R^2?

A

R^2 = ESS/TSS = [∑(Ŷ-mean of Ŷ)^2]/[∑(Y-mean of Y)^2]

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

what is the standard error of regression(SER)?

A

The SER measures the spread of the distribution of u. The SER is(almost) the sample standard deviation of the OLS residuals

17
Q

what is the equation for the SER?

A

SER= [1/(n-2) *∑û^2]^(1/2)

18
Q

what is the definition of casual effect on Y of a unit change in X?

A

it is the expected difference in Y as measured in a randomized controlled experiment

19
Q

for a binary treatment, what is required for a difference in means to measure a causal effect?

A

it requires random assignment or as-if random assignment. random assignment ensures that the treatment (X) is uncorrelated with all other determinents of Y, so that there are no confounding variables

20
Q

what are the least squares assumptions for causal inference?

A

1) the conditional distribution of u given X has mean zero, that is E(u\X=x)=0
2) (X Y), i=1,….,n, are i.i.d. this is true if (X, Y) are collected by simple random sampling and delivers the sampling distribution of β0 and β1
3) large outliers in X and/or Y are rare. technically X and Y have finite fourth moments. outliers can result in meaningless values of β1

21
Q
A