Final Exam Review Flashcards
steps to calculating a basic regression equation
- calculate E(x) and E(y)
- calculate actual minus expected for each variable, each observation
- square “actual minus expected” for only X
- for y, take (actual(x) - expected(x))*(actual(y)-expected(y))
- take the sums of 3 and 4
- take ratio of these sums with XY on top
this is your estimator
intercept = E(y) -E(x)*slope
interpret:
selling price = 150 + .027(square footage)
for every increase of 1 square foot, we can expect to see the selling price of the house increase by .027
Total sum of squares
total actual variation in Y
sum of (Actual Y - expected Y)^2
Explained sum of squares
the modeled variation in Y
sum of (estimated y - expected Y)^2
steps for an R^2
- take the actual values of Y, get E(y)
- take (ACTy - EXPy), square it, add it up
- thats the actual variation in Y
- use the results of regression to calculate predicted values of Y for every X
- take (PREDICTEDy-EXPECTEDy); square it; add it up
- take the ratio of 5 over 3 (ESS/TSS)
r^2
how much of the variance in our data is explained by the regression model
Standard Error of OBservations
the standard deviation of the error term
-how far are the actual values from our fitted line
What does a t-stat of 1.96 actually mean?
this means you could create an interval of a certain width (1.96 SE above and below our hypothesized mean) and if you repeated your sampling 100 times, 95/100 times that interval would contain the true population mean
standard error of the beta hat
remember beta is a random variable
- therefore, it has an expected value and distribution
- that distribution, the sampling distribution, has a spread, measured by the variance and the SD
- the standard error of beta hat is the SD of the sampling distribution…it measures the spread
heteroskedastic
variance of the error term changes
homoskedastic
variance of the error term is constant
why does a smaller standard error of the beta hat usually correlate with a larger t stat?
because as SE decreases, the width of the distribution gets smaller and smaller…decreasing our chances of having an insignificant value…spread is TIGHT
F-statistic
-used to test a joint hypothesis that ALL of our betas are really zero
formula for an f-statistic
(1-R^2)/(n-k-1)
k is # of IVs
whats the formula for testing a subset of variables?
(RSSrestricted-RSSunrestricted)/q
______________________________
RSSunrestricted/n-k-1
after running a simple p and q demanded regression, how do we find the elasticity of a product at a certain point?
ex: q = 1147 - 7.93*p, p = 70
(p/q)*coefficient = elasticity at that point
70*-7.93 = 555
1147-555=592
-7.93*(70/592)
what’s the most basic way to estimate a curvilinear regression?
convert the data to natural logs, dummy
how do we convert an equation BACK from a log?
- take the exponential of your intercept term
- take your coefficient (elasticity) and raise your P to it
- multiply these things
ex: Q demanded = 23,156*p^(-.873)
log-log (def and interpretation)
- take the log of both the variables
- changes interpretation to percentages
ex: a certain % change in X is associated with a certain % change in Y
log-linear (def and interpretations)
- convert the DV to logs
- leave IVs just as they are
- changes to the DV are interpreted as percentages, NOT units