Exam 2 Cumulative Review Flashcards

1
Q

dependent variable

A

-the variable that is dependent on the independent variable, what is being measured
-y-variable
-left hand/left side variable
regressand

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

independent variable

A

-the variable that affects the dependent variable
-right hand variable
right side variable
-regressor

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

7 steps for calculating predictor x in regression equation

A
  1. calculate E(X) and E(Y)
  2. calculate “actual minus expected” for each variable, for each observation
  3. square “actual minus expected” JUST FOR X
  4. DO NOT square “actual minus expected” for Y
  5. instead, take (actual - expected of X) x (actually - expected for Y)
  6. add up what you got for 3 and 5
  7. take the ratio of the two (with XY on the top)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

calculating intercept for regression equation

A

take the slope you calculating…

  1. multiply it by your E(x)
  2. subract that from your E(y)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

how to interpret regression equation (use following example):
Y = 1.28 + .10443X

A

when the value of our X variable increases by 1, the value of our Y variable increases by .104

for every $10 increase in hourly income, the number of meals eaten out in a month increases by about 1

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

residual of a regression

A

the difference between the predicted Y and the actual Y according to the data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

total sum of squares

A

the actual variation in Y (dependent variable)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

explained sum of squares

A

the modeled variation in Y (dependent variable)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

R^2

A

ratio of ESS/TSS

-used to tell how much of variance an IV explains in the DV

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

7 steps to calculating R^2

A
  1. take the actual values of Y, get E(y)
  2. take (act - exp)Y; square it, add it up
  3. that’s the actual variation in Y
  4. you have already run the regression
  5. use the results to calculated predicted values of Y for every X
  6. take (Predicted - Expected)Y; square it; add it up
  7. take the ratio of #6 over #3
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

classic linear regression model assumptions

A
  1. the model is linear in its parameters and has an additive error term
  2. the values for the IV’s are derived from a random sample of thepopulation and contain variability
  3. no IV is a perfect linear function of any other IV (no perfect colinearity)
  4. the error term has expected value of zero
  5. the error term has a constant variance (homoskedasity)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

if the error term has an expected value of zero and also has a constant variance…we can conclude what about the error term? and then what about the estimate for b-hat?

A
  1. error term has a normal distribution
  2. our estimate B-hat is a linear function of the error term
  3. therefore, B-hat is normally distributed
  4. this means we can test our individual estimators for significance
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

equation for testing (tstat) individual estimator’s significance (assuming homoskedasticity)

A

(Betahat - 0) / standard error of Betahat

it is minus zero because that is our expected value for beta-hat

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

what does having a t-stat of 1.96 exactly mean for our beta-hats?

A

this means I could create an interval of a certain width (1.96 standard errors above and below our hypothesized mean) and if I repeated by sampling 100 times, 95 out of the 100 times that interval would contain the true population mean
-so if the interval for betahats on the printout from SAS contained zero, we cannot reject the null (beta = 0)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

if our r^2 and SER isn’t great, what else can we check to find significance in our regression model?

A

we can check the t-stats for each individual beta-hat…we can’t conclude they are key drivers, but it does indicate and real and positive relationship

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

what is the standard error of beta hat?

A

-it is the standard deviation of the sampling distribution…it measures the spread

remember that beta hat is a random variable, therefore it has an expected value and a distribution
-that distribution has a spread, measured by the variance and the SD

17
Q

homoskedastic

A

variance of the error term is constant

18
Q

heteroskedastic

A

variance of the error term changes

19
Q

the smaller the standard error of our beta hat, the larger our t-stat. AND the larger our t-stat, the more likely we will reject the hypothesis that b = 0…why?

A

when our standard error gets small, this means the distribution is getting less and less wide

  • if our distribution of beta hat is very tight, our distribution is not very spread out
  • this means there’s less and less of a chance that values will vary much from our expected value…which means there is less of a chance that zero will be in that interval
20
Q

f-statistic

A

used to test a joint hypothesis

  • the hypothesis that ALL of our betas are really zero
  • B1=0 AND B2=0, not B1=0 OR B2=0
21
Q

formula for f-stat

A

(R^2 / k ) /
((1-R^2) / (n-k-1))

k is number of IV’s
n is number of observations

22
Q

formula for finding t-stat of restricted and unrestricted variables

A

((RSSrestricted - RSSunrestricted)/ q ) /

(RSSunrestricted) / n-k-1 )

23
Q

explained sum of squares (regression sum of squares)

A

sum of squares of the deviations of the predicted values from the mean value of a response variable, in a standard regression model

24
Q

Standard error of observations

A

the Standard deviation of the error

-large values imply that actual values are a long away from our fitted line

25
Q

p-value

A

the probability of obtaining a test statistic at least as extreme as the one that was actually observed, assuming that the null hypothesis is true

26
Q

binary variable

A

either/or situation to represent discrete variables

-in STATA, you identify one outcome as 1 and the other as 0