Introduction to Linear Regression - Reading 4 Flashcards

1
Q

What is a dependent variable?

A

the variable whose variation is explained bu the independent variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are the six main assumptions underlying a simple linear regression?

A
  1. A linear relationship exists between the dependent and the independent variables.
  2. The independent variable is uncorrelated with the residuals.
  3. The expected value of the residual term is zero.
  4. The variance of the residual term is constant for all observations.
  5. The residual term is independently distributed; that is, the residual for one observation is not correlated with that of another observation
  6. The residual term is normally distributed.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is an independent variable?

A

the variable used to explain the variation of the dependent variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Which method is used to estimate a simple linear regression?

A

Minimizing the sum of squared errors

(Yi-^b0-^b1Xi)^2

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

How to estimate b0?

A

^b0=Ym-^b1Xm

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

How to estimate b1?

A

^b1=Cov(x,y)/Var(x)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is important to verify before any conclusions about the coefficients?

A

Determine the statistical significance

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is the standard error of estimate (SEE) and how to calculate?

A

Measures the degree of variability of the actual y-values relative to the estimated Y- values from a regression equation
SEE=(SSE/n-2)^0,5
SEE=(MSE)^0,5

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is the coefficient of determination and how to calculate?

A

tha percentage of the total variation in the dependent
variable explained by the independent variable
R^2=[(TotalVariation-UnexplainedVariation)]/Total Variation=ExplainedVariation/Total Variation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is the regression coefficient confidence?

A

Hypothesis testing for regression coefficient may use the confidence interval for the coefficient being tested.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

On regression coefficient intervals, what is the appropriate number of degrees of freedom?

A

n-k-1

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Formulate a null and alternative hypothesis about a population value of a regression coefficient

A

t=(^b-b_hypothesis)/Var(^b)

decision rule : reject H0 if t>tcritical or t

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What the rejection of the null means?

A

the rejection of the null means

that the slope coefficient is different from the hypothesised value of b

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is the confidence interval for the predicted value?

A

ˆY+-(t_c x sf)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

how to calculate the variance error of the forecast?

A

sf^2=SEE^2{1+[1/n]+[(X-Xm)^2/(n-1)s_x^2]

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is ANOVA?

A

Analysis of variance (ANOVA) is a statistical procedure for dividing the total variability of a variable into components that can la attributed to different sources

17
Q

What is the Total Sum of Squares (SST)?

A

Measures the total variation in the dependent variable

SST=sum(U-Ym)^2

18
Q

What is the Regression Sum of Squares (RSS)?

A

Measures the variation in the dependent variable that is explained by the independent variable
RSS=sum(Y^-Ym)ˆ2

19
Q

What is the decomposition of total variation?

A

SST=RSS+SSE

20
Q

What is the F-test? How to calculate the F statistic? How many df are in the numerator?How many df are in the denominator? Is it a two tailed or one-tailed?

A
An F-test assesses how well a set of independent variables, as a group, explains the variation in the dependent variable.
F=MSR/MSE=(RSS/k)/(SSE/n-k-1)
df_numerator=k
df_denominator=n-k-1
*this is always a one-tailed test
Decision rule: Reject h0 if F>F_c
21
Q

What is another way to calculate the F statistic more easily for a simple linear regression?

A

F=(t_b1)^2

22
Q

What are the three main limitations of regression analysis?

A

The main limilations of regression analysis include the following :

  1. Parameter instability (especially when dealing with economic and financial variables).
  2. The limited usefulness of regression models in identifying profitable investment strategies based on publicly available information
  3. The possibility of violating the assumptions underlying regression analysis (heteroskedasticity and autocorrelation)