Book 1_Quan_Simple Linear Regression Flashcards

You may prefer our related Brainscape-certified flashcards:
1
Q

Linear regression definition

A

provides an estimate of the linear relationship between an
independent variable (the explanatory variable) and a dependent variable (the
predicted variable)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

The general form of a simple linear regression model

A
  • Yi = bo + b1 Xi + ei
    + b1 = fitted slope coefficient = COVxy/stdx^2
    + b0 = fitted intercept = Y – b1 X

+ Dependent variable: Y
+ Independent variable: X

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

The estimated intercept, b0

A

represents the value of the dependent variable at the
point of intersection of the regression line and the axis of the dependent variable
(usually, the vertical axis)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

The estimated slope coefficient, b1

A

is interpreted as the
change in the dependent variable for a one-unit change in the independent variable.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Assumptions made regarding simple linear regression include the following:

A
  1. A linear relationship exists between the dependent and the independent variable.
  2. The variance of the residual term is constant (homoskedasticity).
  3. The residual term is independently distributed (residuals are uncorrelated).
  4. The residual term is normally distributed.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Linear Relationship

A

A linear regression model is not appropriate when the underlying relationship
between X and Y is nonlinear

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Homoskedasticity

A

refers to the case where prediction errors all have the same variance

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Normality

A

When the residuals (prediction errors) are normally distributed, we can conduct hypothesis testing for evaluating the goodness of fit of the model

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Outliers

A

observations (one or a few) that are far from our regression line (have
large prediction errors or X values that are far from the others)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Analysis of variance (ANOVA)

A

a statistical procedure for analyzing the total
variability of the dependent variable.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

The total sum of squares (SST)

A

measures the total variation in the dependent
variable

SST = total (Yi – Ymean)^2

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

The mean square regression (MSR)

A

the SSR divided by the number of independent variables

(MSR) = RSS/k

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

The sum of squares regression (SSR)

A

measures the variation in the dependent variable that is explained by the independent variable

RSS = total (expected Yi – Ymean)^2

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

The sum of squared errors (SSE)

A

measures the unexplained variation in the
dependent variable

total (Yi – expected Yi)^2

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

The mean squared error (MSE)

A

is the SSE divided by the degrees of freedom,
which is n − 1 minus the number of independent variables

(MSE) = SSE/(n-k-1)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Total variance formula

A

total variation = explained variation + unexplained variation
or:
SST = SSR + SSE

16
Q

Standard Error of Estimate (SEE)

A

is the standard deviation of its residuals. The lower the SEE, the better the model fit:

(SEE) = căn (MSE)

Measure the degree of variability of the actual Y values relative to the estimated Y values from a regression equation

17
Q

Coefficient of Determination (R2)

A

The percentage of the variation of the dependent variable that is explained by the independent variable

= SSR/SST => Càng lớn thì đường regression càng đúng

18
Q

An F-test meaning

A
  • assesses how well a set of independent variables, as a group, explains the
    variation in the dependent variable
    or
  • evaluate whether independent variable explain the variance of dependent variable

o Ho: b1 = 0; Ha: b1 # 0
o One-tailed test
o F = MSR/MSE
o Critical value: depend on the level of significant and and 2 degrees of freedom: k and n – k – 1
o Reject Ho nếu F-statistic > Critical value

19
Q

Hypothesis Test of a Regression Coefficient

A

o Slope coefficient (b1^): Tb1 = (b1^ - b1)/Sb1
o Pair-wise correlation (p)
o Intercept (bo^)
o Two-tailed statistic

20
Q

Confidence Intervals for Predicted Values

A

o Y^ +- (tc x st)
o tc = two-tailed critical t-value at the desired level of significance with df = n-2
o sf = standard error of the forecast

21
Q

Log-lin model

A

This is if the dependent variable is logarithmic, while the independent variable is linear.

22
Q

Lin-log model

A

This is if the dependent variable is linear, while the independent
variable is logarithmic

23
Q

Log-log model

A

Both the dependent variable and the independent variable are
logarithmic.