Book 1_Quan_Simple Linear Regression Flashcards
Linear regression definition
provides an estimate of the linear relationship between an
independent variable (the explanatory variable) and a dependent variable (the
predicted variable)
The general form of a simple linear regression model
- Yi = b0 + b1·Xi + εi
+ b1 = fitted slope coefficient = Cov(X, Y)/Var(X) = COVxy/s_x²
+ b0 = fitted intercept = Ȳ − b1·X̄ (computed from the sample means, so the fitted line passes through the point of means)
+ Dependent variable: Y
+ Independent variable: X
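The fitted coefficients can be computed directly from sample moments. Below is a minimal sketch on made-up data (the x and y arrays are purely hypothetical):

```python
# Minimal sketch (assumed data): estimating b1 and b0 by OLS with NumPy.
import numpy as np

x = np.array([1.0, 2.0, 3.0, 4.0, 5.0])   # hypothetical independent variable
y = np.array([2.1, 3.9, 6.2, 7.8, 10.1])  # hypothetical dependent variable

# Slope: sample covariance of X and Y divided by sample variance of X
b1 = np.cov(x, y, ddof=1)[0, 1] / np.var(x, ddof=1)
# Intercept: the regression line passes through the point of means
b0 = y.mean() - b1 * x.mean()

print(f"b1 = {b1:.4f}, b0 = {b0:.4f}")
```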
The estimated intercept, b0
represents the predicted value of the dependent variable when the independent variable
equals zero, i.e., the point of intersection of the regression line and the axis of the
dependent variable (usually, the vertical axis)
The estimated slope coefficient, b1
is interpreted as the
change in the dependent variable for a one-unit change in the independent variable.
Assumptions made regarding simple linear regression include the following:
- A linear relationship exists between the dependent and the independent variable.
- The variance of the residual term is constant (homoskedasticity).
- The residual term is independently distributed (residuals are uncorrelated).
- The residual term is normally distributed.
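These assumptions can be checked informally from the residuals. The sketch below (same hypothetical data as above) uses a Shapiro-Wilk test for normality and a hand-computed Durbin-Watson statistic for independence; these particular tests and thresholds are illustrative choices, not prescribed by these cards:

```python
# A hedged sketch of basic residual checks (data and tests are assumptions).
import numpy as np
from scipy import stats

x = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
y = np.array([2.1, 3.9, 6.2, 7.8, 10.1])

b1 = np.cov(x, y, ddof=1)[0, 1] / np.var(x, ddof=1)
b0 = y.mean() - b1 * x.mean()
resid = y - (b0 + b1 * x)

# Normality: Shapiro-Wilk test on the residuals
w_stat, p_norm = stats.shapiro(resid)

# Independence: Durbin-Watson statistic (values near 2 suggest no autocorrelation)
dw = np.sum(np.diff(resid) ** 2) / np.sum(resid ** 2)

print(f"Shapiro-Wilk p-value: {p_norm:.3f}, Durbin-Watson: {dw:.2f}")
```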
Linear Relationship
A linear regression model is not appropriate when the underlying relationship
between X and Y is nonlinear
Homoskedasticity
refers to the case where prediction errors all have the same variance
Normality
When the residuals (prediction errors) are normally distributed, we can conduct hypothesis tests to evaluate the model's goodness of fit
Outliers
observations (one or a few) that are far from the regression line (i.e., have
large prediction errors) or have X values that are far from the other observations
Analysis of variance (ANOVA)
a statistical procedure for analyzing the total
variability of the dependent variable.
The total sum of squares (SST)
measures the total variation in the dependent
variable
SST = Σ(Yi − Ȳ)²
The mean square regression (MSR)
the SSR divided by the number of independent variables (k)
MSR = SSR/k
In simple linear regression, k = 1, so MSR = SSR
The sum of squares regression (SSR)
measures the variation in the dependent variable that is explained by the independent variable
SSR = Σ(Ŷi − Ȳ)²
The sum of squared errors (SSE)
measures the unexplained variation in the
dependent variable
SSE = Σ(Yi − Ŷi)²
The mean squared error (MSE)
is the SSE divided by the degrees of freedom, which is n − k − 1
(n minus one, minus the number of independent variables)
MSE = SSE/(n − k − 1)
Total variation formula
total variation = explained variation + unexplained variation
or:
SST = SSR + SSE
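The decomposition can be verified numerically. A sketch on the same hypothetical data, computing SST, SSR, SSE, MSR, and MSE exactly as defined above:

```python
# Sketch of the ANOVA decomposition (SST = SSR + SSE) on assumed data.
import numpy as np

x = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
y = np.array([2.1, 3.9, 6.2, 7.8, 10.1])

b1 = np.cov(x, y, ddof=1)[0, 1] / np.var(x, ddof=1)
b0 = y.mean() - b1 * x.mean()
y_hat = b0 + b1 * x

sst = np.sum((y - y.mean()) ** 2)      # total variation
ssr = np.sum((y_hat - y.mean()) ** 2)  # explained variation
sse = np.sum((y - y_hat) ** 2)         # unexplained variation

k, n = 1, len(y)
msr = ssr / k
mse = sse / (n - k - 1)

print(f"SST={sst:.4f}  SSR+SSE={ssr + sse:.4f}  MSR={msr:.4f}  MSE={mse:.4f}")
```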
Standard Error of Estimate (SEE)
is the standard deviation of the regression's residuals. The lower the SEE, the better the model fit:
SEE = √MSE
It measures the degree of variability of the actual Y values relative to the estimated Y values from the regression equation
Coefficient of Determination (R2)
The percentage of the variation of the dependent variable that is explained by the independent variable
R² = SSR/SST; the higher the R², the better the regression line fits the data. In simple linear regression, R² equals the squared correlation between X and Y (r²)
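A short sketch tying SEE and R² to the ANOVA quantities (same hypothetical data); it also checks the simple-regression identity R² = r²:

```python
# Sketch: computing SEE and R² on assumed data; R² should equal r² here.
import numpy as np

x = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
y = np.array([2.1, 3.9, 6.2, 7.8, 10.1])

b1 = np.cov(x, y, ddof=1)[0, 1] / np.var(x, ddof=1)
b0 = y.mean() - b1 * x.mean()
y_hat = b0 + b1 * x

sst = np.sum((y - y.mean()) ** 2)
ssr = np.sum((y_hat - y.mean()) ** 2)
sse = np.sum((y - y_hat) ** 2)

see = np.sqrt(sse / (len(y) - 2))  # SEE = sqrt(MSE), with n - k - 1 = n - 2 df
r2 = ssr / sst

# In simple linear regression, R² equals the squared sample correlation
assert np.isclose(r2, np.corrcoef(x, y)[0, 1] ** 2)
print(f"SEE = {see:.4f}, R² = {r2:.4f}")
```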
An F-test meaning
- assesses how well a set of independent variables, as a group, explains the
variation in the dependent variable
or - evaluates whether the independent variable explains the variation in the dependent variable
o H0: b1 = 0; Ha: b1 ≠ 0
o One-tailed test
o F = MSR/MSE
o Critical value: depends on the significance level and two degrees of freedom, k and n − k − 1
o Reject H0 if F-statistic > critical value
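A sketch of the F-test mechanics (hypothetical data, an assumed 5% significance level), using scipy.stats.f.ppf for the critical value:

```python
# Sketch of the F-test for a simple regression (assumed data, alpha = 0.05).
import numpy as np
from scipy import stats

x = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
y = np.array([2.1, 3.9, 6.2, 7.8, 10.1])

b1 = np.cov(x, y, ddof=1)[0, 1] / np.var(x, ddof=1)
b0 = y.mean() - b1 * x.mean()
y_hat = b0 + b1 * x

k, n = 1, len(y)
msr = np.sum((y_hat - y.mean()) ** 2) / k
mse = np.sum((y - y_hat) ** 2) / (n - k - 1)

f_stat = msr / mse
f_crit = stats.f.ppf(0.95, k, n - k - 1)  # one-tailed critical value

print(f"F = {f_stat:.2f}, critical = {f_crit:.2f}, reject H0: {f_stat > f_crit}")
```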
Hypothesis Test of a Regression Coefficient
o Slope coefficient (b1^): t = (b1^ − b1)/s_b1, where s_b1 is the standard error of the slope
o Pairwise correlation (ρ): tests H0: ρ = 0 using t = r·√(n − 2)/√(1 − r²)
o Intercept (b0^): t = (b0^ − b0)/s_b0
o These are two-tailed tests with df = n − 2
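A sketch of the slope t-test under H0: b1 = 0 (hypothetical data, assumed 5% significance level); s_b1 uses the standard formula √(MSE / Σ(xi − x̄)²):

```python
# Sketch of the t-test on the slope (H0: b1 = 0), on the same assumed data.
import numpy as np
from scipy import stats

x = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
y = np.array([2.1, 3.9, 6.2, 7.8, 10.1])

b1 = np.cov(x, y, ddof=1)[0, 1] / np.var(x, ddof=1)
b0 = y.mean() - b1 * x.mean()
resid = y - (b0 + b1 * x)

n = len(y)
mse = np.sum(resid ** 2) / (n - 2)
s_b1 = np.sqrt(mse / np.sum((x - x.mean()) ** 2))  # standard error of the slope

t_stat = (b1 - 0) / s_b1
t_crit = stats.t.ppf(0.975, n - 2)  # two-tailed, alpha = 0.05

print(f"t = {t_stat:.2f}, critical = ±{t_crit:.2f}, reject H0: {abs(t_stat) > t_crit}")
```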
Confidence Intervals for Predicted Values
o Ŷ ± (tc × sf)
o tc = two-tailed critical t-value at the desired level of significance, with df = n − 2
o sf = standard error of the forecast
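A sketch of a prediction interval at a new X value (hypothetical data and forecast point). The forecast-variance expression used here is the standard one, sf² = SEE²·(1 + 1/n + (X − X̄)²/((n − 1)·s_x²)):

```python
# Sketch of a prediction interval for Y at a new X (Y_hat ± t_c * s_f).
import numpy as np
from scipy import stats

x = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
y = np.array([2.1, 3.9, 6.2, 7.8, 10.1])

b1 = np.cov(x, y, ddof=1)[0, 1] / np.var(x, ddof=1)
b0 = y.mean() - b1 * x.mean()

n = len(y)
see2 = np.sum((y - (b0 + b1 * x)) ** 2) / (n - 2)   # SEE squared = MSE

x_new = 6.0                                          # hypothetical forecast point
y_hat = b0 + b1 * x_new
# Forecast variance: sf² = SEE² * (1 + 1/n + (X − X̄)² / ((n − 1)·s_x²))
s_f = np.sqrt(see2 * (1 + 1/n + (x_new - x.mean())**2 / ((n - 1) * np.var(x, ddof=1))))

t_c = stats.t.ppf(0.975, n - 2)
print(f"95% interval: {y_hat - t_c * s_f:.2f} to {y_hat + t_c * s_f:.2f}")
```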
Log-lin model
This applies when the dependent variable is logarithmic while the independent variable is linear: ln(Yi) = b0 + b1·Xi
Lin-log model
This applies when the dependent variable is linear while the independent
variable is logarithmic: Yi = b0 + b1·ln(Xi)
Log-log model
Both the dependent variable and the independent variable are
logarithmic: ln(Yi) = b0 + b1·ln(Xi). The slope b1 is interpreted as the relative change in Y for a relative change in X
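Each functional form can be fitted with ordinary OLS after transforming the variables. A sketch on hypothetical positive-valued data (so the logs are defined):

```python
# Sketch: fitting the three functional forms by transforming variables first.
import numpy as np

x = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
y = np.array([2.1, 3.9, 6.2, 7.8, 10.1])

def ols(u, v):
    """Intercept and slope of v regressed on u by ordinary least squares."""
    b1 = np.cov(u, v, ddof=1)[0, 1] / np.var(u, ddof=1)
    return v.mean() - b1 * u.mean(), b1

print("log-lin:", ols(x, np.log(y)))          # ln(Y) = b0 + b1*X
print("lin-log:", ols(np.log(x), y))          # Y = b0 + b1*ln(X)
print("log-log:", ols(np.log(x), np.log(y)))  # ln(Y) = b0 + b1*ln(X)
```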