7.1 Introduction to Linear Regression Flashcards

1
Q

A) What is the purpose of linear regression?

B) What does variation refer to?

A

A) To explain the variation in a dependent variable in terms of the variation in a single independent variable

B) Variation is the degree to which a variable differs from its mean value

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

A. What is the dependent variable? B. What are we trying to find out?

C. What is the independent variable?

A

A. The dependent variable is the variable whose variation is explained by the independent variable

B. We want to answer the question: What explains fluctuations in the dependent variable?

C. The independent variable is the variable used to explain the variation of the dependent variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is the sum of the squared errors (SSE)?

A

The sum of the squared vertical differences between the estimated and actual Y-values

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

A. What is the regression line?

B. What does it minimise?

A

A. It is the line that minimises the sum of the squared differences (vertical distances) between the Y-values predicted by the regression line and the actual Y-values.

B. It minimises the SSE

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

A. What is simple linear regression referred to as?

B. Why?

A

The ordinary least squares (OLS) method

The values determined by the estimate regression equation are called least squared estimates

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is the slope coefficient?

A

It describes the change in Y for a one unit change in X

It can be positive or negative

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What does the intercept equation highlight?

A

That the regression line passes through a point with coordinates equal to the mean of the independent and dependent variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

How do you interpret the slope coefficient and the intercept?

A

A. The slope coefficient is the amount that the dependent variable will change, given a 1% change in the independent variable

B. The intercept is the value of the dependent variable when the independent variable is zero

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What are the four assumptions underlying linear regression?

A
  1. A linear relationship exists between the independent and dependent variables
  2. The variance of the residual terms is constant for all observations (homoscedasticity)
  3. The residual term is independently distributed
  4. The residual term is normally distributed
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is heteroscedasticity?

A

Where the variance of the prediction errors is not constant

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Why is independence important?

A

The residual for one observation is not correlated with that of another observation

If X and Y are not independent, the residuals are also not independent, and our estimates of variance and model parameters will be incorrect

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Why is it important that the residual term is normally distributed?

When is non-normal acceptable?

What is the impact of outliers?

A

When the residuals (prediction errors) are normally distributed, we can conduct hypothesis testing to evaluate the goodness of best fit.

Non-normal is acceptable with a large sample size

Outliers will influence our parameter estimates so that the OLS model will not fit the other observations well

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is the formula for variation in Y? (Y = dependent variable)

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

How is the slope coefficient calculated?

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

How is the intercept calculated?

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is the simple linear regression model?

A
17
Q

What is the linear equation for the line of best fit?

A