Chapter 8: Introduction To Linear Regression Flashcards

1
Q

What is the goal of linear regression?

A

To model the relationship between two numerical variables.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is the regression line equation?

A

ŷ = b₀ + b₁x

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What does the slope (b₁) represent?

A

The change in the response variable for a one-unit change in the explanatory variable.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What does the intercept (b₀) represent?

A

The predicted value of the response variable when x = 0.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is a residual?

A

The difference between the observed value and the predicted value (eᵢ = yᵢ - ŷᵢ).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What method is used to find the best-fitting line in linear regression?

A

Least squares method.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Why do we use squared residuals instead of absolute values?

A

Because they are easier to compute and penalize larger errors more.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What are the conditions for using linear regression?

A

Linearity, nearly normal residuals, and constant variability.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is homoscedasticity?

A

When the variability of residuals is approximately constant across all x values.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What does R² represent?

A

The proportion of variability in the response variable explained by the model.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

How is R² calculated?

A

As the square of the correlation coefficient (R² = r²).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is the purpose of inference for the regression slope?

A

To test if there is a significant linear relationship between x and y.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What are the null and alternative hypotheses for slope inference?

A

H₀: β₁ = 0; Hₐ: β₁ ≠ 0.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is the test statistic formula for regression slope inference?

A

T = (b₁ - 0) / SE_b₁

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is the degrees of freedom for the regression t-test?

A

df = n - 2

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

How do you interpret the slope in context?

A

For each 1-unit increase in x, y is expected to change by b₁ on average.

17
Q

What is extrapolation in regression?

A

Making predictions outside the range of observed data.

18
Q

What is an outlier in linear regression?

A

A point that lies far from the others.

19
Q

What is a high leverage point?

A

A point far from the center of the x-values.

20
Q

What is an influential point?

A

A point that significantly affects the slope of the regression line.