5 - Introduction to Regression Flashcards

1
Q

What is regression?

A

A way of predicting the value of one variable from another

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is a regression model?

A

A hypothetical model of the relationship between 2 variables (usually, both are interval scale variables)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is linear regression?

A

A linear approach to modeling the relationship between 2 variables using the equation of a straight line

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is the Ordinary Least Squares approach?

A

An approach that minimizes the error of the sum of squares between the predictions made by the model, and the observed data (minimizing the sum of squares).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Why is it important to test the model?

A

The regression model is only based on the data, which means that the model may not reflect reality. We need to test how well the model fits the observed data and how well it predicts on new data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is the sum of squares used for?

A

To tell us how well the model fits the observed data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What are the 3 types of sum of squares?

A

Total Sum of Squares, Residual Sum of Squares, Model Sum of Squares

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What does the Total Sum of Squares (SST) tell us?

A

Variability between the observed dependent variable and its mean.
(aka. the total variance in the data).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What does the Residual Sum of Squares (SSR) tell us?

A

Variability between the regression model and the actual data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What does the Model Sum of Squares (SSM) tell us?

A

The model variability, so the difference in variability between the model and the mean
(aka. how well our line fits the data)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

How are SST, SSR, and SSM related?

A

SSM = SST - SSR

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

How is ANOVA used in Linear Regression?

A

An ANOVA test can be used to evaluate a regression model.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What should we expect if a model is a better tool for predicting than using the mean?

A

SSM should be much greater than SSR

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

MAKE A QUESTION ABOUT THE MEAN SQUARED ERROR

A

F = (Mean Squared Error

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

How is R^2 used in Linear Regression?

A

R^2 (Pearson Correlation Coefficient Squared) tells us the proportion of variance accounted for by the regression model. In other words, what variability in the dependent variable is explained by the independent variable.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

How do you run a linear regression test in R?

A

lm(formula, data)