Week 6: Linear Regression 1 Flashcards

1
Q

What is the difference between the independent and dependent variable?

A

Dependent variable = outcome, response variable
Independent variable = Control, explanatory, input variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What does correlation measure?

A

Measures the degree of linear association between 2 variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What are the 3 steps to describe how a regression works?

A

1) Develop an estimating equation (y = f(x)), and learn the pattern of the relationship
2) Determine the degree to which the variables are related
3) Examine how well the estimating equation describes the relationship

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is the standard estimating equation?

A

y = mx + c

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

How to determine best fit line?

A

SSR, similar to SSE

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Why do we square the errors?

A

1) To magnify larger errors
2) To cancel the effect of positive and negative values

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

How do you calculate SST and SSR?

A

SST = (y_actual - y_mean)^2
SSR = (y_hat - y_mean)^2

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is R^2 and how do you calculate it

A

Coefficient of determination. Calculated by taking SSR/SST

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What are some possible ways to assess relationships?

A
  • Graphical visualizations like scatter plots
  • Correlation coefficient
  • Linear regression
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What are the types of relationships between 2 variables?

A

1) Strong, positive/negative linear
2) positive/negative linear
3) Perfect positive/negative linear
4) Parabolic
5) Curvilinear (Exponential)
6) No relationship

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

How do you calculate the total variation of error?

A

Explained + Unexplained variation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is R^2?

A

a statistical measure that represents the proportion of variance in the dependent variable that can be explained by the independent variable(s) in a regression model. In other words, R-squared is a measure of how well the regression line fits the data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly