Linear models refresher Flashcards

1
Q

What are deterministic models?

A

Given the same input, functions will return exactly the same output
e.g. y = mx +c

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are statistical models?

A

they contain some pattern but there is variation around this pattern
e.g. given someone’s age we could attempt to predict their height

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What are linear models?

A

They are a type of statistical model

outcome = (model) + error

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Interaction terms

A

These occur when we can’t control for factors, so we include interaction terms to try and control for the effect on y
“as we move up 1 unit in x1, the slope of x2 changes by…”

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is standard error?

A

how much we estimate our sample will vary due to random error

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

what is the p-value

A

it is the likelihood of obtaining a result equal to or more extreme than one of our observations if the null is true
We use this to evaluate our null hypothesis
The smaller the p-value, the less likely the null is

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is the sums of squares residual (SSresidual)?

A

the difference between the model line and each observed value of y
(y - y hat)
allows us to consider the model as a whole - rather than as specific coefficients

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is the sums of squares total (SStotal)?

A

the difference between the mean of y and each observed value of y
(y - y bar)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is the sums of squares model (SSmodel)?

A

the difference between the model line and the mean of y
(y hat - y bar)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is R squared?

A

= how much variance in y is due to our predictors (we want this to be high)

R^2 = SSmodel / SStotal
R^2 = 1 - ( SSresidul / SStotal )

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

what are joint tests?

A

they allow us to make inferences about the improvement of model fit with the inclusion of additional parameters (this allows us to test the reduction in SSresidual)

in R this is what the anova() function is for

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What are the assumptions of a linear model?

A

Our residuals (y - y hat) reflect everything we don’t account for in our models - in an ideal world these residuals would be just like random noise

We check how much ‘like randomness’ the residuals appear to be:
- mean of 0
- normal distribution
- constant variance ( = no detectable plotted pattern)
- independent and identically distributed

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What to do if our model does not meet assumptions:

A
  • check if the model is mis-specified
  • transform the outcome variable?
  • bootstrap?

these will not help if the independence assumption is violated

How well did you know this?
1
Not at all
2
3
4
5
Perfectly