Model Selection Flashcards
What is a fixed factor?
An explanatory variable where the level of the explanatory variable is meaningful.
If we wish to draw inferences about the effects of that particular level of the explanatory variable on the response variable we can
The factor is completely repeatable at all levels of the explanatory variable
What is a random factor?
An explanatory variable where the level of the explanatory variable is not meaningful.
E.g fishes in a population
Not exactly repeatable
What do linear models assume?
Main factors impact the outcome in a predictable way and all other variation is due to error.
This assumes independence of errors (errors are distributed independently throught the data set).
What do linear models assume?
Main factors impact the outcome in a predictable way and all other variation is due to error.
This assumes independence of errors (errors are distributed independently throught the data set).
When is independence of errors violated?
1) If you have repeated measurements from different biological subjects the effect of random differences between these subjects will not be distributed independently throughout the data set.
2) if the experimental design is nested random differences at higher levels of nesting will not be distributed independently throughout the data set.
What is independence of errors?
Errors are distributed independently throughout the data set
Why is taking into account of nesting important?
If a biological individual was nested within a group. The randomness of that individual may skew results of observations from that group if that randomness isn’t accounted for in the model.
What are mixed models?
Models that allow us to include both random and fixed explanatory variables.
Why are mixed models useful?
They allows us to fit models which accurately account for different sources of variation in the data set.
What is used to determine the importance of different factors in mixed models?
Likelihood ratio test
What is the likelihood of a model?
The probability of observing our data given the model.
These tell us if models are different from one another.
(Useful when you compare likelihoods between models)
What is a better likelihood score 15 or 20?
20
How do would you know if removal of a explanatory variable from the mixed model has an effect and the explanatory variable is important?
If p-value of likelihood ratio test (comparison of original model and model with removed variable) is <= 0.05 then the explanatory variable you removed is important.
What is a random intercepts model?
A model that assumes intercepts account for random differences between a variable and the slope is constant.
What is a random slopes and intercepts model?
A model where Random effects from person to person are captured by gradients as well as intercepts.