POISSON REGRESSION Flashcards
Properties of a poisson distribution:
- Discrete (0, 1, 2, etc.)
- Strictly positive
- No natural upper limit (unlike binomial)
What is the sole parameter of the poisson distribution?
Lambda
- The mean rate of occurrence of the event
What’s an offset and what the use?
- An offset allows to model counts per unit time rather than simply counts
- It’s a variable that is forced to have a regression coefficient of 1
What effect measure can we model with poisson?
Incidence rates
Btw, what effect measure can we model with logistic?
Cumulative incidence in cohort studies with equal follow-up of study participants
3 commonalities between poisson and other regression models
- Outcomes are independent
- A linear model is fitted
- Coefficients are obtained and interpreted in usual manner
4 differences between poisson and other regression models
- It models expected counts (rate)
- The underlying mathematics and underlying probability distribution theory are different
- E(mean) = E(variance)
- No danger with negative predicted values (antilog of negative regression values is between 0 and 1)
What is overdispersion?
When there is a larger variance than what is assumed in a model
Also: observed variance is greater than the mean
4 sources of overdispersion
- Unobserved heterogeneity
- Clustering
- Contagion or diffusion
- Classical measurement error
What is the effect of overdispersion?
The point estimates are accurate but less precise than they say
When would underdispersion occur?
Negative correlations induced by contagion and clustering (rare)
What are two alternatives in the case of overdispersion?
- quasi poisson: assumes var(Y)= theta*u
- negative binomial: assumes Var(Y)=u(1+ ku)
theta = quasi-poisson overdispersion parameter k(1/theta) = shape parameter of negative binomial distribution
What is zero-inflation?
When we have a bunch of 0s in a variable - it addresses both excess zeros and implicitly over-dispersion
What’s up with the maximum likelihood estimator in the case of over-dispersion?
If the mean does not equal the variance,
the mle is consistent
but gives the wrong standard errors
Two types of residuals for poisson?
- Pearson residuals
- Deviance residuals