Generalised Linear Model (week 6-8) Flashcards
Where does GLM used for?
General / health insurance pricing
GLM formula?
g(μ) = g(E(Y)) = α + β1X1 + … + βkXk = η
what does g represent?
link function
what is η
linear predictor
what is μ?
g^(-1) (η)
what is the symbol of dispersion parameter
�
b”(teta) from PDF represents
variance function
what is canonical link
transform mean to natural exponential
Why do we need GLM?
Because when the dist is normal, we use PDF to calc P-val or CI. However, if its normally dist, heteroskedacity, and non-linear, we use GLM
link function
we transforming the predictions, or everything except the dependent var
binomial (binary) follow what dist
logistic regression
when we use poisson?
if we have skewed discrete dist
-“num of time u …”
when to use neg binomial?
mean and median diff, unlike poisson
gamma dist when to use?
continuous dist, var must positive >0
how to do GLM? (long)
- what dist is this?
- look at the table, see which μ are u suing (formula sheet)
- write likelihood function ∏(fy)
- compute log likelihood function change ∏(fy) to ∑log(fy)
- fy is from formula sheet page 5 (dont forget exp can diturunin langsung kalau dikali with log)
- masukan the fy (from number 5) use number 2 μ
- derive alpha and beta and set to 0 (if we derive and hv x infront, the x stay still, gbisa di remove, if dont hv x, we can remove langsung all the alpha beta ))
Information Criteria is
-Assess goodness-of-fit and parameter parsimony
-For comparison between diff linear predictors/link functions
How too choose good IC?
find the lowest one
What are 2 types of CI?
AIC and BIC (more likely underfit)