Models for Count data Flashcards

Question 1

Q

Count variable

Answer

A

An ordinal variable that takes non-negative and discrete values: 0, 1, 2, 3, etc.

Question 2

Q

Types of models that can be used on count data:

Answer

A

1) Poisson regression
2) Negative binomial regression
3) Truncated poisson/negative binomial regression
4) Zero inflated poisson/negative binomial regression

Question 3

Q

Examples of count variables:

Answer

A

# of cars owned
# of drinks consumed at festival
# of products returned
# of complaints
# of stocks owned

Question 4

Q

Why not standard regression OLS?

Answer

A

Cause it assumes a normal distribution, but also:

- often very low mean (> 10, it would be appropriate)

Question 5

Q

Poisson distribution

Answer

A

with parameter landa
uses on parameter, since mean=variance
if landa > 10, probably normal distribution so use OLS

Question 6

Q

Poisson regression analysis

Answer

A

DV = Count variable
Goal to explain DV by set of Xi
Each Yi is randomly drawn from a poisson distribution
Mean = variance = landa
Outcome metric is related to Xi via link function

Question 7

Q

Link function

Answer

A

Yi = exp(XiB)

Question 8

Q

Model estimation, via..

Answer

A

Maximum likelihood estimation, the parameters will be estimated by searching for those parameters values that give the highest likelihood to observe the data.

Question 9

Q

Interpretation of the parameters

Answer

A

If Xi changes with one unit (keeping all constant) the expect count (landa) is multiplied by exp(b1)
Or.. if you use the LN of a Xi, parameter becomes elasticity. So increase in X leads to a % increase in landa (DV)

Question 10

Q

Model fit and selection

Answer

A

1) Likelihood
2) Likelihood ratio test -> Only for nested models
3) AIC, BIC, CAIC -> models may differ, however data should be the same.

Question 11

Q

Same violations or “cHaLLaNgEs”

Answer

A

1) Mean not equal to the variance
2) Zero events cannot be observed
3) More zeros then expected

Question 12

Q

Under dispersion

Answer

A

Mean > variance

Almost never happens. If so still use poisson regression

Question 13

Q

Over dispersion

Answer

A

Mean < variance

Use negative binomial regression.

Question 14

Q

Dispersion test

Answer

A

If significant, assumption is violated. So the poisson distribution is not suitable.

Question 15

Q

What to do if Zero events cannot be observed?

Answer

A

Use a truncated model. For the truncated negative binomial regression the second intercept represents the extra variable.

Question 16

Q

When is observing Zeros not possible?

Answer

Study These Flashcards

A

1) Basket size of online orders
2) Relationship length
3) Household size

Question 17

Q

What to do if there are more Zeros then expected?

Answer

Study These Flashcards

A

Investigate whether there is a peak at zero. If so? Two options:

1) Zero inflated models
2) Hurdle models or zero-altered models.

Models for Count data Flashcards

(17 cards)