W4: Poisson Regression Flashcards

1
Q

What if we are interested in a response variable (which is a count) and its association with more than one binary covariate

Which test do we perform?

A

Poisson and negative bionominal regression

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

When is Poisson regression suitable?

A

When we have a response variable in form of count and one or more IV (covariates) which can be discrete or continous

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Form of Poisson regression equation

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

In Poisson Regression, M in the equation stands for?

A

Mean count for the individual with covariates values X1, X2…

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Once we have values of a, b1, b2 we can use Poisson regression equation to make predicts by reversing the log transformation

Prediction:

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Assumptions of Poisson Regression (3)

A
  1. Observations are independent
  2. Disturbition of counts follow a Poisson disturbition
  3. The mean and variance of the model as the same
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Which assumption of Poisson regression is often violated?

A

Assumption 3: Mean and variance of the model are the same

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Assumption 3 of the Poisson regression is often violated
This is when

A

variance of the model is larger than mean –> this is known as overdispersion

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

To assess whether a model is overdispered (Poisson reh) is to look at the

A

Chi-squared test statistic x^2 divided by model’s DF

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

If Chi-squared (x^2)/DF = 1 then

Poisson/NB

A

mean and variance of the model are equal

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

If Chi-squared (x^2)/DF > 1 then

Poisson/ NB regression

A

A value much larger than 1 indicated overdispersion

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Assessing model fit using (2)

Poisson/NB reg

A
  • AIC value
  • Omnibus test
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What does the Omnibus test the null hypothesis that:

Poisson/NB test

A

HO = The model is no better than an intercept-only model

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

In other words, what H0 the Omnibus test test?

Poisson/NB

A

Our Poisson regression with our X variables used to predict this count is no better than guessing.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

If the Omnibus test is significant then -

Poisson/NB

A

The model is superior to a null model

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Testing significance of predictors using __ test in Poisson regression

A

Wald

17
Q

Testing significance of predictors using Wald test in Poisson regression
We test the null hypothesis that:

A
18
Q
  1. First thing to write up in Poisson regression:
A

A Poisson regression was performed to assess whether X2(, e.g., Spend on campagin ) and X2 (e.g., Emotiveness of advertisement) affected/predicted the number of Y (e.g., number of clicks)

19
Q

Step 2 of Poisson is writing regression equation (in SPSS) - (2)

A

The regression equation took the form of:

log (Mean number of clicks) = 3.0414 + 0.372 (Spend) - 0.717 (Emotive)

20
Q

Step 3 of Poisson signifiance of covariates (in SPSS)

A

Both Spend and Emotive are significant at 0.1% level as predictors of Number of Clicks, and therefore we retain both covariates in the model

21
Q

Step 2 of signifiance of covariates in Poisson regression, what if one covariate is not significant at any level?

A

Therefore this should be removed and refit the model again

22
Q

Step 2 and 3 of Poisson regression of writing equation and sig of covariates in R

A
23
Q

Step 3 of Poisson regression: Testing for overdispersion (in SPSS) from goodness of fit table - (2)

A
  • From the goodness of fit table, we see the Chi-squared test statistic divided by its DF is 57.623.
  • This is much larger than 1 and therefore we have very strong evidence of overdispersion in the data
24
Q

Step 3 of Poisson regression: Testing for overdispersion (in R) from goodness of fit table - (2)

A

Divide 15.295/16

25
Q

Step 4 of Poisson is Omnibus test (in SPSS) - (3)

A
  • The Omnibus test is significant at 0.1% level
  • So model is superior to null model
  • Including X1 (e.g., Stress) and X2 (e.g., Distance) to predict Y (e.g., Sick days) is better than a null model
26
Q

There is no Omnibus test in

Poisson/ Negative NB

A

R

27
Q

Step 5 of Poisson Regression is writing the AIC (in SPSS) - (2)

A

The AIC for this model is 1148.239
Cna be used to compare to other models

28
Q

Step 5 of Poisson Regression is writing the AIC (in R)

A
29
Q

If we have overdispersion in the data, then Poisson regression is not suitable
Instead we use:

A

Negative bionominal regression

30
Q

If we have lots of zero counts then zero-inflated Poisson regression is

A

suitable

31
Q

The negative bionominal regression equation is same in (5)

A
  • The steps of Poisson regression
  • Same interpreting AIC value and overdispersion
  • Same in extracting values for equation
  • Same regression equation as Poisson regression
  • Same assumptions
32
Q

In negative bionominal regression we only need to compare the

A

AIC value to other model (i.e., Poisson regression model)

33
Q

What is AIC?

Negative / Poisson Reg

A

Measure of goodness-of-fit of regression models

34
Q

What is the preferred model based on AIC?

A

Is one with lowest AIC value

35
Q

Assumptions of negative bionominal regression are: (2)

A

Observations are independent
The counts follow a negative bionominal disturbition

36
Q

Two assumptions of negative bionominal regression is satisfied

A

when creating scatterplot of predicted values against residuals

37
Q

If negative bionominal regression scatterplot look like this then - (2)

A

We see a pattern here: variance decreases (range of points lower/less spread of points in residuals) as predicted values increase
This provides evidence against our assumptions

38
Q

Check indepndence observation in Poission regression by

A

plotting a scatterplot of predicted values against residuals

39
Q

Checking indepndence assumption of Poisson in this graph in R, interpret it - (2)

A
  • we don’t have independence in our observations so assumption of independence might not be met as we have got this big gap in the middle of this plot
  • not obvious why there is this gap