Important 2: Logistic Regression and Multinominal Logit Models Flashcards
What is the Multinominal Logit Model?
is a choice model where the consumer chooces between j alternatives
–>Here as well as in the binary case we assume that the alternative which yields highest utility is chosen
What is the Independence of Irrelevant Alternative (IIA) Property?
Multinominal Logit Models
Multinominal Logit Models suffer from the IIA, this property states
that the odds of choosing one alternative over another are constant regardless of whcih other alternatives are present
Why is the IIA property not realistic in marketing?
- In many marketing applications this is not realistic especially if some alternatives are close substitutes
- If similarities across alternatives are incorrectly assumed , the estimated effects of marketing variables are incorrect
How to deal with the Independence of Irrelevant Alternaitves
- perform statistical test of IIA e.g. attraction model
- use choice model that explicitly accounts for the fact that consumer choice behavior is affected by the composition of the choice set
–>Other models that eliminate the IIA assummptions are the Nested Multinominal Logit Model and the Multinominal Probit model
What do Markov Chain model?
model behavior over time and consists of several observable (behavior, corresponding profits/losses) and unobservable (probabilities) factors
model how sequence of observations is related to transitioning among states
Markov Chain models define set of customer states based on…
The idea is to define the set of customer states based on:
- observed customer properties, e.g. the purchase of a product
- estimation of transition probabilities between the different states
- corresponding profits and losses
4 Probabilities of Markov Chain Models
- Transition probability: probability of going from one hidden state to another
- Emission probability: probability that observations are emitted from the hidden states ( prob observed outcome given particular state int he system
- Prior probability: basic likelihood for hidden state (prob that a random customer has a preferecne for brand A or B)
- Posterior probability: likelihood for hidden state given observation (prob that a customer prefers Brand A or brand B given that he buys brand A)
What is the goal of Markov Chain Models?
Capture….
Our goal is to capture dynamics in customer behavior over time, for
example, to see how firm interventions influence this behavior
For what do we use Logistic regression?
is used to predict binary or categorical outcomes (buy or not buy)
–>are binary choice models*
What are the two variables in logistic regression?
depdent variable: which is the variable of interest that we want to predict or explain
predictor variable/independent variable: are the variables used to explain or predict the depdent variable, these can be categorical but can also be continous (numerous)
How to infer prefercenes
(Markov Chain models)
Multiply every probabillity along the path (Viterbi Algorithm)
–>Machine learning system which identifies path with highest probability to be chosen
- finding most profitable preference combination
- If preference probabillities are close to each other advertising makes sense
What is necessary to check in the data, when doing Logistic regression?
–>used for binary dependable variables
- check whether there are characters turn them into factor levels
–>if indepent variable is character just check if the second or last entry is also the best if there is a best, for example if (Bundle, No Bundle –>here Bundle 1 and no Bundle 2
What is an interaction effect?
How is it incorporated in glm?
Interaction effect: refers to the combined effect of two or more predictor variables on the outcome
incorporated by: either Offer:Channel or by Offer*Channel —>always in reference to base in this case E-Mail
What happens in logistics regression when there are multiple predictiors? What is the baseline?
glm(purchase ~ coupon + Channel +..)
for Channek: E-Mail, Mail, Park
automatically glm takes the first variable –>E-Mail as baseline
–>all other variables in Channel are compared to the baseline
–>the
What is the meaning of the ods ratio?
5.6
How is this measure called?
Customer are 5.6 times more likely to purchase product when they receive a coupon (OfferBundle)
odds ratio is the association between variiables