7. Flashcards
How categorical dependent variable regression can be described?
Categorical dependent variable regressions means we are trying to see how different factors affect if an observation belongs to a particular group.
Where categorical dependent variable regression could be used for?
Create a model that predicts if a company will, or will not, go bankrupt.
Identify which factors increase / decrease the probability that a political party will or not will be elected.
Create a filter that classifies emails into spam and non-spam.
What are different types of caregorical dependent variable regressions?
The linear probability model
The binomial logit model
Problems with the linear probability model
R-squared is innacurate messure
The left side of equation could only be between 0 and 100%
What is the binomial logit model?
The binomial logit model β a regression equation model with a binary dependent variable that uses logs and exponents.
What are advantages of binomial logit model?
Easy to understand
It indicates by how much a change in the independent variables affects the dependent variable
How the binomial logit model works?
The binomial logit model models the probability of the default situation (i.e., π·_π=1).
The output / prediction of the regression equation ranges from 0 to 1.
Given the specific of this approach it does not estimate the regression line using OLS. Instead it uses something called Maximum Likelihood (ML). Because of this, the only thing you should check is collinearity
Decisions to which group an observation should be classified are made based on some decision threshold. For example, if the probability of a default is > 0.5 for an individual, we can say he/she will default, while if it is below, we can say he/she will not.
What are different strategies of interpreting results of binomial logit model?
- Changing an average observation. Create an average observation by plugging the means of all independent variables. Then change one independent variable and calculate how the dependent variable probabilities change.
- Calculating the odds ratio. Odds ratio β compares the probability of π·_π=1 to π·_π=0. It is calculated as follows: π^(π½_π )=(P(π·_π=1))/(P(π·_π=0))
- Partial derivative. The expected value of (π·_π )Μ that is associated with one unit increase in π_1π is equal to π½_1 (π·_π )Μ (1β(π·_π )Μ)
- Dividing coefficient by 4. Dividing the coefficient by 4 provides the upper bound marginal effect of an independent variable
What is a confusion matrix?
Confusion matrix β a table indicating correct and incorrect predictions. Accuracy of itβs prediction is calculated by 1 + 4 / 1 + 2 + 3 + 4
Whatare different alternatives of models?
Probit β uses the standard normal distribution to derive similar predictions as the logit model.
Tobit β used when there are variables that are censored β we only have partial information.
Ordered logit/probit models β a logit/probit model that can be used with an ordered dependent variable.
Multinomial logit model β a logit model that can be used with a categorical variable with many categories.