predictive analytics; classification Flashcards
qualitative outcome
binary variable, yes or no, represented by numerical values 1 and 0
regression does not work for classification because
betas can push predicted outcomes past its bounds (0-1)
dummary/binary variable
way of transforming categorical variable in numerical values, and can often be binary variable
dependent and independent variables in classification
dependent = binary
independent = can be continuous
eg y = fruit, x = height, width
to read a classification tree
outcome on top node is overall prediction
- 0.smth under this represents probability that the outcome (throughout tree) is whatever was decided by the tree
- percentage under this represent proportion of data that fits into ctieria
accuracy rate
correct predictions/number of observations