Chp 3 Provost And Fawcett Flashcards
Supervised segmentation
How can we segment the population into groups that differ from each other with respect to some quantity of interest
Information
Quantity that reduces uncertainty about something
Tree induction
Incorporates the idea of supervised sit on in an elegant manner repeatedly selecting informative attributes
Model
Simplified representation of reality created to serve a purpose
Predictive model
Formula for estimating the unknown value of interest (the target)
Prediction
Estimate an unknown value
Descriptive modeling
Gain insight into the underlying phenomenon or process
Supervised learning
Model creation where the model describes a relationship between a set of selected variables(attributes or features) and a predefined variable called the target variable
Instance/example/row
Represents a fact or data point
Instance/feature vector
Described by a set of attributes(fields, columns, variables, features)
Model induction
Creation of models from data
Training data
Input data for the induction algorithm used for the inducing model
Entropy
A measure of disorder that can be applied to a set
Information gain (IG)
Measures how much an attribute improves (decreases the entropy number) over the whole segmentation it creates
Instance space
Space described by the data features