Chapter 18 Predictive Modeling Project Template Flashcards
WHAT ARE THE STEPS FOR APPROACHING ANY PREDICTIVE MODELING ML PROJECT? P115 (Project Template P116)
Define The Problem Summarize Data Prepare Data Evaluate Algorithms Improve Results Present Results
WHAT ARE THE THINGS WE NEED TO DO TO PREPARE DATA? P117
Cleaning data by removing duplicates, marking missing values and even imputing missing values
Feature selection where redundant features may be removed and new features developed.
Data transforms where attributes are scaled or redistributed in order to best expose the structure of the problem later to learning algorithms
HOW CAN WE IMPROVE OUR MODELS? P118
Search for a combination of parameters (Tuning) for each algorithm using scikit-learn that yields the best results.
Combine the prediction of multiple models into an ensemble prediction using ensemble techniques.