Ensemble Learning Algorithms With Python Flashcards
WHAT ARE THE REASONS WE USE ENSEMBLE ALGORITHMS? P6
Reliability: they can reduce the variance of the predictions
Skill: Ensembles can achieve better performance than a single model
WHAT ARE THE CLASSES IN SCIKITLEARN.ENSEMBLE FOR USING BAGGING IN DIFFERENT TYPES OF PROBLEMS? P7
BaggingClassifier and BaggingRegressor
DOES RANDOM FOREST USE ALL THE FEATURES FOR MAKING ITS DECISION TREES? EXPLAIN HOW IT OPERATES P9
No, it randomly chooses a subset of features for each tree and considers only them at each split point
WHAT IS THE PARAMETER FOR CHOOSING THE NUMBER OF FEATURES FOR RANDOM FOREST TO CONSIDER AT EACH SPLIT CALLED?
max_features parameter
WHAT IS THE BASE MODEL FOR ADABOOST? WHAT ARE ITS PROPERTIES? P11
Decision tree (but since they are very short, they’re called decision stumps, they make a single decision on one input variable before making a decision)
WHAT IS EXPECTED FROM THE MODEL WHEN WE ADD MORE ENSEMBLE MEMBERS? P11
New models attempt to correct the errors made by prior models already added to the ensemble. As such, the more ensemble members that are added, the fewer errors the ensemble is expected to make
WHAT ARE THE TWO MORE EFFICIENT IMPLEMENTATION OF GRADIENT BOOST CALLED? P13
XGBoost and LGBM
WHAT IS THE MAIN DIFFERENCE BETWEEN VOTING AND STACKING? P17
In stacking, a final estimator is used to learn how to best combine the predictions of the base models.
WHICH MODELS ARE USUALLY USED AS THE FINAL ESTIMATOR IN STACKING ENSEMBLE? P17
Linear models: Linear regression and Logistic regression but can be any model in general