SUL Topic 5b - Random Forest Flashcards

1
Q

Ensemble Models

A

Aggregation of multiple models where final prediction combines component model predictions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
1
Q

Random Forest

A

Ensemble learning method combining multiple decision trees for improved accuracy and stability

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Decision Tree Limitations

A

Overcome by random forests’ combination of simplicity and flexibility

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Bagging

A

Technique using bootstrap data sets and aggregating predictions to enhance predictive power

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Out-of-Bag Error Rate

A

Metric for assessing random forest accuracy and generalization to unseen data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Hyperparameter Tuning

A

Process of optimizing model performance by adjusting variables considered at each step

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Continuous Improvement

A

Ongoing refinement of random forest techniques to expand capabilities and applications

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Forest Algorithm

A

Sampling of rows and columns at each step, producing more diverse trees than bagging

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Advantages of Random Forests

A

Automatic handling of missing values
Better prediction
Natural safeguard against overfitting

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Bootstrap Sample

A

Training data used to construct an individual tree in a random forest

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Out-of-Bag Sample

A

Training data excluded during the construction of an individual tree

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Variable Reduction

A

Automatic process in random forests that requires less data preparation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Honest Assessment

A

Natural form provided by sampling and bagging in forests

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Interpretation Challenge

A

Random forests are difficult to interpret but serve as an ideal model for comparison

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Iterative Testing

A

Process of adjusting settings to select the most accurate random forest model

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Missing Data Handling

A

Capability of random forests to automatically address incomplete datasets

16
Q

Clustering in Random Forests

A

Technique explored to expand model capabilities and applications

17
Q

Predictive Accuracy

A

Often improved in random forests due to diverse tree ensembles

18
Q

Model Comparison

A

Random forests serve as a benchmark for other models’ performance

19
Q

Validation Data Assessment

A

Recommended practice in large data scenarios despite forest safeguards