Lecture 9 Flashcards
What are decision trees used for?
Classifying data by recursively splitting it based on feature values.
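A minimal sketch in Python (assuming scikit-learn is installed; the tiny feature matrix and labels below are made up purely for illustration):

# Fit a decision tree on a toy dataset and classify a new example.
from sklearn.tree import DecisionTreeClassifier

X = [[0, 0], [0, 1], [1, 0], [1, 1]]   # made-up feature vectors
y = [0, 1, 1, 0]                       # made-up class labels
clf = DecisionTreeClassifier().fit(X, y)
print(clf.predict([[0, 1]]))           # -> [1]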
What is an internal node in a decision tree?
A decision point that tests a feature to split data.
What is a leaf node in a decision tree?
A terminal node that assigns a class label.
Why are decision trees prone to overfitting?
They can learn noise and irrelevant patterns in training data.
What is the primary advantage of decision trees?
They are easy to interpret and visualize.
What is a common method to regularize decision trees?
Limiting tree depth or pruning unnecessary branches.
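A sketch of both options with scikit-learn on synthetic data; max_depth caps the depth and ccp_alpha applies cost-complexity pruning (the values chosen here are arbitrary):

# Cap tree depth and prune branches via cost-complexity pruning (ccp_alpha).
from sklearn.datasets import make_classification
from sklearn.tree import DecisionTreeClassifier

X, y = make_classification(n_samples=200, random_state=0)  # synthetic toy data
tree = DecisionTreeClassifier(max_depth=3, ccp_alpha=0.01).fit(X, y)
print(tree.get_depth())  # stays at or below the depth cap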
What is the standard algorithm for training decision trees?
The ID3 (Iterative Dichotomiser 3) algorithm.
What is the greedy strategy in decision tree learning?
Choosing the best split at each step without backtracking.
How do decision trees handle missing values?
They can assign the most common value or split on other features.
What is entropy in decision trees?
A measure of uncertainty (impurity) in the class labels of a dataset.
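A worked sketch on made-up label lists: for class proportions p_i, entropy is H(S) = -sum_i p_i log2(p_i).

# Entropy H(S) = -sum_i p_i * log2(p_i), computed on made-up label lists.
from math import log2

def entropy(labels):
    total = len(labels)
    return -sum((labels.count(c) / total) * log2(labels.count(c) / total)
                for c in set(labels))

print(entropy([1, 1, 0, 0]))  # 1.0 bit: a 50/50 split is maximally uncertain
print(entropy([1, 1, 1, 1]))  # 0.0 bits: a pure set has no uncertainty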
What is information gain?
The reduction in entropy after splitting on a feature.
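A worked sketch on a made-up split: the gain is the parent's entropy minus the size-weighted entropy of the children.

# Information gain = H(parent) - weighted average entropy of the children.
from math import log2

def entropy(labels):
    total = len(labels)
    return -sum((labels.count(c) / total) * log2(labels.count(c) / total)
                for c in set(labels))

def information_gain(parent, children):
    total = len(parent)
    return entropy(parent) - sum(len(c) / total * entropy(c) for c in children)

parent = [1, 1, 1, 0, 0, 0]
children = [[1, 1, 1], [0, 0, 0]]           # a perfect split on some feature
print(information_gain(parent, children))   # 1.0: entropy drops from 1 to 0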
What is the Gini index?
A measure of node impurity used to evaluate splits; lower values mean purer nodes.
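A worked sketch on made-up labels: Gini impurity is 1 - sum_i p_i^2, which is 0 for a pure node.

# Gini impurity = 1 - sum_i p_i^2 (0 for a pure node), on made-up labels.
def gini(labels):
    total = len(labels)
    return 1.0 - sum((labels.count(c) / total) ** 2 for c in set(labels))

print(gini([1, 1, 0, 0]))  # 0.5: the maximum for two classes
print(gini([1, 1, 1, 1]))  # 0.0: pure node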
What is overfitting in decision trees?
When a tree is too complex and memorizes training data instead of generalizing.
What is pruning in decision trees?
Removing branches that do not improve generalization.
What is an ensemble model?
A combination of multiple models to improve performance.
What is bagging in ensemble learning?
Training multiple models on different bootstrap samples of the data and averaging (or voting on) their predictions.
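A minimal bagging sketch with scikit-learn on synthetic data; BaggingClassifier's default base model is a decision tree:

# Bagging sketch: each tree is trained on a bootstrap sample of the data;
# predictions are combined by voting.
from sklearn.datasets import make_classification
from sklearn.ensemble import BaggingClassifier

X, y = make_classification(n_samples=200, random_state=0)  # synthetic toy data
bag = BaggingClassifier(n_estimators=50, random_state=0).fit(X, y)
print(bag.predict(X[:3]))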
What is the key idea behind random forests?
Building multiple decision trees with random feature selection to improve generalization.
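A random forest sketch with scikit-learn on synthetic data; max_features="sqrt" makes each split consider only a random subset of the features:

# Random forest sketch: many trees, each split drawn from a random feature subset.
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier

X, y = make_classification(n_samples=200, random_state=0)
rf = RandomForestClassifier(n_estimators=100, max_features="sqrt",
                            random_state=0).fit(X, y)
print(rf.score(X, y))   # training accuracy of the averaged ensemble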
What is boosting in ensemble learning?
A method that trains models sequentially, giving more weight to misclassified instances.
What is a weak learner in boosting?
A model that performs slightly better than random chance.
What is AdaBoost?
A boosting algorithm that combines weak learners to create a strong classifier.
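An AdaBoost sketch with scikit-learn on synthetic data; the default base learner is a depth-1 tree, i.e. a decision stump:

# AdaBoost sketch: each round reweights the samples the current ensemble
# misclassifies and fits the next weak learner to the reweighted data.
from sklearn.datasets import make_classification
from sklearn.ensemble import AdaBoostClassifier

X, y = make_classification(n_samples=200, random_state=0)
ada = AdaBoostClassifier(n_estimators=100, random_state=0).fit(X, y)
print(ada.score(X, y))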
What is gradient boosting?
A boosting method that optimizes a loss function by adding models sequentially.
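A gradient boosting sketch with scikit-learn on synthetic data:

# Gradient boosting sketch: each new tree is fit to the gradient of the loss
# with respect to the current ensemble's predictions, then added with a small
# learning rate.
from sklearn.datasets import make_classification
from sklearn.ensemble import GradientBoostingClassifier

X, y = make_classification(n_samples=200, random_state=0)
gb = GradientBoostingClassifier(n_estimators=100, learning_rate=0.1,
                                random_state=0).fit(X, y)
print(gb.score(X, y))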
How does boosting differ from bagging?
Boosting trains models sequentially, while bagging trains models independently.
What is the role of decision stumps in boosting?
One-level trees that split on a single feature; they serve as simple weak learners in boosting algorithms.
Why are ensemble models often better than single models?
They reduce variance and improve generalization.
What is out-of-bag error in bagging?
The error estimated for each model on the bootstrap samples it was not trained on.
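A sketch with scikit-learn on synthetic data; with oob_score=True, each sample is scored only by the trees whose bootstrap sample did not contain it:

# Out-of-bag error sketch using a random forest (a bagging ensemble of trees).
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier

X, y = make_classification(n_samples=500, random_state=0)
rf = RandomForestClassifier(n_estimators=100, oob_score=True,
                            random_state=0).fit(X, y)
print(1.0 - rf.oob_score_)   # out-of-bag error estimate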
How do random forests differ from standard decision trees?
They train many trees, each on a bootstrap sample with a random subset of features considered at each split, and aggregate the trees' predictions.
What is the main disadvantage of ensemble methods?
They are less interpretable compared to single decision trees.
What is feature importance in decision trees?
A measure of how much a feature contributes to making decisions.
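A sketch with scikit-learn on synthetic data, reading the impurity-based importances from a fitted tree:

# Feature importance sketch: one non-negative value per feature, summing to 1.
from sklearn.datasets import make_classification
from sklearn.tree import DecisionTreeClassifier

X, y = make_classification(n_samples=200, n_features=5, random_state=0)
tree = DecisionTreeClassifier(random_state=0).fit(X, y)
print(tree.feature_importances_)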
What is the takeaway from decision trees and ensemble learning?
Combining multiple decision trees using ensemble methods improves model robustness and accuracy.