Statistical Learning Methods Flashcards

1
Q

Bagging

A

A type of random forests that does the same thing but uses all features at each split because m = p

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Random Forests

A

using x number of bootstrapped training sets to construct x number of regression trees using a subset of features at each split then averaging the results

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Logistic Regression

A

Supervised machine learning algorithm widely used for binary classification tests

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Boosting

A

Boosting improves machine models’ predictive accuracy and performance by converting multiple weak learners into a single strong learning model

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Weak learners

A

Models with low prediction accuracy, similar to random guessing

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Strong learners

A

Models with high(er) prediction accuracy

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

hierarchal clustering

A

Unsupervised learning method for clustering data observations. Clusters are built by measuring dissimilarity between data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

K means clustering

A

Divides observations into K number of clusters, each observation being assigned to the closest centroid of a cluster

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Ridge regression

A

Method of estimating the coefficients of multiple regression models in scenarios where predictors are highly correlated.

Shrinks coefficients but does not force it to 0, and therefore does not perform variable selection.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Lasso

A

Regression analysis method that performs both variable selection and regularisation to enhance prediction accuracy and interpretability

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Regularisation

A

A set of methods for overfitted models to increase generalisability

How well did you know this?
1
Not at all
2
3
4
5
Perfectly