Feature Selection Flashcards

1
Q

The process of identifying small, but informative feature set.

A

Feature selection

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Three categories of feature selection methods.

A

Filters, embedded methods and wrapper methods.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Feature selection method in which each feature is scored with their impact on the target variable and subsequently the ‘best’ ones are selected.

A

Filter method

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

In this feature selection method, feature selection is part of a ML model’s internal model fitting procedure.

A

Embedded method

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

This feature selection method is based on a subset selection procedure, which is based on a loop around the ML model (similar to HPO).

A

Wrapper method

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

(True or false) Some filter methods also possess hyperparameters.

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Three learners that can also compute feature scores internally.

A

ranger, rpart and xgboost

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Likely the most flexible, though also most expensive approach

A

Wrapper approach

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

(True or false) The automated feature selector can be used like any other learner.

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

(True or false) Feature selection can improve a model’s performance, but there is no guarantee.

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Desirable properties of a feature for feature selection

A

Relevant and non-redundant

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

A filter method that works on the basis of mutual information 𝐼(𝑥; 𝑦) between two (discrete) random variables 𝑋, 𝑌 with distributions 𝑝𝑋, 𝑝𝑌 and 𝑝𝑋,𝑌

A

mRMR (minimum redundancy, maximal relevancy)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

The two types of sequential feature selection

A

Forward and backward

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Permuted versions of the original variables/features (across observations)

A

Shadow variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

(True or false) Shadow variables are uncorrelated to the target.

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

An all-relevant feature selector, builds upon random forest (permutation) importance and shadow variables.

A

Boruta