Dimension Reduction Flashcards
Reasons for dimension reduction
- Computational Cost
- Financial Cost
- Interpretability
Explain the Filter strategy for feature selection
It's a pre-processing step that ranks and filters features independently of the choice of classifier. It assigns a score to each candidate feature or feature subset using an evaluation function of choice (e.g. mutual information or a chi-squared statistic)
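A minimal sketch of a filter in code, assuming scikit-learn (the demo dataset and k = 10 are purely illustrative): each feature is scored with mutual information, with no classifier involved, and the top scorers are kept.

```python
# Filter: score every feature against the target with mutual information,
# independently of any classifier, and keep the 10 highest-scoring ones.
from sklearn.datasets import load_breast_cancer
from sklearn.feature_selection import SelectKBest, mutual_info_classif

X, y = load_breast_cancer(return_X_y=True)
selector = SelectKBest(score_func=mutual_info_classif, k=10)
X_reduced = selector.fit_transform(X, y)
print(X.shape, "->", X_reduced.shape)  # (569, 30) -> (569, 10)
```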
What is a good strategy for selecting the top features from a Filter
Evaluate classifier performance on feature subsets of increasing size (the top 1, top 2, … features from the ranking) and keep the size at which performance stops improving
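A sketch of that strategy, assuming scikit-learn (the classifier and dataset are illustrative): rank the features once with the filter score, then cross-validate on the top-k features for growing k and keep the k where the curve levels off.

```python
# Rank features by mutual information, then evaluate the classifier on
# subsets of increasing size and watch where accuracy stops improving.
import numpy as np
from sklearn.datasets import load_breast_cancer
from sklearn.feature_selection import mutual_info_classif
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score

X, y = load_breast_cancer(return_X_y=True)
ranking = np.argsort(mutual_info_classif(X, y))[::-1]  # best feature first
for k in range(1, X.shape[1] + 1):
    score = cross_val_score(LogisticRegression(max_iter=5000),
                            X[:, ranking[:k]], y, cv=5).mean()
    print(f"top {k:2d} features: CV accuracy = {score:.3f}")
```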
Disadvantages of Filters
- No model bias: the scores don't account for how suitable a feature is for the specific classifier that will actually be used
- No feature dependencies: features are scored individually, so interactions between features are missed
Explain the Wrapper strategy for feature selection
The classifier is “wrapped” in the feature selection mechanism. Feature subsets are evaluated directly based on their performance when used with that specific classifier
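The core of a wrapper in one sketch, assuming scikit-learn: a candidate subset is scored by the chosen classifier's own cross-validated performance rather than by a proxy statistic. The subset [0, 3, 7] is a hypothetical candidate.

```python
# Wrapper evaluation: the score of a subset IS the classifier's CV score.
from sklearn.datasets import load_breast_cancer
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score

X, y = load_breast_cancer(return_X_y=True)
subset = [0, 3, 7]  # hypothetical candidate subset
score = cross_val_score(LogisticRegression(max_iter=5000),
                        X[:, subset], y, cv=5).mean()
print(f"wrapper score for {subset}: {score:.3f}")
```

A search procedure (see the sequential searches below) then proposes subsets and keeps whichever scores best.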
Advantages of Wrappers
- Accounts for the classifier's bias, since subsets are evaluated with the model that will actually be used
- Considers features in context, capturing feature dependencies
List the types of search used in feature subset search
- Exhaustive: evaluate all 2^d subsets (sketched after this list)
- Exponential with heuristic pruning (e.g. branch and bound)
- Sequential: add or remove one feature at a time
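The exhaustive case, sketched under the assumption of a scikit-learn classifier and only 6 features, since the 2^6 − 1 = 63 subsets are affordable but 2^d quickly is not:

```python
# Exhaustive search: evaluate every non-empty subset, keep the best.
from itertools import combinations

from sklearn.datasets import load_breast_cancer
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score

X, y = load_breast_cancer(return_X_y=True)
X = X[:, :6]  # restrict to 6 features so the search stays cheap

def score(subset):
    return cross_val_score(LogisticRegression(max_iter=5000),
                           X[:, list(subset)], y, cv=3).mean()

all_subsets = (s for r in range(1, X.shape[1] + 1)
               for s in combinations(range(X.shape[1]), r))
best = max(all_subsets, key=score)
print("best subset:", best, "score:", round(score(best), 3))
```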
Describe the steps of Forward Sequential search
- Start with an empty subset
- Find the most informative remaining feature and add it to the subset
- Repeat until there is no improvement from adding features (a sketch follows)
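A hand-rolled sketch of those steps, assuming scikit-learn (classifier and dataset are illustrative):

```python
# Forward Sequential Search: start empty, greedily add the feature that
# most improves CV accuracy, stop when no addition helps.
from sklearn.datasets import load_breast_cancer
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score

X, y = load_breast_cancer(return_X_y=True)
clf = LogisticRegression(max_iter=5000)

def cv_score(features):
    return cross_val_score(clf, X[:, features], y, cv=5).mean()

selected, best_score = [], 0.0
while True:
    candidates = [f for f in range(X.shape[1]) if f not in selected]
    if not candidates:
        break
    top_score, top_f = max((cv_score(selected + [f]), f) for f in candidates)
    if top_score <= best_score:  # no improvement: stop
        break
    selected.append(top_f)
    best_score = top_score
print("selected:", selected, "CV accuracy:", round(best_score, 3))
```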
Describe the steps of Backward Elimination
- Start with the complete set of features
- Remove the least informative feature
- Repeat until there is no improvement from dropping features (a library sketch follows)
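Both directions are also available off the shelf: scikit-learn's SequentialFeatureSelector (0.24+) implements forward selection and backward elimination. Fixing n_features_to_select is a simplification here; in recent versions the "stop when no improvement" rule corresponds to n_features_to_select='auto' with a tol threshold.

```python
# Backward elimination via scikit-learn: start from all 30 features and
# drop them one at a time, keeping the 10 that survive.
from sklearn.datasets import load_breast_cancer
from sklearn.feature_selection import SequentialFeatureSelector
from sklearn.linear_model import LogisticRegression

X, y = load_breast_cancer(return_X_y=True)
sfs = SequentialFeatureSelector(LogisticRegression(max_iter=5000),
                                n_features_to_select=10,
                                direction="backward", cv=5)
sfs.fit(X, y)
print("kept features:", sfs.get_support(indices=True))
```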
Compare Forward Sequential Search (FSS) to Backward Elimination (BE)
- FSS typically requires less running time, since it can stop early with a small subset
- BE tends to find better models and can find subsets with interacting features, but tends to be slower
Disadvantages of Wrappers
- Computational cost
- Risk of overfitting
What is the general idea of projection methods
They are used in feature transformation to map the original d-dimensional space to a new k-dimensional space (k < d) with the minimum loss of information
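The mechanics in a minimal numpy sketch: a d × k matrix W maps every sample from d to k dimensions. W is random here just to show the mapping; methods such as PCA choose W so that as little information as possible is lost.

```python
# Linear projection: each 10-dimensional sample becomes a 3-dimensional one.
import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(size=(100, 10))      # 100 samples, d = 10
W = rng.normal(size=(10, 3))        # projection matrix, k = 3
X_proj = X @ W
print(X.shape, "->", X_proj.shape)  # (100, 10) -> (100, 3)
```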
What is Principal Component Analysis (PCA)
An unsupervised projection method which aims to keep as much of the variance in the data as possible
What are principal components in PCA
New dimensions, constructed (from eigenvectors of the data's covariance matrix) as linear combinations of the original features, which are uncorrelated with one another.
The first PC accounts for the most variability in the data, the second for the next most, and so on…
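A short PCA sketch, assuming scikit-learn (dataset illustrative): explained_variance_ratio_ shows exactly that ordering, with the first PC accounting for the largest share of the variance.

```python
# Fit PCA and inspect how much variance each principal component explains.
from sklearn.datasets import load_breast_cancer
from sklearn.decomposition import PCA
from sklearn.preprocessing import StandardScaler

X, _ = load_breast_cancer(return_X_y=True)
X = StandardScaler().fit_transform(X)  # PCA is sensitive to feature scale
pca = PCA(n_components=3).fit(X)
print(pca.explained_variance_ratio_)   # decreasing shares of total variance
```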
Give context as to what eigenvectors and eigenvalues are
Given a square matrix X, an eigenvector of the matrix is a non-zero vector v that satisfies the equation Xv = λv, where the scalar λ is the corresponding eigenvalue.
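A quick numeric check of the definition with numpy (the 2 × 2 matrix is arbitrary; eigh is used because covariance matrices are symmetric). In PCA, the eigenvectors of the covariance matrix are the principal components and the eigenvalues are the variances along them.

```python
# Verify Xv = λv for the eigenvector with the largest eigenvalue.
import numpy as np

A = np.array([[2.0, 1.0],
              [1.0, 2.0]])            # symmetric, like a covariance matrix
eigvals, eigvecs = np.linalg.eigh(A)  # eigh returns ascending eigenvalues
v, lam = eigvecs[:, -1], eigvals[-1]  # take the largest
print(A @ v)                          # [2.121..., 2.121...]
print(lam * v)                        # the same vector: Av = λv
```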