Topic 6: Ensemble Theory Flashcards
Bias-Variance-Diversity decomposition
Expected risk (ensemble) = noise + bias + variance - diversity
What is Diversity
E_Sn [ 1/m Σ_{i=1}^m ( f_i(x) - f̄(x) )² ]
Difference between models
diverse models make different errors on new data points
compares the prediction f_i(x) made by the ith model with the average f̄(x) over all models
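A minimal numpy sketch of the diversity term above, using made-up member predictions (the values are assumptions for illustration): per test point, diversity is the average squared deviation of each member's prediction from the ensemble mean.

```python
import numpy as np

# Hypothetical member predictions f_i(x) for m = 3 models on 4 test points
preds = np.array([
    [2.0, 1.0, 0.0, 3.0],   # f_1(x)
    [2.5, 0.5, 1.0, 2.0],   # f_2(x)
    [1.5, 1.5, 2.0, 4.0],   # f_3(x)
])

f_bar = preds.mean(axis=0)                       # ensemble (arithmetic-mean) prediction f̄(x)
diversity = ((preds - f_bar) ** 2).mean(axis=0)  # 1/m Σ_i (f_i(x) - f̄(x))² per point

print(f_bar)       # → [2. 1. 1. 3.]
print(diversity)   # higher where members disagree more
```

Note the last two test points get higher diversity: the members disagree more there, so their errors are more likely to cancel.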
How does diversity help in ensembles
computational efficiency
robustness against adversarial attacks
improved performance in various applications
What is the centroid q◦ (the term marked with a small circle)
Can refer to any of: arithmetic mean, harmonic mean, etc
Represents the centre of the model distribution
It is the average over all possible training data sets (an infinite collection)
Generalised Bias-variance decomposition
E_D [ E_XY [ ℓ(Y, q) ] ] = E_X [ E_Y|X [ ℓ(Y, Y*) ] ] + ℓ(Y*, q◦) + E_D [ ℓ(q◦, q) ]
Expected risk = noise + bias + variance
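For squared loss the decomposition can be checked exactly. A sketch with made-up finite stand-ins for the target distribution and the dataset-dependent models (all values are assumptions for illustration); for squared loss the centroid q◦ is the arithmetic mean of the models:

```python
import numpy as np

sq = lambda a, b: (a - b) ** 2

# Finite stand-ins for the distributions over targets Y and dataset-dependent models q
y_samples = np.array([1.0, 2.0, 3.0])        # targets; Y* is their mean (Bayes prediction)
q_samples = np.array([1.5, 2.5, 1.0, 3.0])   # one prediction per hypothetical dataset D

y_star = y_samples.mean()                    # Y* minimises expected squared loss
q_centroid = q_samples.mean()                # centroid q◦ = arithmetic mean for squared loss

risk = sq(y_samples[:, None], q_samples[None, :]).mean()  # E_D E_Y ℓ(Y, q)
noise = sq(y_samples, y_star).mean()                      # E_Y ℓ(Y, Y*)
bias = sq(y_star, q_centroid)                             # ℓ(Y*, q◦)
variance = sq(q_centroid, q_samples).mean()               # E_D ℓ(q◦, q)

print(risk, noise + bias + variance)  # the two sides agree exactly
```

The identity holds exactly here (not just approximately) because Y* and q◦ are the respective sample means, so all cross terms in the expansion of (y − q)² vanish.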
Generalised ambiguity decomposition
ℓ(y, q¯) = 1/m Σ ℓ(y, qi) - 1/m Σ ℓ(q¯, qi)
Ensemble loss = average loss - ambiguity
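The ambiguity identity can also be verified numerically for squared loss, where the combiner q¯ is the arithmetic mean. A sketch with made-up member predictions (the values are assumptions for illustration):

```python
import numpy as np

sq = lambda a, b: (a - b) ** 2

y = 2.0
q = np.array([1.0, 2.5, 3.5])   # hypothetical member predictions q_i
q_bar = q.mean()                # centroid combiner q¯ for squared loss

ensemble_loss = sq(y, q_bar)    # ℓ(y, q¯)
avg_loss = sq(y, q).mean()      # 1/m Σ ℓ(y, q_i)
ambiguity = sq(q_bar, q).mean() # 1/m Σ ℓ(q¯, q_i)

print(ensemble_loss, avg_loss - ambiguity)  # identical for squared loss
```

Since ambiguity is non-negative, the ensemble loss is never worse than the average member loss under the centroid combiner.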
What is q¯
The ensemble combination
Eg for squared loss → arithmetic mean
for KL divergence → normalised geometric mean
How does regularisation affect variance and bias
It moves models around in the bias/variance axes
It can decrease variance and may increase bias
Eg Linear + reg = lower var, higher bias
Larger vs smaller networks and diversity
Larger networks tend to perform better than smaller ones due to lower bias and lower variance, despite potentially lower diversity (they capture more complexity and can overfit)
How does diversity interact with bagging
Random Forests initially underperform Bagging but catch up as the ensemble size increases
Random Forests exhibit higher variance-effect but compensate with higher diversity-effect in larger ensembles
What is the centroid combiner for poisson regression loss
geometric mean
What is the centroid combiner for KL divergence
normalised geometric mean
What is the centroid combiner for Itakura-Saito loss
harmonic mean
What is the centroid combiner for squared loss
arithmetic mean
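The four combiners on the cards above can be written as one-liners. A sketch with made-up member outputs (the sample values and function names are assumptions for illustration); the normalised geometric mean takes one probability distribution per member and renormalises their element-wise geometric mean:

```python
import numpy as np

def arithmetic_mean(q):            # centroid for squared loss
    return np.mean(q)

def geometric_mean(q):             # centroid for Poisson regression loss
    return np.exp(np.mean(np.log(q)))

def harmonic_mean(q):              # centroid for Itakura-Saito loss
    return 1.0 / np.mean(1.0 / q)

def normalised_geometric_mean(P):  # centroid for KL divergence; P: m x k member distributions
    g = np.exp(np.mean(np.log(P), axis=0))  # element-wise geometric mean over members
    return g / g.sum()                      # renormalise to a probability distribution

q = np.array([1.0, 2.0, 4.0])
print(arithmetic_mean(q), geometric_mean(q), harmonic_mean(q))  # 2.33…, 2.0, 1.71…

P = np.array([[0.6, 0.4],   # member 1's predicted distribution
              [0.2, 0.8]])  # member 2's predicted distribution
print(normalised_geometric_mean(P))
```

Note the familiar ordering harmonic ≤ geometric ≤ arithmetic on positive inputs.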
single model vs ensemble model tradeoffs
In single models we have a 2-way tradeoff (bias/variance)
In ensembles of models, it's a 3-way tradeoff (bias/variance/diversity)
But it only holds if we use the centroid combiner rule