Lecture 7 Flashcards
What kind of distribution do real-valued EDAs require?
A practically useful one, such as the normal distribution.
What is ML? (in EDA)
Maximum Likelihood
What is the limitation of ML?
It can only model linear-like dependencies between variables.
What is the difference between ES and a normal-based EDA?
ES uses normal distributions for self-adaptation of its models: the model updates implicitly through selection and random mutation.
A normal-based EDA explicitly couples the population to the model-update rules by estimating the model from the selected (improving) solutions.
When can the direct use of ML-normal in an EDA give a positive result?
- The function is unimodal (one peak)
- The function is centered at the origin
- It is easy to converge towards the minimum
What is the big downside of using direct ML-normal in an EDA?
- The structure of the solution space is often very complicated and hardly matches a normal distribution
- Improving directions are ignored in MLE
- The EDA does not observe the direction in which the population is improving
- Hence there is no exploration outside of the data range, so the real optimum can easily be missed
What would we expect to observe over multiple generations of direct ML-normal in an EDA?
The algorithm tries to find the distribution that best fits the observed data, rather than the best solution in the solution space (a skewed initial sample stays skewed).
Explain the premature convergence problem
With direct ML-normal estimation in a real-valued EDA, the variance of the estimated normal distribution converges to 0 very fast, before the search space has been explored.
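A minimal sketch of this behaviour (Python/NumPy is an assumption; the sphere function, population size and selection size are illustrative choices, not from the lecture): the loop repeatedly fits a normal by ML to the selected solutions and resamples from it, and the printed covariance trace shrinks every generation.

```python
import numpy as np

rng = np.random.default_rng(0)

def sphere(x):
    # Simple unimodal test function (illustrative assumption, not from the slides).
    return np.sum(x**2, axis=1)

dim, pop_size, n_select, n_gens = 2, 100, 30, 30
pop = rng.uniform(-5, 5, size=(pop_size, dim))  # initial population

for gen in range(n_gens):
    # Truncation selection: keep the best solutions.
    selected = pop[np.argsort(sphere(pop))[:n_select]]

    # Maximum-likelihood estimation of the normal model:
    # sample mean and sample covariance of the selected solutions.
    mu = selected.mean(axis=0)
    cov = np.cov(selected, rowvar=False)

    # Sample the next population from the estimated normal.
    pop = rng.multivariate_normal(mu, cov, size=pop_size)

    # The covariance trace shrinks rapidly every generation; on harder functions
    # this collapse happens before the optimum region has been found.
    print(gen, np.trace(cov))
```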
Why is Gradient Hybridization not a great solution for real-valued EDAs?
It requires gradient information, which is not always available for complex problems, and not always reliable.
What are the three ingredients for Adaptive ML estimation?
- Adaptive Variance Scaling (AVS)
- Standard Deviation Ratio (SDR)
- Anticipated Mean Shift (AMS)
What is AVS?
Adaptive Variance Scaling
What is SDR?
Standard Deviation Ratio
What is AMS?
Anticipated Mean Shift
In SDR-AVS, what is the NIC counter?
With multiple local optima in the problem, SDR-AVS can take too long to converge to one of them; the counter therefore limits the adaptation of the variance of the estimated distribution.
Explain what the distribution multiplier does for SDR-AVS.
It enlarges the variance of the distribution from which new samples are drawn.
What are the two reasons that no improvement was found in SDR-AVS?
Either the variance is too large and the EDA took samples of worse solutions.
Or the distribution covers multiple local optima, in which case randomly sampled solutions between these optima can worsen the average.
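A simplified sketch of the distribution-multiplier update described in the previous cards (Python/NumPy assumed; the constants, the Mahalanobis-style SDR computation and the omission of the NIC counter are all simplifying assumptions, not the lecture's exact definitions):

```python
import numpy as np

ETA_DEC = 0.9             # shrink factor when no improvement is found (assumed value)
ETA_INC = 1.0 / ETA_DEC   # enlarge factor when improvements lie far from the mean
THETA_SDR = 1.0           # "more than one standard deviation away" threshold

def update_multiplier(c_mult, improvements, mu, cov):
    """Return the new multiplier; new samples are drawn from N(mu, c_mult * cov)."""
    if len(improvements) == 0:
        # No improvement found: either the variance is too large, or the model
        # covers multiple local optima. Shrink the sampling variance (never
        # below the plain ML estimate in this sketch).
        return max(1.0, c_mult * ETA_DEC)

    # Standard Deviation Ratio: distance of the average improvement from the
    # current mean, measured in standard deviations of the model.
    d = np.mean(improvements, axis=0) - mu
    sdr = float(np.sqrt(d @ np.linalg.solve(cov, d)))

    # Improvements lie far from the mean: enlarge the variance to keep exploring.
    return c_mult * ETA_INC if sdr > THETA_SDR else c_mult
```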
What is the shortcoming of SDR-AVS?
It solves searching in 1D space, but not in 2D space.
What is the idea of AMS?
Anticipate where the mean will be shifting, then alter part of the generated solutions using that shift.
Predictions on a slope will be better, but we require balanced selection to re-align the covariance matrix.
How is the shift in AMS calculated?
μ_shift(t) = μ(t) − μ(t − 1),
such that x ← x + δ · μ_shift(t),
where t is the generation number, μ(t) the estimated mean at generation t, and the factor δ controls the impact of the shift.
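A small sketch of how the anticipated mean shift could be applied to part of the freshly sampled solutions (Python/NumPy assumed; the shifted fraction and the value of δ are illustrative assumptions):

```python
import numpy as np

def apply_ams(new_samples, mu_t, mu_prev, delta=2.0, fraction=0.5):
    """Shift a fraction of the newly sampled solutions along the anticipated mean shift."""
    mu_shift = mu_t - mu_prev              # mu_shift(t) = mu(t) - mu(t-1)
    shifted = new_samples.copy()
    n_shift = int(fraction * len(shifted))
    shifted[:n_shift] += delta * mu_shift  # x <- x + delta * mu_shift(t)
    return shifted
```

On a slope the shift keeps pushing part of the samples downhill; on a peak μ(t) ≈ μ(t − 1), so the applied shift is roughly zero (see the next card).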
What behaviour does AMS have on a peak?
No change, as μ(t) ≈ μ(t − 1), so the anticipated shift is approximately zero.
Why is AMS still not an optimal approach for real-valued EDAs?
Because choosing the right parameters is finicky and still requires a lot of attention.
What does AMaLGaM stand for?
Adapted Maximum-Likelihood Gaussian Model
What is the idea of AMaLGaM?
It uses the variance-scaling approach from SDR-AVS to widen the search, and the mean shift from AMS to move solutions down slopes faster.
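A compact sketch of one AMaLGaM-style generation combining the two ideas (Python/NumPy assumed; the selection size, δ, the shifted fraction and the handling of the multiplier are illustrative assumptions, not the lecture's exact settings):

```python
import numpy as np

def amalgam_generation(f, pop, mu_prev, c_mult, rng, n_select=30, delta=2.0, frac=0.5):
    # Truncation selection: keep the best solutions of the current population.
    selected = pop[np.argsort(f(pop))[:n_select]]

    # Maximum-likelihood estimation of the normal model.
    mu = selected.mean(axis=0)
    cov = np.cov(selected, rowvar=False)

    # Sample from the variance-scaled model; c_mult is maintained by an
    # SDR-AVS-style multiplier update (sketched in an earlier card).
    new_pop = rng.multivariate_normal(mu, c_mult * cov, size=len(pop))

    # Shift part of the new samples along the anticipated mean shift (AMS idea).
    new_pop[: int(frac * len(new_pop))] += delta * (mu - mu_prev)
    return new_pop, mu
```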