The linkage tree (LT) FOS is a hierarchical structure The group of all variables is in there For any subset 𝑭𝑖 with more than one variable, there .are subsets 𝑭𝑗 and 𝑭𝑘 such that: 𝑭𝑗 ∩ 𝑭𝑘 = ∅, 𝑭𝑗 < 𝑭𝑖 , 𝑭𝑘 < 𝑭𝑖 , 𝑎𝑛𝑑 𝑭𝑗 ∪ 𝑭𝑘 = 𝑭𝑖

Each MP structure is scored. Choose probability distribution with the best score We then use greedy search, because prob. model is not our goal so using greedy is plausible Then use standard MPM: Start joining univariate variables until no improvement is found

Lecture 8 Flashcards by Sven Dukker

What does FOS stand for?

Family of Subsets

How well did you know this?

Not at all

Perfectly

What is the definition of a FOS?

FOS ℱ is a subset of powerset of 𝑆, i.e.,
ℱ ⊆ ℘(𝑺) where ℱ = {𝑭0, 𝑭1, … }

Where the powerset is the set of all possible subsets, including the full and null set. And 𝑺 is the set of indices of the solution variables

How well did you know this?

Not at all

Perfectly

What is the linkage set?

A FOS in which every variable of the genotype is in at least one FOS subset.

How well did you know this?

Not at all

Perfectly

What is the univariate FOS?

Every variable is in its own individual FOS subset; independent from other other variable

How well did you know this?

Not at all

Perfectly

What is the MP FOS?

The Marginal Product FOS is a FOS where every variable is in only one FOS subset.

Such that every FOS subset is independent from each other.

How well did you know this?

Not at all

Perfectly

What is a LT FOS?

The linkage tree (LT) FOS is a hierarchical structure
The group of all variables is in there
For any subset 𝑭𝑖 with more than one variable, there
.are subsets 𝑭𝑗 and 𝑭𝑘 such that:
𝑭𝑗 ∩ 𝑭𝑘 = ∅,
𝑭𝑗 < 𝑭𝑖 ,
𝑭𝑘 < 𝑭𝑖 ,
𝑎𝑛𝑑 𝑭𝑗 ∪ 𝑭𝑘 = 𝑭𝑖

How well did you know this?

Not at all

Perfectly

How does FOS do variation?

Recombination of two solutions via crossover.

How well did you know this?

Not at all

Perfectly

For which type of FOS does crossover not make sense and why?

LT FOS; The hierarchical structure of LT makes it so that the crossovers that happened in the lower tree levels are overwritten by the larger subsets. This makes the crossover very complex to understand and therefor not useful in practise. Also large probability the tree is no longer valid because we dont evaluate during crossover.

How well did you know this?

Not at all

Perfectly

How can we implement FOS subsets for EDA?

We have a probability table for each subset in the FOS.

How well did you know this?

Not at all

Perfectly

Why is the size of the probability table of the FOS subsets for EDA equal to 2 ^|𝑭𝑖|− 1 ?

The probability of each combination of FOS subsets must be represented (hence binary part). The last probability can be calculated using 1 - sum, so it does not need to be stored.

How well did you know this?

Not at all

Perfectly

How is the probability table for FOS-based EDA determined?

Maximum Likelihood (ML) aka frequency counting.

How well did you know this?

Not at all

Perfectly

Consider univariate FOS in GA and EDA.
Is there any difference?

Variables treated completely independently only in EDA. Whereas for GA it depends which solutions are the parents of the offspring. This introduces weak dependencies between the variables and the variation operator.

How well did you know this?

Not at all

Perfectly

What does ECGA stand for?

Extended Compact Genetic Algorithm

How well did you know this?

Not at all

Perfectly

What kind of EA is ECGA an example of?

MP FOS learning

How well did you know this?

Not at all

Perfectly

How does ECGA work?

Each MP structure is scored.
Choose probability distribution with the best score
We then use greedy search, because prob. model is not our goal so using greedy is plausible
Then use standard MPM:
Start joining univariate variables until no improvement is found

How well did you know this?

Not at all

Perfectly

What is MDL?

Study These Flashcards

Minimum Discription Length:
A measure of complexity.

What are the two subcomponents of MDL?

Study These Flashcards

Compressed population complexity: how good is the prob. distribution est.
Model complexity: number of bits required to store all parameters of the model

Do we want to maximize or minimize MDL?

Study These Flashcards

Minimize

Explain why LT FOS is both dependent and independent

Study These Flashcards

The Linking Tree acts as a path through dependence space, from univariate to joint.

What does OMEA stand for?

Study These Flashcards

Optimal Mixing EA

What is the main characteristic of OMEA?

Study These Flashcards

It uses intermediate function evaluations inside variation operator

What is ROM?

Study These Flashcards

Recombinative OM: GA-like where you select a single solution to perform OM with

What is GOM

Study These Flashcards

Gene-pool OM: EDA-like where you select a new solution for each substructure in OM.

Does OMEA ensure elitism?

Study These Flashcards

Yes, because it only improves solution

Explain the psuedo code for ROMEA

You randomly select two parents, p0 and p1. Then you make two copies who act as the offspring: o0 and o1. For each FOS, you crossover o1 and o2 (for that FOS) and see if the fitness of o0 improved. If o0 improved, keep the change otherwise revert. At the end, only return o0.

Explain the psuedo code for GOMEA

You first create a single offspring from a single parent (copy). Then you iterate over the amount of FOS/linkage sets. For every FOS you randomly pick a new parent and crossover for that specific FOS. If there was an improvement on the offspring, keep the change.

Why do OMEAs require much smaller populations?

In order to sample every possible genotype you must sample 𝑛 ≥ 2^ℓ. However for OMEA, we only need to have sampled every block/FOS once, which is 𝑛 ≥ 2^𝑘. Where k is block length.

Give an example of why choosing a population size of 2 will not be able to find the best solution. (OneMAx)

Due to the crossover operator and the selection process, it could happen that f.i. the 1 one a specific variable gets lost. This is called a **Unrecoverable problem** !!! This is solved using GOMEA

What are the four parallel models for Evo Computation (EC)

1. ‘Embarrassingly’ parallel 2. Parallel evaluation of the population 3. Parallel evaluation of single solution 4. Parallel Island model

Describe 'embarrassingly' parallel

Perform N runs in paralles, which will result in N outcomes. Then you can do one final evaluation of the N outcomes to reduce it to one coutcome.

Describe Parallel evaluation of the population.

Generally the fitness function requires the most computation power, hence it makes sence to speed this up using parallelism.

What is required fo parallel evaluation of single solution?

Requires knowledge about optimization problem (e.g. GBO setting)

Explain the general Parallel Island Model.

Where there are multiple EAs working simulteneously, that migrate parts of the population to other EAs at a certain point.

When is Parallel Island Model Homogeneous?

Each of N islands is the same EA with identical parameters.

When is Parallel Island Model Heterogeneous?

Each of N islands my be a different EA with different parameters.

How does the migration size (q) influence our diversity and convergence rate?

* High q: Islands converge to same solution; large communication cost. * Low q: Slower convergence; more diversity.

Where does VIG stand for

Variable Interaction Graph

Lecture 8 Flashcards

(37 cards)