Lecture 8 Flashcards

1
Q

What does FOS stand for?

A

Family of Subsets

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is the definition of a FOS?

A

FOS β„± is a subset of powerset of 𝑆, i.e.,
β„± βŠ† β„˜(𝑺) where β„± = {𝑭0, 𝑭1, … }

Where the powerset is the set of all possible subsets, including the full and null set. And 𝑺 is the set of indices of the solution variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is the linkage set?

A

A FOS in which every variable of the genotype is in at least one FOS subset.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is the univariate FOS?

A

Every variable is in its own individual FOS subset; independent from other other variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is the MP FOS?

A

The Marginal Product FOS is a FOS where every variable is in only one FOS subset.

Such that every FOS subset is independent from each other.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is a LT FOS?

A
  • The linkage tree (LT) FOS is a hierarchical structure
  • The group of all variables is in there
  • For any subset 𝑭𝑖 with more than one variable, there
    .are subsets 𝑭𝑗 and π‘­π‘˜ such that:
    𝑭𝑗 ∩ π‘­π‘˜ = βˆ…,
    𝑭𝑗 < 𝑭𝑖 ,
    π‘­π‘˜ < 𝑭𝑖 ,
    π‘Žπ‘›π‘‘ 𝑭𝑗 βˆͺ π‘­π‘˜ = 𝑭𝑖
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

How does FOS do variation?

A

Recombination of two solutions via crossover.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

For which type of FOS does crossover not make sense and why?

A

LT FOS; The hierarchical structure of LT makes it so that the crossovers that happened in the lower tree levels are overwritten by the larger subsets. This makes the crossover very complex to understand and therefor not useful in practise. Also large probability the tree is no longer valid because we dont evaluate during crossover.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

How can we implement FOS subsets for EDA?

A

We have a probability table for each subset in the FOS.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Why is the size of the probability table of the FOS subsets for EDA equal to 2 ^|𝑭𝑖|βˆ’ 1 ?

A

The probability of each combination of FOS subsets must be represented (hence binary part). The last probability can be calculated using 1 - sum, so it does not need to be stored.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

How is the probability table for FOS-based EDA determined?

A

Maximum Likelihood (ML) aka frequency counting.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Consider univariate FOS in GA and EDA.
Is there any difference?

A

Variables treated completely independently only in EDA. Whereas for GA it depends which solutions are the parents of the offspring. This introduces weak dependencies between the variables and the variation operator.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What does ECGA stand for?

A

Extended Compact Genetic Algorithm

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What kind of EA is ECGA an example of?

A

MP FOS learning

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

How does ECGA work?

A
  • Each MP structure is scored.
  • Choose probability distribution with the best score
  • We then use greedy search, because prob. model is not our goal so using greedy is plausible
  • Then use standard MPM:
  • Start joining univariate variables until no improvement is found
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is MDL?

A

Minimum Discription Length:
A measure of complexity.

17
Q

What are the two subcomponents of MDL?

A
  1. Compressed population complexity: how good is the prob. distribution est.
  2. Model complexity: number of bits required to store all parameters of the model
18
Q

Do we want to maximize or minimize MDL?

A

Minimize

19
Q

Explain why LT FOS is both dependent and independent

A

The Linking Tree acts as a path through dependence space, from univariate to joint.

20
Q

What does OMEA stand for?

A

Optimal Mixing EA

21
Q

What is the main characteristic of OMEA?

A

It uses intermediate function evaluations inside variation operator

22
Q

What is ROM?

A

Recombinative OM: GA-like where you select a single solution to perform OM with

23
Q

What is GOM

A

Gene-pool OM: EDA-like where you select a new solution for each substructure in OM.

24
Q

Does OMEA ensure elitism?

A

Yes, because it only improves solution

25
Q

Explain the psuedo code for ROMEA

A

You randomly select two parents, p0 and p1.
Then you make two copies who act as the offspring: o0 and o1.
For each FOS, you crossover o1 and o2 (for that FOS) and see if the fitness of o0 improved.
If o0 improved, keep the change otherwise revert.
At the end, only return o0.

26
Q

Explain the psuedo code for GOMEA

A

You first create a single offspring from a single parent (copy).
Then you iterate over the amount of FOS/linkage sets.
For every FOS you randomly pick a new parent and crossover for that specific FOS.
If there was an improvement on the offspring, keep the change.

27
Q

Why do OMEAs require much smaller populations?

A

In order to sample every possible genotype you must sample 𝑛 β‰₯ 2^β„“.
However for OMEA, we only need to have sampled every block/FOS once, which is 𝑛 β‰₯ 2^π‘˜.

Where k is block length.

28
Q

Give an example of why choosing a population size of 2 will not be able to find the best solution. (OneMAx)

A

Due to the crossover operator and the selection process, it could happen that f.i. the 1 one a specific variable gets lost.
This is called a Unrecoverable problem !!! This is solved using GOMEA

29
Q

What are the four parallel models for Evo Computation (EC)

A
  1. β€˜Embarrassingly’ parallel
  2. Parallel evaluation of the population
  3. Parallel evaluation of single solution
  4. Parallel Island model
30
Q

Describe β€˜embarrassingly’ parallel

A

Perform N runs in paralles, which will result in N outcomes.
Then you can do one final evaluation of the N outcomes to reduce it to one coutcome.

31
Q

Describe Parallel evaluation of the population.

A

Generally the fitness function requires the most computation power, hence it makes sence to speed this up using parallelism.

32
Q

What is required fo parallel evaluation of single solution?

A

Requires knowledge about optimization problem (e.g. GBO setting)

33
Q

Explain the general Parallel Island Model.

A

Where there are multiple EAs working simulteneously, that migrate parts of the population to other EAs at a certain point.

34
Q

When is Parallel Island Model Homogeneous?

A

Each of N islands is the
same EA with identical parameters.

35
Q

When is Parallel Island Model Heterogeneous?

A

Each of N islands my be a different EA with different parameters.

36
Q

How does the migration size (q) influence our diversity and convergence rate?

A
  • High q: Islands converge to same solution; large communication cost.
  • Low q: Slower convergence; more diversity.
37
Q

Where does VIG stand for

A

Variable Interaction Graph