Lecture 2 Flashcards

1
Q

What new idea is introduced in sGA compared to general EA format?

A

Parent selection before variation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What kind of Parent Selection does sGA have?

A

Proportionate selection:
Meaning that each parent has its own probability to be selected as a parent.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What scoring system is used to determine the parents in sGA?

A

The individuals with higher fitness have higher probability of being select as parent.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What are the three issues with proportionate selection in sGA?

A
  • Required: fitness maximization and non-negative fitness
  • Far-above average individuals quickly dominate population
  • Similar fitness (convergence) dissolves selection pressure
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What type of variations are used in sGA?

A

One point, two point or uniform- crossover.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

How is the next generation of population selected in sGA?

A

surivor selection: replace population with offspring

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

describe psudo code for sGA

A

1 𝑑 ← 0
2 𝑷𝑑← π‘π‘Ÿπ‘’π‘Žπ‘‘π‘’π΄π‘›π‘‘πΈπ‘£π‘Žπ‘™π‘’π‘Žπ‘‘π‘’πΌπ‘›π‘–π‘‘π‘–π‘Žπ‘™πΌπ‘›π‘‘π‘–π‘£π‘–π‘‘π‘’π‘Žπ‘™π‘  𝑛
3 π’˜π’‰π’Šπ’π’† π‘‘π‘’π‘Ÿπ‘šπ‘–π‘›π‘Žπ‘‘π‘–π‘œπ‘›πΆπ‘Ÿπ‘–π‘‘π‘’π‘Ÿπ‘–π‘œπ‘›π‘π‘œπ‘‘π‘†π‘Žπ‘‘π‘–π‘ π‘“π‘–π‘’π‘‘(𝑷𝑑) 𝒅𝒐
3.1 𝑺𝑑 ← π‘π‘Ÿπ‘œπ‘π‘œπ‘Ÿπ‘‘π‘–π‘œπ‘›π‘Žπ‘‘π‘’π‘†π‘’π‘™π‘’π‘π‘‘π‘–π‘œπ‘›π‘ƒπ‘Žπ‘Ÿπ‘’π‘›π‘‘π‘ (𝑷𝑑, 𝑛)
3.2 𝑢𝑑 ← βˆ…
3.3 𝝅 ← π‘Ÿπ‘Žπ‘›π‘‘π‘œπ‘šπ‘ƒπ‘’π‘Ÿπ‘šπ‘’π‘‘π‘Žπ‘‘π‘–π‘œπ‘›(𝑛)
3.4 𝒇𝒐𝒓 𝑖 ∈ {0,1, … , (𝑛/2) βˆ’ 1} 𝒅𝒐
3.4.1 𝑢𝑑 ← 𝑢𝑑 βˆͺ π‘π‘Ÿπ‘œπ‘ π‘ π‘œπ‘£π‘’π‘Ÿπ΄π‘›π‘‘π‘€π‘’π‘‘π‘Žπ‘‘π‘–π‘œπ‘›(𝑆𝝅2𝑖𝑑 , 𝑆𝝅2𝑖+1𝑑 , 𝑝𝑐, π‘π‘š)
3.5 π‘’π‘£π‘Žπ‘™π‘’π‘Žπ‘‘π‘’π΄π‘™π‘™π‘†π‘œπ‘™π‘’π‘‘π‘–π‘œπ‘›π‘ (𝑢𝑑)
3.6 𝑷𝑑+1 ← 𝑢𝑑
3.7 𝑑 ← 𝑑 + 1

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Why do we require a schema?

A

To investigate the dynamics of sGA so that we can determine if the GA will work well.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is the domain of an Schema?

A

Is a string (vector) of length l that contains 0,1 or ⋆

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What does a Schema represent?

A

Schema represents subset of all β„“-dimensional binary genotypes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is an other common name for Schema?

A

a similarity subset:
– Set of genotypes that are similar according to schema

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What does it mean in schemas when: schema 𝒉 iff 𝒙 ∈ 𝒉

A

That genotype/solution 𝒙 has matched with schema 𝒉

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is the general purpose of Schema Analysis?

A

– Observe change in average fitness of a schema
in a single generation
– Extrapolate result over multiple generations
– Gives a prediction of the behavior of the average
schema fitness

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

In Schema Analysis, what does πœ‡(𝒉, 𝑑) (mu) denote?

A

The number of matches of schema 𝒉 in generation t in population P^t.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

In Schema Analysis, what does πœ‘(𝒉, 𝑑) (phi) denote?

A

The population-based cumulative fitness of a schema:
The sum of the fitness values of all genotypes in 𝑷^t that matched with schema 𝒉 in generation t.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

How do we calculate population-based average schema fitness?

A

Cumulative fitness over number of matches for a certain schema:
HAT! πœ‘(𝒉, 𝑑) = πœ‘(𝒉, 𝑑) / πœ‡(𝒉, 𝑑)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

What does β˜† mean in Shema analysis?

A

A unique schema of length β„“, that only includes ⋆.
β˜† = ⋆⋆..⋆⋆

(Basically: it includes all solutions. )

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

How can you express proportionate selection probability using Schema Analysis?

A

The fitness of that Individual over the cumulative fitness of the total population.
p = fitness(P^t_i) / πœ‘(β˜†, 𝑑)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

In Schema Analysis, what does πœ“(𝒉, 𝑑) (psi) mean?

A

The probability of a selection being matched by 𝒉 is πœ“(𝒉, 𝑑). Which is denoted by :
πœ‘(𝒉, 𝑑) / πœ‘(β˜†, 𝑑)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

What is the expected number of copies of instances of schema 𝒉 in the next generation without variation? (equation)

A

𝑛 πœ“(𝒉, 𝑑) = πœ‡(𝒉, 𝑑) HAT πœ‘(𝒉,𝑑) / HAT πœ‘(β˜†,𝑑)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

What can you conclude in Schema Analysis if population-based average schema
fitness remains larger than the average population fitness?

A

then # matches of 𝒉
grows exponentially

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

How do we denote Cumulative Population fitness?

A

πœ‘(β˜†, 𝑑)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

How do we denote population-based average schema fitness?

A

HAT πœ‘(𝒉, 𝑑)

24
Q

How do we denote average
.population fitness

A

HAT πœ‘(β˜†, 𝑑)

25
Q

What does πœ–(𝒉) denote?

A

probability of schema disruption when applying variation.

26
Q

What does disruption mean in schema analysis?

A

at least one parent ** matches** a schema
but the offspring do not match that schema

27
Q

What does the schama growth equation tell us?

A

The expected number of copies of instances of schema 𝒉 in the next generation with variation!

28
Q

What is the schema growth equation?

A

𝛦[πœ‡(𝒉, 𝑑 + 1) β‰₯ (1 βˆ’ πœ–(𝒉)) * πœ‡(𝒉, 𝑑) * HATπœ‘(𝒉, t) / HAT πœ‘(β˜†, 𝑑)

29
Q

When the expected number of times a schema appears in
the population increases, what do we expect to happen to the probability of schema disruption?

A

It becomes as small as possble. Because we maximize the (1 βˆ’ πœ–(𝒉)) term in our formula.

30
Q

Why is the problem that the average population fitness catches up with the schema fitness, not really a problem?

A

By the time this happens, most genotypes have already converged to match that highly-fit schema.

31
Q

When does the probability of disruption increase in Schema Analysis?

A

– with defining length of schema under study (i.e., distance
between outer fixed bits)
– and with order of schema (number of fixed bits in schema)

32
Q

Why do we need multiple runs of the same algorithm?

A

The random initial population and the randomness in variationa and mutations means that this algorithm is stochastic. A singular run can also lead to local optimalisations, instead of the optimal solution.

33
Q

What kind of selection is Tournament selection?

A

Rank/dominate based

34
Q

What is the advantage of rank/domination based selection?

A

It removes the dependency on the actual fitness value.

35
Q

What does a Tournament Selection with replacement mean?

A

Where individuals can be parents to multiple offspring. Due to the fixed number of tournaments, this can mean that some individuals do not get the chance to create offspring.

36
Q

What does a Tournament Selection without replacement mean?

A

Every individual becomes a parent, before individuals are assigned a second time to become a parent. This makes sure all individuals in a population are concidered during selection.

37
Q

How often is a solution considered in tournament selection without replacement?

A

(sn)/N times.
Where s is the size of each tournament, n is the amount of resulting solutions, and N is the population set.

38
Q

How many tournaments are needed to select n solutions in a generation for Tournament Selection?

A

n solutions = n tournaments

You only select the best solution each time.

39
Q

Does high selection pressure result in faster or slower convergence?

A

faster

40
Q

Is it easy to exert high selection pressure in tournament selection?

A

Yes. By selecting only the best solution per tournament, it increases the pressure that only the best are selected.

41
Q

What is the formula for the Tournament Selection Takeover Time
?

A

t β‰ˆ log(𝑛 log(𝑛)) /
log(𝑠)
where 𝑛 is the number of tournaments per generation (often equal to pop size)
and 𝑠 is tournament-size

42
Q

Why is population size almost irrelevant for the efficiency in tournament selection?

A

The takeover time scales logrithmic, hence have less impact on.

43
Q

What does the P+O setting mean in Tournament selection?

A

That the tournament will select the best one out of two individuals and their offspring.

Furthermore, the best solution gets 2 copies in next gen

44
Q

Why is the P+O setting favorable in tournament selection?

A

**Ensures elitism: **
Best solution found so far does not get lost (due to variation)

45
Q

Why does the best ind. get 2 copies in a P+O setting?

A

This makes convergens to (local) optimum faster.

46
Q

What is the biggest downside of high(er) mutation probability?

A

It slows down overall convergence.

47
Q

Does increasing tournament selection size also increase selection presure?

A

Yes, a individual now needs to be (relatively) better to more individuals in order to be the best.

48
Q

How does a uniqueness constraint on genes influence the efficiency of sGA’s? and how do we solve this?

A

Because the genetype now consists of unique genes, crossover is not (necessarily) possible anymore. This breaks sGA.

We solve this by introducing a repair operators.

49
Q

What is the def. of a permutation?

A

the action of changing the arrangement, especially the linear order, of a set of items.

50
Q

What is a repair operator and how does it work?

A

An extra operation for your algorithm that makes sure that permutation problems stay in the feasible solution space.

It detects clashes during crossover and replaces problems with unused numbers. Both the clashes and unused numbers are picked in random order.

51
Q

What is a permutation-specific operator?

A

An improved version of a repair operator, which aims to remove clashes by the means of new variators, specifically for permutations.

52
Q

Explain Davis’s order crossover (OX)

A
  • 2 parent permutations, 1 offspring
  • Copy a part from the first parent.
  • Read the rest of the second parent and omit elements that are already present in offspring
  • after cut, wrap around to the start.
53
Q

Explain Partially Mapped Crossover (PMX)

A

– 2 parent permutations, 1 offspring
– Copy part from first parent to offspring
– Now the second parent will look at the indexes of the already copied genes, and sees what its own genes are.
- if these genes don’t match (which is very likely) it will place its own gene from that index, at the index of what the offsprings gene is.

9 0 7 2 8 6 5 3 1 4
2 5 7 9 4 6 0 3 8 1
β†’
. . . 2 8 6 5 . . .
(now: 9 needs to go
where 2 was)
β†’
9 . . 2 8 6 5 . . .
(now: 4 needs to go
where 8 was)

54
Q

Explain Edge crossover (EX)

A

– 2 parent permutations, 1 offspring
- First,for each element list elements it is connected to (neighbours) in either parent.
- Pick a random element as your starter
- Then, pick the next element from the adjacency list from the previous selected element. Pick the one with the shortest adjecency list. (make sure you omit elements that already have been used.)
- if no neighbours are still available, pick a random element to continue with.
- Repeat until all elements are present in offpsring.

55
Q

Explain Bean’s generic solution to permutations

A
  • 2 parents 1 offspring.
  • For each parent, assign a (random) real value (0,1) to each gene to indicate it’s index in that genotype.
  • Now the standard crossovers are available to use on the default genotype, to ensure all elements are present.
  • after crossover, you can sort the elements in the genotype based on their index value