Class 4 - Adaptation Flashcards
3 types of adaptive processes
- Evolutionary algorithms
- Reinforcement learning
- Learning by demonstration
Evolutionary robots…
- are created to adapt to their environment via evolutionary computing
- can generate off-springs
- mainly focus on evolving the brain
- can work on adjusting issues by themselves
Genotypes
numbers that describe the phenotype of the evolutionary robot
Genotypes can be…:
A. discrete numbers between [0, 1]
B. continuous numbers between [0, 1]
C. both A and B
D. neither A nor B
C. both A and B
T/F: Different parts of the genotype can describe different parts of the phenotype of the robot
True
An evolutionary algorithm can be split into two main phases…
- the “testing phase” –> where the robot is put in the environment and the fitness function is evaluated
- the “generation phase”, where the resulting fitness function is used to create the next generation of offsprings
3 methods to create off-springs (in the context of evolutionary algorithms)…
- genetic algorithm
- evolutionary strategy
- modern evolutionary strategy
Match the off-spring method (in the context of evolutionary algorithms) to its description…
A. genetic algorithm
B. evolutionary strategy
C. modern evolutionary strategy
- only allows continuous values in the genotype which are then crossed-over with mutation (which allows a number between 0 and 1) to create the genotype of the off-spring. The first generation of genotypes can be random and parents can outlive the children if they have a better fitness function.
- only allows discrete values in the genotype which are then crossed-over with (flipped value) mutation to create the genotype of the off-spring. Also, the first generation of genotypes can be random.
- Uses correlation to see whether the previous children were performing better than the new version. Only uses one parent since it is computationally challenging.
A-2
B-1
C-3
Advantages of evolutionary algorithms
If one component is weaker, the others can compensate!
Disadvantages of evolutionary algorithms
- unbounded complexity does not happen
- adaptations are short-term → need to fix problems immediately
- prevents exploring new ideas because offspring comes from a certain family
T/F: in RL, only the off-spring “carries” the improvement
False, that’s the case generally for evolutionary algorithms. In RL, the robot improves itself continuously given a reward
Policy
explains to the robot which action to take in a certain state
Is the policy “ideal” before the robot starts exploring?
No, the robot updates the policy as it explores the environment
Two types of policies
- deterministic
2. stochastic
Gaussian policy is an example of…
A. deterministic policy
B. stochastic policy
B. stochastic policy