Lecture 9 - Adaptive Control Flashcards
most ____ skills improve with practice
sensorimotor
Proof that VOR learns to adjust
VOR restores retinal-image stability when vision is altered by lenses - magnifying glasses take ~ 30 min for VOR to grow stronger - minimizing glasses makes VOR grow weaker - glasses that flip the world upside down take about a week for VOR to adjust (VOR changes direction)
what happens to saccades when eye muscle is damaged?
saccades miss target and drift
if eye muscle damage isn’t too sever, what happens to saccades?
neural adaption restore saccade accuracy and eliminate drift, even if the muscle is damaged
controller must know plant well, but…
we do not have to be born with accurate knowledge of our plant - plant changes through life - controller learns
How do control networks adjust themselves to improve performance?
- error-driven learning
- Learning by perturbation
- Gradient-descent learning
controllers learn the properties of their plants based on…
sensory feedback (learn from trying, examples…)
what is the aim of learning?
minimize average error (aka risk, aka expected loss)
error (e) =
y - y*
loss (L) =
|e|2/2
What does the learner want?
minimize error & average loss E(L)
want both these to get 0
risk depends on…
probabilities of different situations (not every input has the same loss)
how does the brain estimate risk?
Learning by perturbation
learning by perturbation
make small changes to the weights and accept the ones that reduce error;
wi + η
If |epert| < |e|, then the perturbed weights are accepted
each decision to accept or reject perturbed weights is based on…but…
single input z, but overtime the neuron samples many inputs and the OVERALL risk is reduced