Lecture 27 - Reinforcement Learning and Motor Sequences Flashcards by Sydney Green

What is classical conditioning? What is the famous example?

a learned (reinforced) reflex/ response that is evoked by stimulus

Pavlov’s Dog

How well did you know this?

Not at all

Perfectly

What is reinforcement?

increase behaviour

How well did you know this?

Not at all

Perfectly

What is punishment?

decrease behaviour

How well did you know this?

Not at all

Perfectly

Give an example of positive reinforcement.

Sweet treat

How well did you know this?

Not at all

Perfectly

Give an example of negative reinforcement

take away homework

How well did you know this?

Not at all

Perfectly

Give an example of positive punishment.

writing lines

How well did you know this?

Not at all

Perfectly

Give an example of negative punishment

take away recess

How well did you know this?

Not at all

Perfectly

What does the process of reinforcement learning involve?

Learning to link reward with specific actions (and their outcomes) so they become repeated

How well did you know this?

Not at all

Perfectly

What is binary reward feedback?

Action is rewarded or not

How well did you know this?

Not at all

Perfectly

What is a scalar quantity eward feedback?

relative to the utility of action/reward outcomes

How well did you know this?

Not at all

Perfectly

What is the goal of reinforcement learning?

to maximize reward and minimize loss

How well did you know this?

Not at all

Perfectly

What is cummulative reward?

it might be better to sacrifice immediate reward for long-term reward

How well did you know this?

Not at all

Perfectly

What are some examples of cummulative rewards?

chess
investements

How well did you know this?

Not at all

Perfectly

What happens to actions that are associated with reward?

they become strengthened/repeated

How well did you know this?

Not at all

Perfectly

processes of reinforcement learning:

What is exploration?

the trial and error process of aquiring more information about the envrionemnt by searching possibilities

searchhing many action possibilites to determin which action tends to maximize reward

How well did you know this?

Not at all

Perfectly

processes of reinforcement learning:

What is exploitation?

capitalize on known information to maximize reward

actions associated with past history of reward tend to be repeated to maximize future reward

How well did you know this?

Not at all

Perfectly

What is the tradeoff between action exploration and exploitation?

Study These Flashcards

shift emphasis from exploring to exploitation to maximize reward

What is a hockey example of the exploration, exploitation trade-off?

Study These Flashcards

exploration - you find out goal tender is weak low
exploitation - shoot low to score goals (maximize reward)

What is the basal ganglia?

Study These Flashcards

Collection of subcortical structures in the brain

Where is dopamine produced?

Study These Flashcards

In the substantia nigra

What is dopamine?

Study These Flashcards

a neurotransmitter that is part of the brains intrinsic reward system

basal ganglia:

What are the 2 parts of the striatum?

Study These Flashcards

caudate nucleus
putamen

What is dopamine input into the striatum critical for?

Study These Flashcards

learning from reward and strengtheing the representation of specific actions

Learning the piano example:

What happens to serial actions with learning?

Study These Flashcards

can produce faster sequences with less errors. Key presses become smoother and linked together

What can practice do in terms of sequential actions?

Can link sequential actions into a single movement pattern

# Chuncking of individual elements in a motor sequence What happens in early practice?

shifting gears requires several dinstinct movements

# Chuncking of individual elements in a motor sequence What happens in middle practice?

independant actions are 'chunked' into a larger subunit of movement sequences

# Chuncking of individual elements in a motor sequence What happens in late practice?

actions become 'chunked' together into a cohesive movement sequence where successivce actions are 'coarticulated'

What is chunking?

fusing a series of individual elements into a larger subunit of a movement sequence

What is co-articulation?

adjacent movement elements influence each other

What does co-articulation cause?

a blending of discrete movements within a sequence and is associated with an ncrease in speed and smoothness of movement sequences

What does sequence learning alter?

processing in several sensory and motor brain areas

# Early stages of learning: What is the role of the dorsolateral prefrontal cortext?

- working memory, attention, action selection - conscious planning and processing of movements - likely involved in intial stages of chunking motor sequences

# Early stages of learning: What is the role of the inferior parietal cortex?

processing visual and somatosensory information

# Early stages of learning: What is the role of rostral premotor areas?

- planning motor actions - processing visual and somatosensory feedback

# Early stages of learning: What is the role of basal ganglia?

processing reward during action

# Later stages of learning: What is the role of supplementary motor area?

- selecting and planning previously learned motor responses - likely involved in later stages of motor chunking and coarticulation

# Later stages of learning: What is the role of the dorsal premotor area?

- processing visual and somatosensory information - involved in selecting learned actions

# Later stages of learning: What is the primary motor cortext involved in

producing motor actions

# Early stages of learning: What is the cerebellum involved in?

detecting and correcting errors

How does brain activity evolve during motor sequence learning?

Early stages require high cognitive demand and conscious prcoessing. Later stages require low cognitive demand and automatic processing.

Lecture 27 - Reinforcement Learning and Motor Sequences Flashcards

(41 cards)