R17 Questions Flashcards
What major accomplishment is achieved by the DeepMind software described by Koch?
DeepMind's software developed its own strategies to beat human players at the Atari video game Breakout.
What 3 features from neurobiology are incorporated in the algorithm?
Deep convolutional networks, reinforcement learning (operant conditioning), and selective memory replay
At the beginning of training, the algorithm improves based on _______________.
trial and error
What is a deep convolutional neural network (CNN)?
It uses multiple layers of simulated neurons (units), each with weighted inputs. A unit turns on or off depending on the weighted sum of the inputs it receives.
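To make the "weighted inputs" idea concrete, here is a minimal sketch of on/off units stacked in two layers. The names `unit_output` and `layer_output`, the weights, and the threshold of 0 are all assumptions chosen for illustration. A real convolutional network also shares weights across image positions and uses smoother activation functions, which this sketch leaves out.

```python
def unit_output(inputs, weights, threshold=0.0):
    """Return 1 (on) if the weighted sum of the inputs exceeds the threshold, else 0 (off)."""
    total = sum(x * w for x, w in zip(inputs, weights))
    return 1 if total > threshold else 0


def layer_output(inputs, weight_rows, threshold=0.0):
    """Apply one unit per row of weights to the same inputs."""
    return [unit_output(inputs, row, threshold) for row in weight_rows]


# Hypothetical weights: two hidden units feeding one output unit.
hidden = layer_output([1.0, 0.0, 1.0], [[0.5, 0.2, 0.4], [-0.6, 0.9, 0.1]])
output = layer_output(hidden, [[1.0, 1.0]])
print(hidden, output)
```

Stacking layers this way is what makes the network "deep": each layer's on/off pattern becomes the input to the next.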
CNNs are modeled after which brain system?
The mammalian visual system
What is the difference between supervised learning and reinforcement learning?
In supervised learning, every input image is paired with a specific label. In reinforcement learning, there are no labels: the consequence of any action for the game score unfolds over time.
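One common way to handle consequences that unfold over time is to credit earlier actions with a discounted share of later rewards. The sketch below is an illustration under assumptions, not the exact method from the reading: the function name `discounted_returns`, the discount factor `gamma=0.9`, and the reward list are all made up for the example.

```python
def discounted_returns(rewards, gamma=0.9):
    """For each time step, sum the current reward and the discounted future rewards."""
    returns = []
    running = 0.0
    # Work backwards so each step can reuse the return of the step after it.
    for reward in reversed(rewards):
        running = reward + gamma * running
        returns.append(running)
    returns.reverse()
    return returns


# Hypothetical episode: the score only changes on the third step.
rewards = [0.0, 0.0, 1.0]
returns = discounted_returns(rewards)
print(returns)
```

Even though the first two steps earn nothing immediately, they receive a nonzero return because they led to the later point. A supervised learner would instead need a correct label for every step.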
What is selective memory replay and where in the brain is it thought to occur?
Selective memory replay is when neurons repeat the same patterns of activation they displayed earlier. It is thought to occur in the hippocampus.
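In software, replay is usually approximated by storing past experiences and re-sampling them during training. The sketch below is a simplified illustration: the class `ReplayMemory`, its method names, and the stored fields are assumptions, and it samples uniformly at random rather than selectively.

```python
import random
from collections import deque


class ReplayMemory:
    """Keep the most recent experiences and hand back random batches of them."""

    def __init__(self, capacity):
        # A deque with maxlen silently drops the oldest item when full.
        self.buffer = deque(maxlen=capacity)

    def store(self, state, action, reward, next_state):
        self.buffer.append((state, action, reward, next_state))

    def sample(self, batch_size):
        # Never ask for more items than are stored.
        count = min(batch_size, len(self.buffer))
        return random.sample(list(self.buffer), count)


# Hypothetical usage: five steps stored in a memory that holds only three.
memory = ReplayMemory(capacity=3)
for step in range(5):
    memory.store(step, "left", 0.0, step + 1)
batch = memory.sample(2)
print(len(memory.buffer), len(batch))
```

Re-using old experiences in random order lets the learner revisit earlier events many times instead of learning only from the most recent moment.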
Name one weakness of the algorithm.
One weakness is that the algorithm is unable to plan for the long term.