Lecture 4 Flashcards
Dennett’s cognitive wheel
The fact that you can invent something that works does not guarantee that you have learned something about nature.
Intelligence
Ability to deal with difficult and novel problems
o Adaptivity
o Very close to creativity
Time scales of adaptive behavior
- Extremely slow (evolutionary), e.g., the cuckoo
- Slow (operant conditioning), e.g., the Skinner box
- Fast (problem solving), i.e., intelligence
Think-aloud protocols
Write down what chess players say they are thinking during play and try to replicate these thought processes
Drosophila idea
o By building AI for chess, we hopefully learn how chess players think
o This failed.
Why is chess difficult?
- Explosion of possibilities (novel positions)
a. Combinatorial explosion
- What is a good position?
Simple problems
Problems for which the solution time does not increase fast when the problem becomes bigger
Polynomial time
Hard problems
Solution time increases very fast
o For larger instances this takes a practically infinite amount of time
o Same with chess, because brute force checks all the possibilities
Non-polynomial time
o If you can solve one efficiently, you can solve them all (the NP-complete problems)
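A quick sketch of the contrast: polynomial costs stay manageable, while the chess game tree blows up with depth. The branching factor of ~35 is a commonly cited average for chess; the comparison is purely illustrative.

```python
# Polynomial vs. non-polynomial growth: an O(n^3) cost stays manageable,
# while the chess game tree (~35 legal moves per position on average,
# a commonly cited figure) explodes with search depth.
BRANCHING_FACTOR = 35

for n in (2, 4, 6, 8, 10):
    polynomial = n ** 3                # e.g., cost of an O(n^3) algorithm
    game_tree = BRANCHING_FACTOR ** n  # positions at depth n in chess
    print(f"n={n:2d}  n^3={polynomial:5d}  35^n={game_tree:.2e}")
```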
Classic AI solutions to solving problems
- Tree algorithms
- Evaluation function
- Build in a huge opening book
- Endgame tablebases
Alpha-beta pruning
Stops evaluating a move as soon as at least one possibility has been found that proves the move to be worse than a previously examined move
o So the program does not have to follow the whole tree
o This way it can search much deeper
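A minimal sketch of minimax with alpha-beta pruning. The `Node` class and the tiny hand-built tree are made up for illustration; the leaf scores stand in for an evaluation function.

```python
from dataclasses import dataclass, field
from typing import List

@dataclass
class Node:
    score: float = 0.0                 # evaluation at a leaf
    children: List["Node"] = field(default_factory=list)

def alphabeta(node, depth, alpha, beta, maximizing):
    if depth == 0 or not node.children:
        return node.score              # leaf: fall back on the evaluation function
    if maximizing:
        value = float("-inf")
        for child in node.children:
            value = max(value, alphabeta(child, depth - 1, alpha, beta, False))
            alpha = max(alpha, value)
            if alpha >= beta:
                break                  # prune: the opponent will never allow this line
        return value
    else:
        value = float("inf")
        for child in node.children:
            value = min(value, alphabeta(child, depth - 1, alpha, beta, True))
            beta = min(beta, value)
            if beta <= alpha:
                break                  # prune: we already have a better option elsewhere
        return value

# Tiny depth-2 tree; pruning skips leaves that cannot change the outcome.
leaves = [Node(score=s) for s in (3, 5, 2, 9, 0, 1)]
root = Node(children=[Node(children=leaves[:2]),
                      Node(children=leaves[2:4]),
                      Node(children=leaves[4:])])
print(alphabeta(root, 2, float("-inf"), float("inf"), True))  # -> 3
```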
Deep Blue
Hardware + the four tricks previously mentioned
Kasparov lost against Deep Blue (1997)
o Deep Blue is a cognitive wheel, because this is not how humans solve problems or learn
Learning in AI
- Deep learning
a. Supervised and unsupervised learning
b. Neural networks
- Reinforcement learning
a. Learn interactions that are rewarded
Q-learning
Fill out the Q-table by walking around randomly; sometimes a reward is received, and over many episodes this builds up the fastest pathway to the goal (see the sketch below)
There is also an exploration vs. exploitation trade-off in these tables
Having subgoals also helps
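A minimal tabular Q-learning sketch on a toy corridor world; the environment, reward, and parameter values are all made up for illustration.

```python
import random

N_STATES, GOAL = 5, 4               # corridor of states 0..4, reward at state 4
ACTIONS = (-1, +1)                  # step left or right
LR, GAMMA, EPSILON = 0.1, 0.9, 0.2  # learning rate, discount, exploration rate

Q = {(s, a): 0.0 for s in range(N_STATES) for a in ACTIONS}

for episode in range(500):
    s = 0
    while s != GOAL:
        # exploration vs. exploitation: sometimes act randomly
        if random.random() < EPSILON:
            a = random.choice(ACTIONS)
        else:
            a = max(ACTIONS, key=lambda act: Q[(s, act)])
        s_next = min(max(s + a, 0), N_STATES - 1)
        r = 1.0 if s_next == GOAL else 0.0
        # Q-learning update: move Q(s,a) toward reward + discounted best future value
        best_next = max(Q[(s_next, act)] for act in ACTIONS)
        Q[(s, a)] += LR * (r + GAMMA * best_next - Q[(s, a)])
        s = s_next

# After training, acting greedily on the table follows the fastest path to the goal.
print([max(ACTIONS, key=lambda act: Q[(s, act)]) for s in range(N_STATES - 1)])
```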
Deep reinforcement learning
The Q-table is replaced by a deep neural network that predicts the next moves and learns from rewards
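A minimal sketch of the replacement: here the "network" is a trivial linear layer over a one-hot state (a stand-in for a real deep net, which would add hidden layers and take raw input such as pixels), but it is trained on the same temporal-difference target as the tabular update above.

```python
import numpy as np

rng = np.random.default_rng(0)
N_STATES, N_ACTIONS = 5, 2

# The Q-table is replaced by a function approximator: a single linear
# layer over a one-hot state encoding.
W = rng.normal(scale=0.1, size=(N_STATES, N_ACTIONS))

def q_values(s):
    x = np.eye(N_STATES)[s]         # one-hot encoding of the state
    return x @ W                    # predicted value of each action

def td_update(s, a, r, s_next, lr=0.1, gamma=0.9):
    # Same temporal-difference target as tabular Q-learning,
    # but the error now drives a gradient step on the weights.
    target = r + gamma * q_values(s_next).max()
    error = target - q_values(s)[a]
    W[s, a] += lr * error           # gradient of the linear layer is the one-hot input

# One illustrative update: in state 3, stepping right (action index 1)
# reached the goal and earned a reward.
td_update(s=3, a=1, r=1.0, s_next=4)
print(q_values(3))
```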
Monte Carlo Tree Search (MCTS)
Start with a random move and play the game out entirely with random moves
o This is called a rollout
Do this many times and use the average result to assign values to the candidate moves (e.g., moves A and B); a minimal sketch follows below
When MCTS is combined with deep reinforcement learning, it gives very powerful learning
> No evaluation of positions, just win/draw/loss at the end of a rollout
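A minimal rollout sketch in the flat form the card describes (full MCTS additionally grows a search tree and balances exploration against exploitation). The toy take-1-or-2 Nim game is made up for illustration.

```python
import random

# Toy game: one pile of stones, players alternately take 1 or 2,
# whoever takes the last stone wins. (Chosen only for illustration.)
def legal_moves(pile):
    return [m for m in (1, 2) if m <= pile]

def rollout(pile, my_turn):
    # Play the game out entirely with random moves; no evaluation of
    # positions, just win (1) or loss (0) at the end.
    while True:
        pile -= random.choice(legal_moves(pile))
        if pile == 0:
            return 1 if my_turn else 0
        my_turn = not my_turn

def choose_move(pile, n_rollouts=2000):
    # Value each candidate move by its average rollout result.
    scores = {}
    for move in legal_moves(pile):
        remaining = pile - move
        if remaining == 0:
            scores[move] = 1.0      # taking the last stone wins immediately
        else:
            wins = sum(rollout(remaining, my_turn=False) for _ in range(n_rollouts))
            scores[move] = wins / n_rollouts
    return max(scores, key=scores.get), scores

print(choose_move(7))  # taking 1 (leaving a multiple of 3) should score best
```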