Adversarial Search Flashcards
Reinforcement learning techniques have eclipsed __________ search for some games
alpha-beta
We represent 2-player games like chess as a ____ ______
game tree
In the game tree, levels alternate between the two players' _____
moves
In a minimax search, MIN wants to ______ the objective function and MAX wants to ______ it
minimize, maximize
The _________ algorithm is the basic search technique that computers use for choosing moves
MINIMAX
Alpha-beta pruning is a technique for efficiently _________ certain parts of the game tree
discarding
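Example: a hedged sketch of alpha-beta pruning on a toy nested-list tree (numbers are leaf payoffs for MAX). The `visited` list is a hypothetical addition that only records which leaves get evaluated, to make the discarded parts visible.

```python
import math


def alphabeta(node, maximizing, alpha=-math.inf, beta=math.inf, visited=None):
    """Minimax value of a node, skipping branches that cannot matter.

    alpha: best value MAX can already guarantee on the current path.
    beta:  best value MIN can already guarantee on the current path.
    """
    if not isinstance(node, list):
        if visited is not None:
            visited.append(node)  # record evaluated leaves (illustration only)
        return node
    if maximizing:
        best = -math.inf
        for child in node:
            best = max(best, alphabeta(child, False, alpha, beta, visited))
            alpha = max(alpha, best)
            if alpha >= beta:
                break  # MIN would never allow this line: discard the rest
        return best
    best = math.inf
    for child in node:
        best = min(best, alphabeta(child, True, alpha, beta, visited))
        beta = min(beta, best)
        if alpha >= beta:
            break  # MAX already has something better: discard the rest
    return best


tree = [[3, 12, 8], [2, 4, 6], [14, 5, 2]]
visited = []
value = alphabeta(tree, True, visited=visited)
print(value, visited)
```

On this tree the result matches plain minimax, but the leaves 4 and 6 are never evaluated: once MIN finds the 2 in the second branch, that branch cannot beat the 3 MAX already has.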
AlphaZero learned how to play chess in less than ___ hours of _____, but it did need tons of _________ power
10, self-play, computational
AlphaZero uses a technique called…
Monte-Carlo Tree Search
What is the basic idea of Monte-Carlo Tree Search?
- Pick random successor states until the game is played out
- Repeat thousands of times
- Select the move leading to the most wins
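Example: the three steps above can be sketched with purely random playouts. This is only the basic idea from the card; full Monte-Carlo Tree Search also grows a search tree and balances exploration against exploitation, which this sketch omits. The game (a single-pile Nim variant where players remove 1 to 3 stones and whoever takes the last stone wins) and all function names are assumptions chosen for illustration.

```python
import random


def legal_moves(stones):
    """Moves available in the assumed toy game: take 1, 2, or 3 stones."""
    return [m for m in (1, 2, 3) if m <= stones]


def random_playout(stones, player_to_move, rng):
    """Pick random successor states until the game is played out.

    Returns the winner (0 or 1): the player who takes the last stone.
    """
    while True:
        stones -= rng.choice(legal_moves(stones))
        if stones == 0:
            return player_to_move
        player_to_move = 1 - player_to_move


def monte_carlo_move(stones, playouts=2000, seed=0):
    """Choose a move for player 0 by counting wins over random playouts."""
    rng = random.Random(seed)
    wins = {}
    for move in legal_moves(stones):
        remaining = stones - move
        if remaining == 0:
            wins[move] = playouts  # taking the last stone wins outright
            continue
        # Repeat many times: opponent (player 1) moves next in each playout.
        wins[move] = sum(
            random_playout(remaining, 1, rng) == 0 for _ in range(playouts)
        )
    # Select the move leading to the most wins.
    return max(wins, key=wins.get), wins


best_move, win_counts = monte_carlo_move(3)
print(best_move, win_counts)
```

With 3 stones left, taking all 3 wins immediately, while taking 2 leaves the opponent a forced win, so the playout counts separate the moves clearly.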
AlphaZero has also become a master at ______ and __
Shogi, Go
_____ _______ was the first machine to beat a grandmaster in a tournament. It was later brought to IBM and renamed ____ _____
Deep Thought, Deep Blue