6 - games and adversarial search Flashcards
Games vs. single-agent search
We don’t know how the opponent will act
Solution: strategy or policy + efficnecy
Minimax value of a node
the utility (for MAX) of being in the corresponding state, assuming perfect play on both sides
Minimax strategy
Choose the move that gives the best worst-case payoff
Alpha-Beta Pruning
Avoid processing subtrees that have no effect on the result
α is used in MIN nodes, and is assigned in MAX nodes ( α= “at least”)
β is used in MAX nodes, and is assigned in MIN nodes ( β = “at most”)
Beta cutoff
stop search below MAX node N (i.e., don’t examine more descendants) if alpha(N) >= beta(i) for some MIN node ancestor iof N
Alpha cutoff
stop search below MIN node N if beta(N)<=alpha(i) for a MAX node ancestor iof N
Expectiminimax
for chance nodes, average values weighted by the probability of each outcome
Monte Carlo simulation
when you get to a chance node, simulate a large number of games with random dice rolls and use win percentage as evaluation function