Exam 3 Flashcards

Chapters 4 and 5

1
Q

What is the point of a local search algorithm?

A

To find a goal state rather than the path to reach it (as in the 8-queens problem); only the final configuration matters

2
Q

What is a state-space landscape?

A

It’s the state space visualized as a landscape, with each point (state) having an elevation defined by the value of the objective function

3
Q

What is hill-climbing search?

A

It’s a local search algorithm that repeatedly moves to the neighboring state with the highest value until it reaches a peak (where no neighbor has a higher value)
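
A minimal Python sketch of steepest-ascent hill climbing; the `neighbors` and `value` functions in the toy example below are hypothetical, chosen only for illustration:

```python
def hill_climb(state, neighbors, value):
    """Steepest-ascent hill climbing: repeatedly move to the best
    neighbor until no neighbor improves on the current state."""
    while True:
        best = max(neighbors(state), key=value, default=None)
        if best is None or value(best) <= value(state):
            return state  # a peak: no neighbor has a higher value
        state = best

# Toy example: maximize -(x - 3)^2 over the integers via +/-1 moves.
peak = hill_climb(0, lambda x: [x - 1, x + 1], lambda x: -(x - 3) ** 2)
# peak == 3, the global maximum of this landscape
```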

4
Q

What is a complete-state formulation?

A

Every state has all the components of a solution, but they might not all be in the right place

5
Q

What is a greedy local search?

A

A local search algorithm that grabs a good neighbor state without thinking ahead about where to go next (hill climbing is often described as greedy)

6
Q

What is stochastic hill climbing?

A

Hill climbing that chooses at random from among the uphill moves, with the probability of selection varying with the steepness of the uphill move

7
Q

What is first-choice hill climbing?

A

Stochastic hill climbing that generates successors randomly until one is generated that is better than the current state

8
Q

When would you use first-choice hill climbing?

A

When a state has many (e.g., thousands of) successors

9
Q

What is random-restart hill climbing?

A

It conducts a series of hill-climbing searches from randomly generated initial states until a goal is found

10
Q

Does the success of hill climbing depend on the shape of the state-space landscape?

A

Yes; how quickly a good solution is found depends on the shape of the landscape and the algorithm used

11
Q

What is simulated annealing?

A

It uses the idea of gradient descent (minimizing cost) combined with randomness: it starts at a high temperature, where bad moves are often accepted, and gradually cools toward zero, where almost none are
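
A minimal sketch of the annealing schedule; the cooling parameters and the toy cost function are hypothetical, for illustration only:

```python
import math
import random

def simulated_annealing(state, neighbor, cost, t0=10.0, cooling=0.95, t_min=1e-3):
    """Start hot (accept many worsening moves), cool toward zero
    (accept almost none), minimizing the cost function."""
    t = t0
    while t > t_min:
        nxt = neighbor(state)
        delta = cost(nxt) - cost(state)
        # Always accept improvements; accept a worsening move with
        # probability exp(-delta / t), which shrinks as t falls.
        if delta < 0 or random.random() < math.exp(-delta / t):
            state = nxt
        t *= cooling  # geometric cooling schedule
    return state

random.seed(0)
# Toy example: minimize (x - 3)^2 with random +/-1 steps.
result = simulated_annealing(20, lambda x: x + random.choice([-1, 1]),
                             lambda x: (x - 3) ** 2)
```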

12
Q

What is local beam search?

A

It keeps track of k states rather than just one, and allows all parallel search threads to communicate information

13
Q

What is stochastic beam search?

A

Local beam search that, instead of choosing the top k successors, chooses successors with probability proportional to the successor’s value

14
Q

What are evolutionary algorithms?

A

They can be seen as variants of stochastic beam search, inspired by biological natural selection: there is a population of individuals (states) in which the fittest (highest-value) individuals produce offspring (successor states) that populate the next generation

15
Q

What is the process that evolutionary algorithms take called?

A

Recombination

16
Q

What is a genetic algorithm?

A

An evolutionary algorithm in which each individual is a string over a finite alphabet, and offspring are produced by recombining parent strings
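
A compact sketch of a genetic algorithm; the population size, mutation rate, and OneMax fitness function below are hypothetical choices for illustration:

```python
import random

def genetic_algorithm(fitness, length, pop_size=20, generations=60,
                      mutation_rate=0.05, alphabet="01"):
    """Individuals are strings over a finite alphabet; fitter parents
    are recombined (single-point crossover) and offspring mutated."""
    rng = random.Random(42)
    pop = ["".join(rng.choice(alphabet) for _ in range(length))
           for _ in range(pop_size)]
    for _ in range(generations):
        weights = [fitness(ind) + 1e-9 for ind in pop]  # fitness-proportional
        nxt = []
        for _ in range(pop_size):
            p1, p2 = rng.choices(pop, weights=weights, k=2)  # select parents
            cut = rng.randrange(1, length)                   # crossover point
            child = p1[:cut] + p2[cut:]                      # recombination
            child = "".join(rng.choice(alphabet) if rng.random() < mutation_rate
                            else c for c in child)           # mutation
            nxt.append(child)
        pop = nxt
    return max(pop, key=fitness)

# OneMax: fitness = number of 1 bits; the optimum is the all-ones string.
best = genetic_algorithm(lambda s: s.count("1"), length=12)
```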

17
Q

What makes up evolutionary algorithms?

A

Population size, individual representation, mixing number (p), selection process, recombination, and mutation rate

18
Q

What is constrained optimization?

A

Optimization in which solutions must satisfy some hard constraints on the values of the variables

19
Q

What do we assume when we use search algorithms?

A

A fully observable, deterministic, known environment

20
Q

What is a belief state?

A

A set of physical states that the agent believes are possible; usually used in partially observable and/or nondeterministic environments

21
Q

What does a conditional plan do?

A

Specifies what to do depending on what percepts the agent receives while executing the plan

22
Q

What does an AND-OR search tree consist of?

A

OR nodes (agent does this or that action), and AND nodes (agent needs to find plan for 2 or more states at the same time)

23
Q

What is a cyclic solution?

A

A plan that keeps retrying a certain action until it succeeds, used when that action isn’t guaranteed to work when executed

24
Q

What is a sensorless problem?

A

A problem in which the agent’s percepts provide no information at all

25
Q

What is localization?

A

An agent working out where it is, given a map of its environment and a sequence of percepts and actions

26
Q

What is an offline search algorithm?

A

Agents that compute a complete solution before taking their first action

27
Q

What is an online search algorithm?

A

Agents that interleave computation and action; first it takes an action, then observes the environment, then computes the next action

28
Q

What are online search algorithms best for?

A

Dynamic or semi-dynamic environments, where an agent should not compute for too long, as well as nondeterministic domains

29
Q

What is the mapping problem?

A

A robot is placed in an unknown building and must explore to build a map that can later be used for getting from A to B, usually an example of online search

30
Q

What is the competitive ratio?

A

The ratio of the total cost of the path the agent actually travels to the cost of the path it would follow if it knew the search space in advance

31
Q

What is the adversary argument?

A

The idea of an adversary constructing the state space while the agent explores it, placing goals and dead ends wherever it chooses

32
Q

What makes a state space safely explorable?

A

Some goal state is reachable from every reachable state

33
Q

What is a random walk?

A

The agent simply selects at random one of the available actions from the current state, with preference to unexplored actions being possible

34
Q

What is optimism under uncertainty?

A

The assumption that an unexplored state will lead immediately to the goal with the least possible cost

35
Q

What is a competitive environment?

A

Two or more agents having conflicting goals, which creates adversarial problems

36
Q

How does an economy stance work in multi-agent environments?

A

It treats the aggregate of a very large number of agents: increasing demand will cause prices to rise in general, without having to predict the actions of any individual agent

37
Q

How does pruning work?

A

It “cuts off” (ignores) portions of the search tree that make no difference to the optimal move, saving time

38
Q

What is an evaluation function?

A

It estimates who is winning based on features of the state

39
Q

What is considered a game with imperfect information?

A

Games where not all information is known to every player, such as hidden card hands in UNO or poker

40
Q

What does zero-sum mean?

A

What is good for one player is just as bad for the other; no win-win outcome

41
Q

What does a game consist of?

A

Initial state (s0), To-Move(s), Actions(s), Result(s,a) (the transition model), Is-Terminal(s) (the terminal test), and Utility(s,p)

42
Q

What is a complete game tree?

A

A search tree that follows every sequence of moves all the way to a terminal state

43
Q

What is minimax search?

A

An algorithm for two-player game trees that alternates between moves taken by each player, with each state assigned a minimax value (its utility assuming both players play optimally, from MAX’s point of view); it can be thought of as working backward from the terminal states to the root
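
A minimal sketch of minimax on the standard two-ply textbook tree (MAX moves at the root, MIN at the second level; the `TreeGame` wrapper is an illustrative assumption):

```python
def minimax(state, game, maximizing=True):
    """Return the minimax value of state, working backward from
    terminal states: MAX takes the highest child value, MIN the lowest."""
    if game.is_terminal(state):
        return game.utility(state)
    values = [minimax(game.result(state, a), game, not maximizing)
              for a in game.actions(state)]
    return max(values) if maximizing else min(values)

# Classic two-ply example: MAX chooses at "A", MIN at "B", "C", "D".
TREE = {"A": ["B", "C", "D"],
        "B": [3, 12, 8], "C": [2, 4, 6], "D": [14, 5, 2]}

class TreeGame:
    def is_terminal(self, s): return isinstance(s, int)
    def utility(self, s): return s
    def actions(self, s): return range(len(TREE[s]))
    def result(self, s, a): return TREE[s][a]

root_value = minimax("A", TreeGame())
# max(min(3,12,8), min(2,4,6), min(14,5,2)) = max(3, 2, 2) = 3
```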

44
Q

What is the minimax decision?

A

The optimal choice for max that leads to the state with the highest minimax value

45
Q

Describe the minimax search properties

A

Time complexity: O(b^m); space complexity: O(bm), where b is the branching factor and m is the maximum depth of the tree

46
Q

What is alpha-beta pruning?

A

Similar to minimax, but it prunes decisions that cannot affect the optimal choice, saving time and memory
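
A minimal sketch of alpha-beta on the same toy tree used for plain minimax (the `TreeGame` wrapper is an illustrative assumption); notice that after "B" establishes alpha = 3, the 4 and 6 leaves under "C" are never examined:

```python
def alphabeta(state, game, maximizing=True,
              alpha=float("-inf"), beta=float("inf")):
    """Minimax with pruning: alpha is the best value MAX can guarantee
    so far, beta the best for MIN; once a value falls outside
    [alpha, beta], the remaining siblings cannot matter and are skipped."""
    if game.is_terminal(state):
        return game.utility(state)
    if maximizing:
        v = float("-inf")
        for a in game.actions(state):
            v = max(v, alphabeta(game.result(state, a), game, False, alpha, beta))
            if v >= beta:
                return v  # MIN would never let play reach here: prune
            alpha = max(alpha, v)
        return v
    v = float("inf")
    for a in game.actions(state):
        v = min(v, alphabeta(game.result(state, a), game, True, alpha, beta))
        if v <= alpha:
            return v      # MAX already has a better option: prune
        beta = min(beta, v)
    return v

TREE = {"A": ["B", "C", "D"], "B": [3, 12, 8], "C": [2, 4, 6], "D": [14, 5, 2]}

class TreeGame:
    def is_terminal(self, s): return isinstance(s, int)
    def utility(self, s): return s
    def actions(self, s): return range(len(TREE[s]))
    def result(self, s, a): return TREE[s][a]

value = alphabeta("A", TreeGame())  # same answer as plain minimax: 3
```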

47
Q

What do the alpha and beta values mean in minimax?

A

Alpha is the value of the best choice found so far for the MAX player; beta is the value of the best choice found so far for the MIN player

48
Q

What does the effectiveness of alpha-beta pruning depend on?

A

The order in which the states are examined

49
Q

What are transpositions?

A

Different permutations of the move sequence that end up in the same position

50
Q

What is the Type A strategy?

A

Considering all possible moves to a certain depth in the search tree, and then using a heuristic evaluation function to estimate the utility of states at that depth (explores wide but shallow portion)

51
Q

What is the Type B strategy?

A

Ignores moves that look bad and follows promising lines “as far as possible” (explores deep but narrow portion)

52
Q

What is the cutoff test?

A

A test that returns true for terminal states; otherwise it is free to decide when to cut off the search

53
Q

How does the evaluation function vary?

A

For terminal states, Eval(s,p) = Utility(s,p); For nonterminal states, Utility(loss,p) <= Eval(s,p) <= Utility(win,p)

54
Q

What do most evaluation functions work with?

A

Features of the states, which are the characteristics of the game

55
Q

How does a weighted linear function work?

A

Eval(s) = w1f1(s) + … + wnfn(s); each feature is multiplied by its weight (how important that feature is)
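
A minimal sketch of a weighted linear evaluation; the board representation, feature functions, and weights below are hypothetical, for illustration:

```python
def weighted_linear_eval(state, features, weights):
    """Eval(s) = w1*f1(s) + ... + wn*fn(s): each feature value is
    scaled by a weight reflecting how important that feature is."""
    return sum(w * f(state) for w, f in zip(weights, features))

# Hypothetical chess-like features: material advantage and mobility.
board = {"material": 3, "mobility": 10}
features = [lambda s: s["material"], lambda s: s["mobility"]]
score = weighted_linear_eval(board, features, [9.0, 0.1])
# 9.0 * 3 + 0.1 * 10 = 28.0
```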

56
Q

What is a quiescent position?

A

Positions in which there is no pending move that would wildly swing the evaluation

57
Q

What is the horizon effect?

A

When an opponent’s move will cause serious damage and is ultimately unavoidable but can be temporarily avoided by using delaying tactics

58
Q

What are singular extensions?

A

Moves that are “clearly better” than all other moves in a given position, even when the search would normally be cut off at that point

59
Q

What is forward pruning?

A

Pruning that discards moves that appear to be poor but could possibly be good ones (a Type B strategy)

60
Q

What is the late move reduction?

A

It reduces the depth to which moves that appear poor are searched, which can save time; if the reduced search returns a value above alpha, the search is re-run at full depth

61
Q

What is the Monte Carlo tree search?

A

It estimates the value of a state as the average utility over a number of simulations of complete games starting from that state

62
Q

What is the playout policy?

A

A policy that biases playout moves toward good ones rather than choosing uniformly at random

63
Q

What is the pure Monte Carlo search?

A

Do N simulations starting from the current state of the game, and track which of the possible moves from the current position has the highest win percentage
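
A minimal sketch of pure Monte Carlo search; the `ToyGame` below (where play is random after the first move) is a hypothetical example, not a real game:

```python
import random

def pure_monte_carlo(state, game, n=200, seed=7):
    """Run n random playouts after each available move and pick the
    move whose playouts have the highest win percentage."""
    rng = random.Random(seed)

    def playout(s):
        # Play random moves until a terminal state; return its utility.
        while not game.is_terminal(s):
            s = game.result(s, rng.choice(game.actions(s)))
        return game.utility(s)

    def win_rate(a):
        nxt = game.result(state, a)
        return sum(playout(nxt) for _ in range(n)) / n

    return max(game.actions(state), key=win_rate)

# Hypothetical toy game: the move to "L" reaches more winning leaves
# (utility 1) than the move to "R", so its win rate is higher.
class ToyGame:
    tree = {"root": ["L", "R"], "L": [1, 1, 1, 0], "R": [1, 0, 0, 0]}
    def is_terminal(self, s): return isinstance(s, int)
    def utility(self, s): return s
    def actions(self, s): return list(range(len(self.tree[s])))
    def result(self, s, a): return self.tree[s][a]

best_move = pure_monte_carlo("root", ToyGame())  # index 0, i.e. "L"
```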

64
Q

What is a selection policy?

A

It focuses computational resources on the important parts of the game tree, balancing exploration of states with few playouts against exploitation of states that have done well in past playouts

65
Q

What is early playout termination?

A

Stopping a playout that is taking too many moves; evaluate it with a heuristic evaluation function or just declare it a draw

66
Q

What are stochastic games?

A

Games with a bit of unpredictability caused by a random element (such as throwing dice)

67
Q

What are chance nodes?

A

Often shown as circles in game trees; they denote all possible outcomes of an action that depends on chance (e.g., all possible dice rolls)

68
Q

What is the expected value of a position?

A

The average over all possible outcomes of the chance nodes

69
Q

What is the expectiminimax value?

A

A generalization of the minimax value (defined for deterministic games) to games with chance nodes: MAX and MIN nodes work as before, while a chance node takes the expected value (probability-weighted average) of its outcomes
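
A minimal sketch of the expectiminimax recursion on a tiny hand-built tree (the tuple encoding of nodes is an illustrative assumption):

```python
def expectiminimax(node):
    """Minimax extended with chance nodes: MAX and MIN pick extremes,
    while a chance node's value is the probability-weighted average
    (expected value) of its outcomes."""
    kind, data = node
    if kind == "leaf":
        return data
    if kind == "max":
        return max(expectiminimax(c) for c in data)
    if kind == "min":
        return min(expectiminimax(c) for c in data)
    # chance node: data is a list of (probability, child) pairs
    return sum(p * expectiminimax(c) for p, c in data)

# MAX chooses between a sure 3 and a 50/50 gamble between 10 and 0;
# the gamble's expected value is 5.0, so MAX prefers it.
game = ("max", [("leaf", 3),
                ("chance", [(0.5, ("leaf", 10)), (0.5, ("leaf", 0))])])
value = expectiminimax(game)  # max(3, 0.5*10 + 0.5*0) = 5.0
```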