Lecture 3 Flashcards

1
Q

What does a search algorithm do (in generic terms)?

A

A search algorithm takes a search problem as input and returns a solution, or an indication of failure.

2
Q

In what kind of environment is the solution to any problem a fixed sequence of actions?

A

In an environment that is fully observable, deterministic, and known.

3
Q

What is an admissible heuristic?

A

One that never overestimates the cost to reach the goal. Therefore, it is optimistic. (For example, straight-line distance is an admissible heuristic for route finding: it never overestimates the actual road distance.)

4
Q

How do we decide which node from the frontier to expand next?

A

Using a search strategy, such as best-first, breadth-first, or depth-first search. Best-first search, for example, expands the node that minimizes an evaluation function f(n).

5
Q

What are the 4 ways to measure the performance of a search algorithm?

A
  1. Completeness
  2. Cost Optimality
  3. Time Complexity
  4. Space Complexity
6
Q

What is the difference between an informed and uninformed search?

A

An informed search has a heuristic estimate of how far a node is from the goal at any given step of the algorithm, whereas an uninformed search has no such estimate and can only tell goal states from non-goal states.

7
Q

How can you implement a BFS using Best-First Search?

A

f(n) = n.depth

Using the node’s depth as the evaluation function.

8
Q

How can you implement a DFS using Best-First Search?

A

f(n) = -n.depth

Using the negative of the node’s depth as the evaluation function.

9
Q

How can you implement Dijkstra’s algorithm using Best-First Search?

A

f(n) = n.pathCost

Using the node’s path cost as the evaluation function.
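
One generic best-first loop covers the last three cards; only f(n) changes. A minimal Python sketch (the problem interface — goal_test, successors — is an assumption for illustration, not the lecture's code):

import heapq
import itertools

def best_first_search(start, goal_test, successors, f):
    # f(depth, path_cost) orders the frontier; the smallest value is expanded first.
    tie = itertools.count()  # tie-breaker so the heap never compares states
    frontier = [(f(0, 0), next(tie), start, 0, 0)]
    reached = {start: 0}
    while frontier:
        _, _, state, depth, cost = heapq.heappop(frontier)
        if goal_test(state):
            return state  # or reconstruct the path with parent pointers
        for nxt, step_cost in successors(state):
            if nxt not in reached or cost + step_cost < reached[nxt]:
                reached[nxt] = cost + step_cost
                heapq.heappush(frontier, (f(depth + 1, cost + step_cost),
                                          next(tie), nxt, depth + 1, cost + step_cost))
    return None  # failure

# f(n) = n.depth     -> BFS:      lambda depth, cost: depth
# f(n) = -n.depth    -> DFS:      lambda depth, cost: -depth
# f(n) = n.pathCost  -> Dijkstra: lambda depth, cost: cost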

10
Q

What could go wrong if you use greedy best-first search for path-finding?

A

It is prone to getting stuck in local optima or dead ends. Because the algorithm always prioritizes nodes that appear to be closest to the goal, it may not explore other potentially useful paths or nodes that are further away from the goal but could ultimately lead to a better solution.

11
Q

Is A* cost-optimal?

A

It depends on the heuristic function. If the heuristic function is admissible or consistent, A* is cost-optimal. Otherwise it may not be cost-optimal.
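
In the notation of the earlier best-first cards, A* orders the frontier by

f(n) = g(n) + h(n) = n.pathCost + h(n)

where h(n) is the heuristic estimate of the cheapest cost from n to the goal. If h never overestimates (is admissible), the solution A* returns is cost-optimal.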

12
Q

What is the performance difference between running hill-climbing search (or simulated annealing) k times and beam search of size k?

A

Running hill-climbing (or simulated annealing) k times in parallel can end up with essentially the same result as a single run: the runs that start in bad regions stay until the end because there is no way to prune them.

In local beam search, bad options are pruned from the beginning: at every step, the successors of all k current states are pooled and only the k best are kept, so useful information is passed between the parallel searches.
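
A minimal local-beam-search sketch (the helper names successors and score are assumptions for illustration): each step pools every successor of the current k states and keeps only the k best, which is exactly the pruning described above.

import heapq

def local_beam_search(initial_states, successors, score, k, steps):
    beam = heapq.nlargest(k, initial_states, key=score)
    for _ in range(steps):
        pool = [s for state in beam for s in successors(state)]
        if not pool:
            break
        beam = heapq.nlargest(k, pool, key=score)  # prune: keep only the k best
    return max(beam, key=score)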

13
Q

Considering the vanilla implementation of MINIMAX, discuss the disadvantages and possible solutions to fix them.

A

Memory and speed problems, because the algorithm must expand the game tree all the way down to the terminal states.
- Alpha-Beta pruning speeds up the search by skipping subtrees that cannot affect the final decision.
- A depth limit bounds the search; at the cutoff, a heuristic evaluation function estimates the score of non-terminal states.
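
A hedged sketch combining both fixes (the game interface — is_terminal, utility, moves, result, eval_fn — is assumed for illustration, not the lecture's code):

def alpha_beta(state, depth, alpha, beta, maximizing, game):
    if game.is_terminal(state):
        return game.utility(state)
    if depth == 0:
        return game.eval_fn(state)  # depth limit: heuristic estimate for a non-terminal state
    if maximizing:
        value = float('-inf')
        for move in game.moves(state):
            value = max(value, alpha_beta(game.result(state, move),
                                          depth - 1, alpha, beta, False, game))
            alpha = max(alpha, value)
            if alpha >= beta:
                break  # prune: MIN will never allow this branch
        return value
    else:
        value = float('inf')
        for move in game.moves(state):
            value = min(value, alpha_beta(game.result(state, move),
                                          depth - 1, alpha, beta, True, game))
            beta = min(beta, value)
            if alpha >= beta:
                break  # prune: MAX will never allow this branch
        return value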

14
Q

What is a competitive Environment?

A

An environment in which two or more agents have conflicting goals.

15
Q

Adversarial agents can be viewed under which 3 environment stances?

A
  • an economy
  • part of the environment (making it non-deterministic)
  • explicitly modelled
16
Q

(Game Theory) Define transition model.

A

Defines the state resulting from taking action ‘a’ in state ‘s’

RESULT(s, a);

17
Q

(Game Theory) Define terminal test.

A

A test which is true when the game is over and false otherwise.

IS_TERMINAL(s);

18
Q

(Game Theory) What is a terminal state?

A

A state in which the game has ended.

19
Q

(Game Theory) What does a utility function define?

A

The final numeric value (utility score) of player ‘p’ when the game ends in terminal state ‘s’

UTILITY(s, p);

20
Q

(Game Theory) Which components make up the state space graph?

A

The initial state, ACTIONS function, and RESULT function.

21
Q

(Game Theory) What is a state space graph?

A

A graph where the vertices are states, the edges are moves, and a state might be reached by multiple paths.

22
Q

(Game Theory) How do you determine which move to make using a state space graph?

A

By superimposing a game tree over part of the state space graph.

23
Q

(Game Theory) What is a game tree?

A

A search tree that follows every sequence of moves all the way to a terminal state.

24
Q

What is pruning?

A

Removing/ignoring large parts of a game tree that make no difference to the outcome.

25
Q

Describe a Weighted Linear Evaluation Function.

A

Computes a numerical contribution from each feature f_i of the state and combines them using weights w_i learned from experience to find the total value: EVAL(s) = w_1*f_1(s) + w_2*f_2(s) + ... + w_n*f_n(s).
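
A one-line sketch of the same computation (the weights and features lists are assumed inputs):

def weighted_linear_eval(state, weights, features):
    # EVAL(s) = w1*f1(s) + w2*f2(s) + ... + wn*fn(s)
    return sum(w * f(state) for w, f in zip(weights, features))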

26
Q

What are the weaknesses of Alpha-Beta Search (that are addressed by Monte Carlo Tree Search)?

A
  1. Alpha-Beta Search can only look ahead a few moves when the game tree has a large branching factor
  2. The evaluation function is difficult to define for complex games like Go

(where material value is not a strong enough indicator and most positions are in flux until the endgame)

27
Q

How does a Monte Carlo Tree Search strategy assign value to states (instead of a heuristic evaluation function)?

A

The value of a state is estimated as the average utility over a number of simulations of complete games starting from the state.

28
Q

What does a simulation (aka playout or rollout) do in Monte Carlo Tree Search?

A

A simulation repeatedly chooses moves for one player then the other until a terminal position is reached.

(The rules of the game, not heuristics, determine the score and the winner.)
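
A minimal sketch of both ideas (random rollout policy; the game interface is an assumption for illustration):

import random

def playout(state, game):
    # Play random legal moves until the game's own rules end it.
    while not game.is_terminal(state):
        state = game.result(state, random.choice(game.moves(state)))
    return game.utility(state)  # scored by the rules, not by a heuristic

def estimate_value(state, game, n_sims=100):
    # Value of a state = average utility over n complete simulations.
    return sum(playout(state, game) for _ in range(n_sims)) / n_sims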

29
Q

What does the Monte Carlo Tree Search selection policy do?

A

Determines which node to expand next by balancing two factors: exploration of states that have had few playouts so far, and exploitation of states that have done well in past playouts.
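
A common selection policy is UCB1 (used in UCT); this is the standard formula, not necessarily the lecture's exact choice. The first term rewards exploitation, the second exploration:

import math

def ucb1(node, C=1.4):
    if node.visits == 0:
        return float('inf')  # always try unvisited children first
    exploit = node.total_utility / node.visits
    explore = C * math.sqrt(math.log(node.parent.visits) / node.visits)
    return exploit + explore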

30
Q

What are the four steps performed every iteration of a Monte Carlo Tree Search?

A
  1. Selection
  2. Expansion
  3. Simulation
  4. Back-propagation
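
One iteration as a skeleton (select, expand, and backpropagate are hypothetical helpers; selection would typically use a policy like UCB1 from the previous card, and playout is the rollout sketched at card 28):

def mcts_iteration(root, game):
    leaf = select(root)                    # 1. Selection: descend via the selection policy
    child = expand(leaf, game)             # 2. Expansion: add a new child node to the tree
    utility = playout(child.state, game)   # 3. Simulation: roll out to a terminal state
    backpropagate(child, utility)          # 4. Back-propagation: update stats back to the root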
31
Q

What are the disadvantages of the Monte Carlo tree search?

A
  1. A single move can change the course of the game, but MC search might fail to consider it because of its stochastic nature
  2. There are game states that are “obviously” a win to humans or an evaluation function, but MC search will still take many moves in a playout to verify the winner.
32
Q

Problem-solving Agent

A

Plans ahead to find a sequence of actions that form a path to a goal state when the correct action is not immediately obvious.

33
Q

4 Step Problem-solving Process

A
  • Goal Formulation
  • Problem Formulation (desc. of states and actions necessary to reach goal)
  • Search
  • Execution