Search in complex environments Flashcards

1
Q

What is meant by hill-climbing in the context of search?

A

Going towards a local (or if lucky global) maximum.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Gradient descend and hill-climbing are opposites. What is the purpose of gradient descend?

A

Finding a global minimum.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What characterizes hill-climbing search?

A

Moves to neighbouring state with highest value. Terminates when it reaches a peak.

Optimal: No

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is meant by a complete-state formulation?

A

A state-formulation where every state has the components for a solution, but may not be in the right place. Example: n-queens.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What characterizes stochastic hill-climbing?

A

It chooses random uphill moves.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What characterizes first-choice hill-climbing?

A

It generates successors randomly until one is better than the original state.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What characterizes random-restart hill climbing?

A

Variation of first-choice hill-climbing, where it restarts if it doesn’t succeed.

Complete: Yes, because it will eventually generate a goal state as the initial state.

Number of restarts: 1/p

Number of steps: Cost of one successful iteration + (1-p)/p times the cost of failure.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is the concept of simulated annealing based on?

A

Blacksmithing.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What characterizes simulated annealing?

A

Combines hill-climbing and random walks. A balance between exploration and exploitation.

Picks random moves if move improves situation, else accepts move with probability less than 1.

Probability of random move decreases exponentially with the “badness” of the move.

Probability decreases as temperature T goes down.

Bad moves more likely at the start when T is high.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What are the characteristics of local beam search?

A

Instead of keeping one node in memory at a given time, local beam search keeps track of k states.

Begins with k random states. At each step, all successors of all k states are generated, chooses the k best successors and repeat until it reaches a goal state or otherwise stopped.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is the difference between local beam search and random-restart hill-climbing?

A

In local beam search information is stored between the search threads running in parallel.

In random-restart hill climb, each search process runs independently of each other.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What characterizes evolutionary/genetic algorithms?

A

Inspired by natural selection.

Selection: Selects individuals with probability proportional to fitness score or n individuals and select p most fit ones as parents.

Crossover: Randomly select a crossover point to split the parent strings, and recombine to form two new children.

Mutation rate: How often a mutation should happen (to a single value)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is meant by an empirical gradient method?

A

A method that measures progress by change in value of an objective function between two nearby points.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Which concept is useful when dealing with non-deterministic environments?

A

Belief states: States the agent believes are possible.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is the difference between AND- & OR-nodes in an AND/OR-tree

A

An OR-node doesn’t have to be expanded instantly. All states in an AND-node has to be expanded.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Can an agent act rationally in a sensorless environment?

A

Yes it can. For instance a vacuum cleaner will still be able to make a plan for cleaning if it can’t tell whether the tile it is on is clean.

17
Q

What is the difference between offline- and online search agents?

A

Offline search agents: Computes complete solution before taking an action.

Online search agents: Takes an action, observes environment, computes next action, repeats. (Used for dynamic environments. Can also be useful for non-deterministic domains. Necessary for unknown environments)

18
Q

What is meant by the competitive ratio?

A

A value to compare the online algorithm to the most optimal path.

19
Q

What is a precondition for using Online DFS (depth-first search)?

A

State space needs to be safely explorable.

20
Q

What makes LRTA* (Learning-real-time A) different from A?

A

It stores a current best estimate, H(s) of the cost to reach the goal from each state it has visited, and updates it for every new state it reaches, as it gets more information.