Cram1 Flashcards
The agent perceives the ______ and acts according to a ______
environment, performance criteria
Performance criteria are
domain dependent
Possible performance criteria:
m² per hour
how clean is the room after vacuuming
power consumption
Ideal rational agent
The agent chooses an action which maximizes its performance for a given
percept sequence and knowledge about the world.
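Read this way, the definition is an argmax over the available actions. A minimal sketch, assuming a hypothetical `expected_performance` function (not part of the original cards) that stands in for the agent's knowledge about the world and scores an action for a given percept sequence; the toy scores are illustrative only:

```python
from typing import Callable, Sequence


def rational_action(
    percepts: Sequence[str],
    actions: Sequence[str],
    expected_performance: Callable[[Sequence[str], str], float],
) -> str:
    """Return the action with the highest expected performance.

    `expected_performance` is a hypothetical stand-in for the agent's
    knowledge about the world; ties go to the earliest action listed.
    """
    return max(actions, key=lambda a: expected_performance(percepts, a))


# Toy vacuum example (made-up numbers): sucking pays off only when the
# latest percept reports dirt, otherwise moving on scores higher.
def toy_score(percepts: Sequence[str], action: str) -> float:
    dirty = bool(percepts) and percepts[-1] == "dirty"
    if action == "suck":
        return 1.0 if dirty else 0.0
    return 0.5  # score for "move"
```

With these assumed scores, `rational_action(["clean", "dirty"], ["suck", "move"], toy_score)` picks `"suck"`.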
To avoid trivialization, use ____
active sensing
Categorization of Agents
Percepts: Symptoms, findings, patient’s answers,
Actions: Questions, tests, treatments,
Goals (performance measures): Healthy patient, minimize costs,
Environment: Patient, hospital
The goal of learning
Optimize future behavior on the basis of the history of percepts, actions, and knowledge about the world.
accessible vs. nonaccessible
Are all relevant aspects of the world accessible to the sensors?
deterministic vs. nondeterministic/stochastic
Does the next state depend completely on the current state and the action chosen?
episodic vs. nonepisodic
Does the choice of an action depend only on the current state or also on the past?
static vs. dynamic
Can the world change while deciding on the next action?
discrete vs. continuous
Is the world discrete (as in chess) or not (mobile robots)?
Properties of Environments
accessible, deterministic, episodic, static, discrete
A goal is
a set of world states that the agent finds desirable (it wants to reach one of them).
Search
Finding an action sequence which transforms an initial state into a goal state.
Single-state problem
complete world knowledge,
complete knowledge about the actions
Multiple-state problem
incomplete world knowledge,
complete knowledge about the actions
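One common way to handle a multiple-state problem is to track the set of states the agent might be in and apply each known action to every member of that set. A hedged sketch, assuming a hypothetical three-cell corridor world and deterministic actions; the names `update_belief` and `corridor_result` are illustrative and not from the cards:

```python
from typing import Callable, FrozenSet


def update_belief(
    belief: FrozenSet[int],
    action: str,
    result: Callable[[int, str], int],
) -> FrozenSet[int]:
    """Apply a known, deterministic action to every state the agent might be in."""
    return frozenset(result(s, action) for s in belief)


# Hypothetical corridor with cells 0..2; walls stop the agent at either end.
def corridor_result(state: int, action: str) -> int:
    step = {"left": -1, "right": 1}[action]
    return min(2, max(0, state + step))


belief = frozenset({0, 1, 2})  # the agent does not know where it starts
for a in ["right", "right"]:
    belief = update_belief(belief, a, corridor_result)
# After moving right twice, every possible start ends in cell 2,
# so the set of possible states has shrunk to a single state.
```

In this toy world the agent reaches a known state without any sensing, purely because its knowledge of the actions is complete.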
Contingency problem
incomplete knowledge about actions,
needs to gather information at run-time
Exploration problem
World states and effects of actions are
both unknown. Very difficult!
Initial state
World state which the agent believes itself to be in initially
State space
Set of all possible states
Operator
Description of which state is reached by an action from a given state.
Successor function S
S(x) returns the set of states reachable by any action from state x
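The cards on initial state, operators, successor function, goal, and search fit together in a few lines of code. A minimal sketch, assuming a hypothetical state space of the integers up to 20 with two made-up operators (`inc` and `double`); breadth-first search is used here as one possible search strategy, not as the method the cards prescribe:

```python
from collections import deque

# Hypothetical operators on integer states; None means "not applicable".
OPERATORS = {
    "inc": lambda x: x + 1 if x < 20 else None,
    "double": lambda x: x * 2 if x * 2 <= 20 else None,
}


def S(x):
    """Successor function: set of states reachable from x by any action."""
    results = (op(x) for op in OPERATORS.values())
    return {s for s in results if s is not None}


def search(initial, goals):
    """Breadth-first search: return a shortest action sequence, or None."""
    frontier = deque([(initial, [])])
    visited = {initial}
    while frontier:
        state, plan = frontier.popleft()
        if state in goals:  # a goal is a set of desirable world states
            return plan
        for name, op in OPERATORS.items():
            nxt = op(state)
            if nxt is not None and nxt not in visited:
                visited.add(nxt)
                frontier.append((nxt, plan + [name]))
    return None  # no goal state is reachable from the initial state
```

For example, `S(3)` gives `{4, 6}`, and `search(1, {6})` returns a three-step plan that transforms the initial state 1 into the goal state 6.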