Week 8 UAS Flashcards
the agent’s preferences are captured by?
utility function
what is expected utility?
an action given the evidence
The Basic of Utility Theory?
- Orderability
Given any two lotteries, a rational agent must either prefer one to the other or else rate the two as equally preferable. - Transitivity
Given any three lotteries, if an agent prefer A to B and prefer A to C, then the agent must prefer A to C. - Contiunity
if some lottery B is between A and C in preference, then there is some probability p for which the rational agent will be indifferent between getting B for sure and the lottery that yields A with probability p and C with probability 1-p. - Monotonicity
suppose two lotteries have the same two possible outcome A and B. If the agent prefer A to B, the agent must prefer the lottery that has a higher probability for A (and vice versa). - Decomposability
compound lotteries can be reduce to simpler ones using the laws of probability.
The purpose for using utility theory in decision making?
to create a mathematical model to aid the process. It gives the decision maker the ability to quantify the desirability of certain alternatives.
Decision Network?
Decision network represents information about the agent’s current state, its possible actions, the state that will result from the agent’s action, and the utility of that state.
the three types of nodes in Decision Network?
- Chance nodes (ovals) represent random variables, just as they do in Bayesian networks. The agent could be uncertain about the construction cost, the level of air traffic and the potential for litigation, and the Deaths, Noise, and total Cost variables, each of which also depends on the site chosen
- Decision nodes (rectangles) represent points where the decision maker has a choice of actions. In this case, the AirportSite action can take on a different value for each site under consideration. The choice influences the cost, safety, and noise that will result.
- Utility nodes (diamonds) represent the agent’s utility function. 9 The utility node has as parents all variables describing the outcome that directly affect utility. Associated with the utility node is a description of the agent’s utility as a function of the parent attributes
What is probability theory?
Probability theory describes what an agent should believe on the basis of utility theory describes what an agent wants, and decision theory puts the two together to describe what an agent should do.
What is Markov Decision Process?
A sequential decision problem for a fully observable, stochastic environment with a Markovian transition model and additive rewards
Sequential decision problems menggunakan function apa?
additive utility function
What is Value Iteration?
Value iteration Algorithm for calculating an optimal policy. The basic idea is to calculate the utility of each state and then use the state utilities to select an optimal action in each state.
Game Theory can be used in at least two ways?
- Agent design
2. Mechanism design
Markov decision processes (MDP) are defined by?
- transition model
2. reward function
What is game theory?
Game theory describes rational behavior for agents in situations in which multiple agents interact simultaneously.