Chapter 3: Decision Making Flashcards by Dan Tav

What is a knowledge base made out of?

Sentences

How well did you know this?

Not at all

Perfectly

What is the syntax of sentences?

Propositional/Predicate logic

How well did you know this?

Not at all

Perfectly

What is meaning of sentences?

Semantics

How well did you know this?

Not at all

Perfectly

What does M(s) denote

All models where the sentence s is true.

How well did you know this?

Not at all

Perfectly

What does a entails b mean?

This means that if a is true then b is true. In model terms M(b) is a subset of M(a).

How well did you know this?

Not at all

Perfectly

What is inference?

Inference is the reasoning process of deriving logical consequences of premise

How well did you know this?

Not at all

Perfectly

What is an inference algorithm that derives entailed sentances?

Sound or truth preserving

How well did you know this?

Not at all

Perfectly

Why is soundness highest priority?

An unsound inference would make up stuff as it goes to reach it’s conclusion (useless)

How well did you know this?

Not at all

Perfectly

When is an inference algorithm complete?

If it can derive any sentence that is entailed

How well did you know this?

Not at all

Perfectly

When is a sentence valid?

If it is true in all models

How well did you know this?

Not at all

Perfectly

When is a sentence satisfiable?

If it is true in some models

How well did you know this?

Not at all

Perfectly

What is a proof?

A chain of conclusions that lead to a desired goal

How well did you know this?

Not at all

Perfectly

What is Monotonicity?

Set of entailed sentences can only increase if more information is added into knowledge base

How well did you know this?

Not at all

Perfectly

Can Proof by Resolution always find an answer if it exists?

True

How well did you know this?

Not at all

Perfectly

What is Proof by contradicition?

A proof by resolution technique, which adds the opposite of what we are trying to prove into the KB and tries to obtain an empty clause (contradiction).

How well did you know this?

Not at all

Perfectly

What does Reinforcement learning attempt to achieve?

Study These Flashcards

Many animals use rewards such as food in their environment as reward.

Reinforcement learning attempts to mimic this notion by making an agent that knows nothing about their environment achieve it’s goal.

What is an optimal policy?

Study These Flashcards

Actions agent can take that maximizes reward

Methods of obtaining optimal policy

Study These Flashcards

Monte Carlo Method: Full look-ahead (explore all consecutive actions)
Temporal Difference Method: One-step lookahead (Explore only one consecutive action)