Chapter 3: Decision Making Flashcards

1
Q

What is a knowledge base made out of?

A

Sentences

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is the syntax of sentences?

A

Propositional/Predicate logic

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is meaning of sentences?

A

Semantics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What does M(s) denote

A

All models where the sentence s is true.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What does a entails b mean?

A

This means that if a is true then b is true. In model terms M(b) is a subset of M(a).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is inference?

A

Inference is the reasoning process of deriving logical consequences of premise

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is an inference algorithm that derives entailed sentances?

A

Sound or truth preserving

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Why is soundness highest priority?

A

An unsound inference would make up stuff as it goes to reach it’s conclusion (useless)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

When is an inference algorithm complete?

A

If it can derive any sentence that is entailed

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

When is a sentence valid?

A

If it is true in all models

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

When is a sentence satisfiable?

A

If it is true in some models

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is a proof?

A

A chain of conclusions that lead to a desired goal

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is Monotonicity?

A

Set of entailed sentences can only increase if more information is added into knowledge base

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Can Proof by Resolution always find an answer if it exists?

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is Proof by contradicition?

A

A proof by resolution technique, which adds the opposite of what we are trying to prove into the KB and tries to obtain an empty clause (contradiction).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What does Reinforcement learning attempt to achieve?

A

Many animals use rewards such as food in their environment as reward.

Reinforcement learning attempts to mimic this notion by making an agent that knows nothing about their environment achieve it’s goal.

17
Q

What is an optimal policy?

A

Actions agent can take that maximizes reward

18
Q

Methods of obtaining optimal policy

A
  • Monte Carlo Method: Full look-ahead (explore all consecutive actions)
  • Temporal Difference Method: One-step lookahead (Explore only one consecutive action)
19
Q

What is Markov Chain Rule?

A

Probability of reaching next state is only determined by current state

20
Q

What does Markov Decision Process Obey?

A

Markov chain rule

21
Q

What is Q-Learning?

A

Type of temporal difference reinforcement learning algorithm

22
Q

What does Q-Learning enable agents to do?

A

Act optimally in Markovian domains

23
Q

Does Q-Learning need to know it’s environment beforehand?

A

No

24
Q

What is Swarm Intelligence?

A

Multi-agent system inspired by insect colonies such as ants that work together.

25
Q

Does Ant Colony Optimization need to be changed for new problem domains?

A

False, but it could lead to a more optimal solution.

26
Q

Does Ant Colony Optimization need to be used in static environments?

A

False, can be used in scholastic time-varying environments)

27
Q

What can happen if agent always chooses action which maximizes Q-value?

A

Agent may pick non-optimal solution. Sticking to table is exploitation.