Exam Questions Flashcards

1
Q
A

Vergeet dus niet dat je hier twee verschillende initialisaties hebt. Daarom moet je ook het aantal goed beantwoorde vragen toevoegen.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Wat zou je hier moeten doen?

A

Probeer gewoon hier een mogelijk antwoord, itereer door als nodig.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q
A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

How to solve for which ɑ one type of strategies is better then another one?

A

Solve x1, x2 and y1, y2, and then set x1 ≤ y1 and x2 ≤ y2

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

How to solve using iteration?

A

Solve the current system, then fill in the current values in the minimization formula’s. Those are the values to want to minimize/maximize.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q
A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

How to show that a Markov chain is unichain?

A

Claim/show that the Markov chain will eventually become some sort of cycle.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q
A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

How to write down the transitition functions in a DP model?

A

Just write down t(state, action) = new state, do this for every possible action. In the case of probabilities make sure you write the probability afterwards.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

How to do value iteration?

A
  1. Start with the values V0 = (0, 0, ..) (usually, sometimes different starting values are defined.
  2. Calculate Vn(i) = min a in A(i) {c(i, a) + 𝛼∑pij(a)Vn-1(i)} for all i in I.
  3. Write down Rn(i), which is the minimum value

Obv. if you want to maximize change min into max

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

How to do policy iteration?

A
  1. Solve current system (thus with current actions), system xi = min a in A(i) {c(i, a) + ∑pij(a)xj}
  2. Fill in these new x’s in the system and find minimum again (you can put this step in a table, s.t. you can easily see which is best, though putting it in a min {.} statement is also possible.
  3. Stop when you get that the previous x1 is the same as the current x1, etc.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What do we know about the value of g, after value iteration?

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly