Dynamic probabilistic models Flashcards

Question 1

Q

Describe how we view the world when using these models

Answer

A

We view the world in snapshots or time slices

Each time slice contains a set of random variables

Some of these are hidden and some are observable

Question 2

Q

How do we denote the state variables at time t?

Answer

A

These unobservable variables are denoted as:

X_t

Question 3

Q

How do we denote the observable evidence variables at time t?

Answer

A

E_t

where the observation at time t is:

E_t = e_t for some values e_t

Question 4

Q

What is the example we use to understand the situation of rain?

Answer

A

You are a security guard in an underground location.

You want to know whether it is raining today. You cannot see outside, but you can infer it from whether someone arrives carrying an umbrella.

For each day t, the set E_t contains a single evidence variable,

Umbrella_t / U_t

The set X_t contains a single state variable

Rain_t / R_t

We then assume that the time between slices is fixed

and that the evidence starts arriving at t = 1

Question 5

Q

What is the difference between a first-order Markov process and a second-order one?

Answer

A

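In a first-order Markov process the current state depends only on the previous state; in a second-order process it depends on the two previous states (see picture).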

Question 6

Q

Explain the transition model

Answer

A

The transition model specifies the probability distribution over the latest state variables, given the previous values.

Can be written as:

P(X_t | X_0:t-1)

The conditioning set X_0:t-1 grows without bound as t increases, but with the Markov assumption we can write the following for first-order Markov processes:

P(X_t | X_0:t-1) = P(X_t | X_t-1)

Extra info:

For second-order Markov processes it would be:

P(X_t | X_0:t-1) = P(X_t | X_t-2, X_t-1)
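The first-order case can be sketched as a small lookup table. This is only an illustration: the 0.7 / 0.3 values are assumed from the usual textbook version of the umbrella example and are not stated in these cards, and the names `TRANSITION` and `one_step` are made up for the sketch.

```python
# Transition model P(R_t | R_{t-1}) as a nested dict: TRANSITION[previous][current].
# ASSUMPTION: 0.7 / 0.3 are the usual textbook values, not taken from the cards.
TRANSITION = {
    True:  {True: 0.7, False: 0.3},   # it rained yesterday
    False: {True: 0.3, False: 0.7},   # it did not rain yesterday
}


def one_step(previous):
    """Push a distribution over R_{t-1} through the transition model.

    Returns P(R_t) = sum over x of P(R_t | x) * P(R_{t-1} = x).
    Only the previous time slice is needed (first-order Markov assumption).
    """
    return {
        r: sum(TRANSITION[x][r] * previous[x] for x in (True, False))
        for r in (True, False)
    }


p0 = {True: 0.5, False: 0.5}   # assumed uniform prior over R_0
p1 = one_step(p0)              # distribution over R_1 before any evidence
print(p1)
```

A second-order model would need a table indexed by the two previous states instead of one.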

Question 7

Q

Explain the sensor model

Answer

A

See figure

The evidence E could depend on previous variables as well as the current state variable X.

We can make the following sensor Markov assumption:

P(E_t | X_0:t, E_0:t-1) = P(E_t | X_t)

So the sensor model will therefore be:

P(E_t | X_t)

Question 8

Q

Why do the arrows go from X to E?

Answer

A

The arrows go from the actual state of the world to the sensor values because the rain causes the umbrella to appear.

Bonus info:

Inference goes in the other direction: from the observed sensor values back to the state.
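Going from evidence back to state is an application of Bayes' rule to the sensor model. A minimal sketch, assuming sensor values of 0.9 and 0.2 (consistent with the <0.45, 0.1> vector used later in this deck) and a uniform belief about rain; `SENSOR` and `posterior_given_umbrella` are illustrative names.

```python
# Sensor model P(U_t = true | R_t).
# ASSUMPTION: 0.9 (rain) and 0.2 (no rain), the usual textbook values.
SENSOR = {True: 0.9, False: 0.2}


def posterior_given_umbrella(belief):
    """Bayes' rule: P(R | u) is proportional to P(u | R) * P(R)."""
    unnormalized = {r: SENSOR[r] * belief[r] for r in (True, False)}
    total = sum(unnormalized.values())
    return {r: v / total for r, v in unnormalized.items()}


post = posterior_given_umbrella({True: 0.5, False: 0.5})
print(round(post[True], 3), round(post[False], 3))  # 0.818 0.182
```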

Question 9

Q

What are the requirements for the Markov process?

Answer

A

Prior probability distribution

transition model

sensor model
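Together, these three pieces define a probability for any concrete sequence of states and observations: multiply the prior, then one transition term and one sensor term per time slice. A rough sketch for the umbrella world follows; the numeric values are assumed from the standard textbook example and the function name `joint` is hypothetical.

```python
PRIOR = {True: 0.5, False: 0.5}                   # P(R_0), assumed uniform
TRANSITION = {True: {True: 0.7, False: 0.3},      # P(R_t | R_{t-1}), ASSUMED values
              False: {True: 0.3, False: 0.7}}
SENSOR = {True: 0.9, False: 0.2}                  # P(U_t = true | R_t), ASSUMED values


def joint(states, evidence):
    """P(x_0, ..., x_t, e_1, ..., e_t) for concrete state and evidence values.

    `states` has one more entry than `evidence` because it includes x_0.
    """
    p = PRIOR[states[0]]
    for prev, cur, umbrella in zip(states, states[1:], evidence):
        p *= TRANSITION[prev][cur]                # transition term
        p_umbrella = SENSOR[cur]
        p *= p_umbrella if umbrella else 1.0 - p_umbrella   # sensor term
    return p


# Rain on days 0, 1 and 2, with an umbrella seen on days 1 and 2.
print(joint([True, True, True], [True, True]))
```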

Question 10

Q

What are the basic inference tasks for a Markov process / dynamic model?

Answer

A

Filtering

prediction

smoothing

most likely explanation

Question 11

Q

What is filtering?

Answer

A

Compute the belief state, i.e. the posterior distribution of the current state given the evidence so far: P(X_t | e_1:t). For example, P(Rain_t | umbrella_1:t).

In our example this would mean computing the probability of rain today, given all the observations of the umbrella carrier made so far.

Filtering is what a rational agent does to keep track of the current state, so that rational decisions can be made.

Question 12

Q

What is prediction?

Answer

A

This is the task of computing the posterior distribution over a future state, given all the evidence that we have so far.

So we wish to compute P(X_t+k | e_1:t) for some k > 0

In the umbrella example, this might mean computing the probability of rain three days from now, given all evidence to date.

Prediction is useful for evaluating possible courses of action based on their expected outcomes.

Question 13

Q

What is smoothing?

Answer

A

This is the task of computing the posterior distribution over a past state, given all the evidence so far. So we might compute:

P(X_k | e_1:t)

for some k such that

0 <= k <= t.

In other words: computing the probability that it rained on Wednesday, given all the observations of the umbrella carrier made up to today.

Why?

Smoothing provides a better estimate of the state than what was available at that time, because it incorporates more evidence.

Question 14

Q

What is most likely explanation?

Answer

A

Given a sequence of observations, we might wish to find the sequence of states that is most likely to have generated those observations. That is, we wish to compute

argmax_{x_1:t} P(x_1:t | e_1:t)

For example, if the umbrella appears on each of the first three days and is not there the fourth, then the most likely explanation is that it rained on the three days and not on the fourth.
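The usual way to compute this argmax is the Viterbi algorithm, which the cards do not name. The sketch below applies it to the four-day umbrella sequence described above. The model numbers are assumed from the standard textbook example, and all identifiers are illustrative.

```python
PRIOR = {True: 0.5, False: 0.5}                   # P(R_0), assumed uniform
TRANSITION = {True: {True: 0.7, False: 0.3},      # P(R_t | R_{t-1}), ASSUMED values
              False: {True: 0.3, False: 0.7}}
SENSOR = {True: 0.9, False: 0.2}                  # P(U_t = true | R_t), ASSUMED values
STATES = (True, False)


def likelihood(umbrella, rain):
    """P(U_t = umbrella | R_t = rain)."""
    p = SENSOR[rain]
    return p if umbrella else 1.0 - p


def most_likely_sequence(evidence):
    """Viterbi: most probable rain sequence for days 1..t given the umbrella sequence."""
    # m[r] is the probability of the best path that ends in state r on the current day.
    m = {r: likelihood(evidence[0], r) * sum(TRANSITION[x][r] * PRIOR[x] for x in STATES)
         for r in STATES}
    back_pointers = []
    for e in evidence[1:]:
        # For each state today, remember which state yesterday gives the best path.
        best_prev = {r: max(STATES, key=lambda x: m[x] * TRANSITION[x][r]) for r in STATES}
        m = {r: likelihood(e, r) * m[best_prev[r]] * TRANSITION[best_prev[r]][r]
             for r in STATES}
        back_pointers.append(best_prev)
    # Follow the back pointers from the best final state.
    state = max(STATES, key=lambda r: m[r])
    path = [state]
    for pointers in reversed(back_pointers):
        state = pointers[state]
        path.append(state)
    path.reverse()
    return path


print(most_likely_sequence([True, True, True, False]))  # [True, True, True, False]
```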

Bonus info:

This is also useful in the field of speech recognition.

Question 15

Q

How do we perform filtering?

Answer

A

See figure

https://gyazo.com/65025743759cbf9429cd9462db282e68

Get the prior probability

https://gyazo.com/1f225f8079f051f1f0917f4cdcb54bc7

Predict R₁

https://gyazo.com/14b249e5bcc573c5e250ea4485aab559

Update by multiplying this with the probability of evidence and normalise

https://gyazo.com/19e8c6a4e2d30f3bb029a27390b6c5e7

We normalize <0.45, 0.1> by dividing each entry by the sum of the entries:

<0.45 / (0.45 + 0.1), 0.1 / (0.45 + 0.1)> ≈ <0.818, 0.182>

For day two, we use the day-one update in place of the prior probability.

https://gyazo.com/a1fdc4441bbf996e8ab37e18969732af
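The predict, update and normalize steps above can be sketched in a few lines. This is an illustration under assumptions: the transition values 0.7 / 0.3 and sensor values 0.9 / 0.2 come from the standard textbook example (the sensor values match the <0.45, 0.1> vector above), the umbrella is assumed to be seen on both days, and the function names are made up.

```python
PRIOR = {True: 0.5, False: 0.5}                   # P(R_0)
TRANSITION = {True: {True: 0.7, False: 0.3},      # P(R_t | R_{t-1}), ASSUMED values
              False: {True: 0.3, False: 0.7}}
SENSOR = {True: 0.9, False: 0.2}                  # P(U_t = true | R_t), ASSUMED values
STATES = (True, False)


def normalize(dist):
    total = sum(dist.values())
    return {k: v / total for k, v in dist.items()}


def forward(belief, umbrella):
    """One filtering step: predict with the transition model, then update with the evidence."""
    predicted = {r: sum(TRANSITION[x][r] * belief[x] for x in STATES) for r in STATES}
    weights = {r: SENSOR[r] if umbrella else 1.0 - SENSOR[r] for r in STATES}
    return normalize({r: weights[r] * predicted[r] for r in STATES})


belief = PRIOR
history = []
for day, umbrella in enumerate([True, True], start=1):   # umbrella seen on days 1 and 2
    belief = forward(belief, umbrella)                    # last update replaces the prior
    history.append(belief)
    print(day, round(belief[True], 3), round(belief[False], 3))
# 1 0.818 0.182
# 2 0.883 0.117
```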

Question 16

Q

How do we do prediction?

Answer

A

The task of prediction can be seen as filtering without the addition of new evidence.

https://gyazo.com/6c6a75dd423882ee2e2ca310371b1738

Bonus info: when k becomes very large (looking far into the future), the predicted distribution for rain converges back to <0.5, 0.5>.
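A short sketch of prediction as repeated application of the transition model with no evidence update. It assumes the textbook transition values 0.7 / 0.3 (not stated in these cards) and starts from the day-one filtered estimate <0.818, 0.182>; `predict` is an illustrative name.

```python
TRANSITION = {True: {True: 0.7, False: 0.3},      # P(R_t | R_{t-1}), ASSUMED values
              False: {True: 0.3, False: 0.7}}
STATES = (True, False)


def predict(belief, k):
    """Return P(R_{t+k} | e_1:t) by applying the transition model k times."""
    for _ in range(k):
        belief = {r: sum(TRANSITION[x][r] * belief[x] for x in STATES) for r in STATES}
    return belief


filtered = {True: 0.818, False: 0.182}            # day-one filtered estimate
for k in (1, 2, 5, 20):
    print(k, round(predict(filtered, k)[True], 3))
```

With these assumed values the distance from 0.5 shrinks by a factor of 0.4 at every step, which is why the prediction flattens out quickly.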

Question 17

Q

What are the three components for filtering?

Answer

A

https://gyazo.com/75c4febb3d73e60694d253fb8df65860

Sensor model

transition model

last update
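Written out as a recursion, the three components appear as the three factors below. This is the standard forward (filtering) equation, with \alpha standing for the normalizing constant; the labels under the braces are added here for orientation.

```latex
\mathbf{P}(X_{t+1} \mid e_{1:t+1})
  = \alpha \,
    \underbrace{\mathbf{P}(e_{t+1} \mid X_{t+1})}_{\text{sensor model}}
    \sum_{x_t}
    \underbrace{\mathbf{P}(X_{t+1} \mid x_t)}_{\text{transition model}} \,
    \underbrace{P(x_t \mid e_{1:t})}_{\text{last update}}
```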

Question 18

Q

What are the components of the update function?

Answer

A

Transition model

Last update

Question 19

Q

What are the components of the smoothing function?

Answer

A

Backward message

Forward message

https://gyazo.com/e03237a9d30d6297c1915d5a96f41c01
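In symbols, the smoothed estimate is the normalized pointwise product of the two messages, and the backward message has its own recursion. Here \mathbf{f}_{1:k} = \mathbf{P}(X_k \mid e_{1:k}) is the forward (filtering) message and \mathbf{b}_{k+1:t} = \mathbf{P}(e_{k+1:t} \mid X_k) is the backward message; this is the standard formulation rather than a transcription of the linked image.

```latex
\mathbf{P}(X_k \mid e_{1:t}) = \alpha \, \mathbf{f}_{1:k} \times \mathbf{b}_{k+1:t}

\mathbf{b}_{k+1:t}
  = \sum_{x_{k+1}} P(e_{k+1} \mid x_{k+1}) \,
    P(e_{k+2:t} \mid x_{k+1}) \,
    \mathbf{P}(x_{k+1} \mid X_k)
```

The recursion starts from a backward message of all ones, because the evidence sequence e_{t+1:t} is empty.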

Question 20

Q

How do we perform smoothing?

Answer

A

Recursive process: follow the examples here:

Step 1:

https://gyazo.com/3d045798aa7dd7c1253dd1f915c55d7c

Step 2: initial backward message

https://gyazo.com/b155cadbfcc0287465d89bfb51f4a7aa

Step 3: calculate the smoothed probability from the forward and backward messages

https://gyazo.com/9ac4f440c90a1c8e6d6e3714e98378ef

Step 4: calculate the backward message b_2:2

https://gyazo.com/0855da7a239fc43580d298b0cbc2a59c

Step 5 (last): calculate probability for R₁

https://gyazo.com/b57baf95a2ebb1a008cd0c8e5c4ada22
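The last two steps can be checked numerically. The sketch below smooths R₁ given umbrellas on days 1 and 2, assuming the standard textbook model values (transition 0.7 / 0.3, sensor 0.9 / 0.2) and reusing the day-one forward message <0.45, 0.1> before normalization; the function names are illustrative.

```python
TRANSITION = {True: {True: 0.7, False: 0.3},      # P(R_t | R_{t-1}), ASSUMED values
              False: {True: 0.3, False: 0.7}}
SENSOR = {True: 0.9, False: 0.2}                  # P(U_t = true | R_t), ASSUMED values
STATES = (True, False)


def normalize(dist):
    total = sum(dist.values())
    return {k: v / total for k, v in dist.items()}


def backward(b_next, umbrella):
    """b_{k+1:t}(x_k) = sum over x_{k+1} of
    P(e_{k+1} | x_{k+1}) * b_{k+2:t}(x_{k+1}) * P(x_{k+1} | x_k)."""
    weights = {r: SENSOR[r] if umbrella else 1.0 - SENSOR[r] for r in STATES}
    return {x: sum(weights[nxt] * b_next[nxt] * TRANSITION[x][nxt] for nxt in STATES)
            for x in STATES}


f_1 = normalize({True: 0.45, False: 0.1})         # forward message for day 1
b_init = {True: 1.0, False: 1.0}                  # initial backward message (no later evidence)
b_2 = backward(b_init, True)                      # b_2:2, using the umbrella seen on day 2
smoothed_r1 = normalize({r: f_1[r] * b_2[r] for r in STATES})

print(b_2)                                        # roughly {True: 0.69, False: 0.41}
print(round(smoothed_r1[True], 3), round(smoothed_r1[False], 3))  # 0.883 0.117
```

The smoothed value for day 1 (0.883) is higher than the filtered value (0.818) because the second umbrella sighting is additional evidence for rain on day 1.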

Question 21

Q

What are some extra observations about smoothing?

Answer

A

The forward and backward recursions each take constant time per step, so smoothing for a single k takes O(t) time. Repeating that for every k would take O(t²).

How can we make the algorithm run in O(t) time for all k? Cache the results of the forward pass before starting the backward pass.

The algorithm is also called the forward-backward algorithm.
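A compact sketch of the cached version: one forward pass stores every filtered message, then one backward pass combines each stored message with the running backward message. The model values are assumed from the standard textbook umbrella example and the identifiers are illustrative.

```python
PRIOR = {True: 0.5, False: 0.5}                   # P(R_0)
TRANSITION = {True: {True: 0.7, False: 0.3},      # P(R_t | R_{t-1}), ASSUMED values
              False: {True: 0.3, False: 0.7}}
SENSOR = {True: 0.9, False: 0.2}                  # P(U_t = true | R_t), ASSUMED values
STATES = (True, False)


def normalize(dist):
    total = sum(dist.values())
    return {k: v / total for k, v in dist.items()}


def weights_for(umbrella):
    return {r: SENSOR[r] if umbrella else 1.0 - SENSOR[r] for r in STATES}


def forward(belief, umbrella):
    w = weights_for(umbrella)
    return normalize({r: w[r] * sum(TRANSITION[x][r] * belief[x] for x in STATES)
                      for r in STATES})


def backward(b_next, umbrella):
    w = weights_for(umbrella)
    return {x: sum(w[n] * b_next[n] * TRANSITION[x][n] for n in STATES) for x in STATES}


def forward_backward(evidence):
    """Smoothed estimates for every day 1..t in O(t) total time."""
    cached, belief = [], PRIOR
    for e in evidence:                            # forward pass, caching each message
        belief = forward(belief, e)
        cached.append(belief)
    b = {r: 1.0 for r in STATES}                  # initial backward message
    smoothed = [None] * len(evidence)
    for k in range(len(evidence) - 1, -1, -1):    # backward pass
        smoothed[k] = normalize({r: cached[k][r] * b[r] for r in STATES})
        b = backward(b, evidence[k])              # message passed on to the previous day
    return smoothed


result = forward_backward([True, True])
print([round(day[True], 3) for day in result])    # [0.883, 0.883]
```

The cost of the cache is memory: storing all t forward messages takes O(t) space.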