lecture 8 Flashcards

1
Q

What are examples of data that naturally form a sequence?

A

Language, music, stock prices.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are the two main types of sequential datasets?

A

Numeric sequences and symbolic sequences.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is an example of a 1D numeric sequence?

A

Stock prices over time.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is an example of a multidimensional numeric sequence?

A

Closing index values of AEX and FTSE100 over time.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is an example of a 1D symbolic sequence?

A

A sequence of words in a sentence.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is an example of a multidimensional symbolic sequence?

A

A sentence where each word has multiple tags (e.g., part-of-speech tagging).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is the difference between a single sequence and a set-of-sequences?

A

A single sequence is continuous, while a set-of-sequences consists of independent sequences.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is an example of a sequence classification problem?

A

Spam detection based on email content.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is an example of a sequence prediction problem?

A

Predicting future stock prices based on past trends.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

How can sequential data be transformed into a standard regression problem?

A

By representing each point using a fixed number of preceding values.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is walk-forward validation?

A

A technique for evaluating models on time-ordered data without violating temporal structure.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is the main advantage of walk-forward validation?

A

It simulates real-world scenarios where new data arrives sequentially.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is a Markov model?

A

A probabilistic model that estimates the probability of small subsequences from observed data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

How does a Markov model differ from Naive Bayes?

A

Markov models consider dependencies between elements in a sequence.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is the chain rule of probability?

A

p(W4, W3, W2, W1) = p(W4 | W3, W2, W1) * p(W3 | W2, W1) * p(W2 | W1) * p(W1).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is a key assumption in Markov models?

A

The probability of the next state depends only on the current state, not the entire history.

17
Q

What is the difference between a first-order and second-order Markov model?

A

A first-order model depends only on the previous state, while a second-order model depends on the two previous states.

18
Q

What is a practical application of Markov models?

A

Speech recognition and language modeling.

19
Q

How can probabilities be estimated in Markov models?

A

Using relative frequencies from observed sequences.

20
Q

What is a challenge of using raw frequency counts for probability estimation?

A

Some sequences may never appear in the training data, leading to zero probabilities.

21
Q

What technique is used to handle unseen sequences in Markov models?

A

Smoothing techniques such as Laplace smoothing.

22
Q

What is Laplace smoothing?

A

A method that adds a small constant to all frequency counts to prevent zero probabilities.

23
Q

What is a Hidden Markov Model (HMM)?

A

An extension of Markov models where states are hidden and only observed indirectly.

24
Q

What is an example of an HMM application?

A

Part-of-speech tagging in natural language processing.

25
Q

What are the two main components of an HMM?

A

The transition probabilities between states and the emission probabilities of observations.

26
Q

What is the Viterbi algorithm used for?

A

Finding the most likely sequence of hidden states in an HMM.

27
Q

What is the key advantage of Markov models in machine learning?

A

They efficiently model dependencies in sequential data.

28
Q

What is the main limitation of Markov models?

A

They assume limited memory, meaning only recent history is considered.

29
Q

What is the takeaway from Markov models?

A

They provide a simple yet powerful way to model sequential data in various domains.