L10: Topic Modelling Flashcards

1
Q

Topic modelling objective: a tool for organization of information
TRUE/FALSE

A

TRUE

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Which of the following are steps for topic modelling?
A) Discover the thematic structure: which themes do the documents belong to?
B) Annotate the documents according to themes
C) Use the annotations to organize, summarize, search and form predictions

A

ALL ARE CORRECT
A) Discover the thematic structure: which themes do the documents belong to?
B) Annotate the documents according to themes
C) Use the annotations to organize, summarize, search and form predictions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Topic modelling provides methods for automatically organizing, understanding, searching, and summarizing large electronic archive
TRUE/FALSE

A

TRUE

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Topic models helps determine the probability that each document is associated with a given theme or topic.

TRUE/FALSE

A

TRUE

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Latent Dirichlet Allocation (LDA) is a probabilistic model used in topic modeling to discover underlying topics within a collection of documents; it assumes that each document is a mixture of topics, and each topic is a mixture of words, providing insights into the thematic structure of the text corpus.

TRUE/FALSE

A

TRUE
The output of LDA: it produces the probability that each document within the corpus is associated with each of the k topics specified by the user

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Structural Topic Modelling (STM) is very similar to LDA, but it employs meta data on top (data that provides information about other data, e.g., characteristics, properties) about documents

TRUE/FALSE

A

TRUE

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Name of the author and date in which the document was produced are examples of?

A

Metadata used in Structural Topic Modelling (STM)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is the utility of stLDA-C?

A

stLDA-C is useful for topic modelling for short texts where LDA usually performs poorly

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is the primary goal of topic modelling?

A

Identifying hidden thematic structure

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

In topic modelling, what does a “topic” represent?

A

A cluster of documents

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is the purpose of the term “bag-of-words” in topic modelling?

A

It ignores the order of words and considers only their frequency.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is the purpose of using a term-document matrix in topic modeling?

A

It represents the relationship between frequency of given terms/ words and documents

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is the primary objective of Structural Topic Modeling (STM)?

A

Analysing the relationship between topics and document metadata

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Understanding the emotional sentiment expressed in text can be facilitated by _____?

A

Sentiment analysis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly