Week 1 Flashcards

1
Q

What does a computer have to do in order to understand a natural language sentence?

A
  • Lexical analysis (POS tagging)
  • Syntactic analysis: For structure of the sentence
  • Semantic analysis: Understanding meaning of words using some kind of digital representation
  • Inference: Extra knowledge inferred from the original text
  • Pragmatic analysis: All text has a reason to be. All human generated text has some objective that can be analyzed
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is ambiguity?

A

The quality of being open to more than one interpretation; inexactness

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Why is natural language processing (NLP) difficult for computers?

A

Computer don’t have knowledge bases as humans do. Natural language has not been designed for computers. Natural language omits information as humans assume somethings are not necessary to say explicitly.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is bag-of-words representation? Why do modern search engines use this simple representation of text?

A

Is representing a document as an unordered set of words. They use this method because it’s often sufficient for most search tasks.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What are the two modes of text information access? Which mode does a web search engine such as Google support?

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What are the two modes of text information access? Which mode does a web search engine such as Google support?

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

When is browsing more useful than querying to help a user find relevant information?

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Why is a text retrieval task defined as a ranking task?

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is a retrieval model?

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What are the two assumptions made by the Probability Ranking Principle?

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is the Vector Space Retrieval Model? How does it work?

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

How do we define the dimensions of the Vector Space Model? What does “bag of words” representation mean?

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What does the retrieval function intuitively capture when we instantiate a vector space model with bag of words representation and bit representation for documents and queries?

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

“Bag of words” representation

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Push, pull, querying, browsing

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Probability ranking principle

A
17
Q

Relevance

A
18
Q

Vector space model

A
19
Q

Dot product

A
20
Q

Bag of words representation

A
21
Q

Bit vector representation

A