Week 1 Flashcards

1
Q

What is text retrieval

A

Having a collection of text documents.
Sub form of information retrieval.
Imperial problem evaluated by users.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What do search engines return

A

Relevant documents picked up by the TR systems to users

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Text retrieval vs SQL retrieval

A

Free text vs structured data
Ambiguous vs rigorous semantics
Retrieve relevant docs vs matched records

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is vocabulary

A

V = { w1, w2 } - All words in the doc collection

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is query

A

Q

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is document

A

Di

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is collection

A

C = {d1, d2} - list of documents

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is word count

A

C(w, d) - counting frequency of word w in d

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is set of relevant documents

A

R(q) subset of C

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is TR task

A

R’(q) -> R(q)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

List two TR strategies

A

Document ranking and document selection

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is document selection

A

R’(q) = { d E C |f(d,q) = 1 } where f(d,q) E {0,1}

Chosen or not

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is document ranking

A

R’(q) = { d E C |f(d,q) > 0 } where f(d,q) E R - R is relevance measure function

How well did you know this?
1
Not at all
2
3
4
5
Perfectly