4. Information Retrieval & Relational Databases Flashcards

1
Q

What is Information Retrieval?

A

Finding documents that are about a given topic

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is a corpus?

A

An organised repository, or collection, of data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is relevance?

A

Whether a retrieved document is actually about the requested topic

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is precision?

A

The probability that a document is relevant given that it is retrieved
i.e Number of relevant documents retrieved / total number of documents retrieved

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is recall?

A

The probability that a document is retrieved given that is is relevant
i.e Number of relevant documents retrieved / total number of relevant documents

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

How would we compare IR algorithms? (2)

A
  • Average precision, which attempts to combine trade-off between precision and recall into a single value
  • Effectiveness measure, which combines precision and recall into a single value
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is Average Precision? (2)

A
  • Reflects the recognition that precision varies, expressing the variation as a graph (curve) of precision vs. recall
  • Attempts to summarize curve as single value for comparison
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is Effectiveness Measure algorithm?

A

1 - 1 / a(1/P) + (1-a)(1/R)
a: alpha, 0=precision isn’t important, 1=recall isn’t important
P: Precision
R: Recall

How well did you know this?
1
Not at all
2
3
4
5
Perfectly