Unit 3 - Evaluation And Visualization Of IR Flashcards
What is performance evaluation?
The measure of how relevant the retrieved documents are to the user's query.
Categories of results obtained from an information retrieval system:
Relevant and retrieved (A)
Relevant and not retrieved (B)
Non-relevant and retrieved (C)
Non-relevant and not retrieved (D)
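The four categories can be sketched with set operations; a minimal example, assuming documents are identified by simple string IDs (the IDs here are hypothetical):

```python
# Hypothetical example collections
relevant = {"d1", "d2", "d3"}            # documents actually useful to the user
retrieved = {"d2", "d3", "d4"}           # documents returned by the system
all_docs = {"d1", "d2", "d3", "d4", "d5"}

A = relevant & retrieved                 # relevant and retrieved
B = relevant - retrieved                 # relevant and not retrieved
C = retrieved - relevant                 # non-relevant and retrieved
D = all_docs - relevant - retrieved      # non-relevant and not retrieved
```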
What are relevant items?
Documents that are actually useful to the user.
What is precision?
Ratio of the number of relevant and retrieved documents to the total number of retrieved documents.
P = (A / (A + C))x100%
What is recall?
Ratio of the number of relevant and retrieved documents to the total number of relevant documents in the database.
R = (A /(A + B))x100%
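The two formulas above can be sketched directly in code, with A, B, and C as the counts from the result categories (the counts used in the example are hypothetical):

```python
def precision(A, C):
    """Relevant-and-retrieved / all retrieved, as a percentage."""
    return A / (A + C) * 100

def recall(A, B):
    """Relevant-and-retrieved / all relevant in the database, as a percentage."""
    return A / (A + B) * 100

# e.g. A = 30 relevant retrieved, C = 20 non-relevant retrieved,
#      B = 10 relevant documents that were missed
print(precision(30, 20))  # 60.0
print(recall(30, 10))     # 75.0
```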
Problem with precision and recall
They are inversely related: tuning a system to improve one typically lowers the other.
They require detailed knowledge of all the relevant items in the database.
Mean reciprocal rank (MRR)
It is a rank-aware evaluation metric, i.e. it considers both the relevance and the rank of the documents.
It is a binary-relevance-based metric: each document is either relevant or not.
Algorithm of MRR:
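A minimal sketch of the MRR computation, assuming for each query we have the system's ranked result list and the set of relevant documents (the function name and example data are illustrative, not from the source):

```python
def mean_reciprocal_rank(ranked_results, relevant_sets):
    """MRR over a set of queries: average of 1/rank of the first
    relevant document per query (0 if none is retrieved)."""
    total = 0.0
    for ranking, relevant in zip(ranked_results, relevant_sets):
        for rank, doc in enumerate(ranking, start=1):
            if doc in relevant:
                total += 1.0 / rank
                break
    return total / len(ranked_results)

# Query 1: first relevant doc at rank 1; query 2: at rank 3
rankings = [["d1", "d2"], ["d4", "d5", "d6"]]
relevant = [{"d1"}, {"d6"}]
print(mean_reciprocal_rank(rankings, relevant))  # (1 + 1/3) / 2 ≈ 0.667
```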
What is F-measure?
A measure that combines recall and precision.
It is the harmonic mean of precision and recall.
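The harmonic mean can be sketched as follows (the guard against zero denominators is an added assumption, not from the source):

```python
def f_measure(precision, recall):
    """Harmonic mean of precision and recall (the F1 score)."""
    if precision + recall == 0:
        return 0.0  # both zero: define F as 0 to avoid division by zero
    return 2 * precision * recall / (precision + recall)

print(f_measure(0.6, 0.75))  # ≈ 0.667
```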
Normalised discounted cumulative gain (NDCG)
It is a measure of ranking quality.
Assumption that needs to be kept in mind for NDCG
Highly relevant documents are more useful than moderately relevant documents, which are in turn more useful than irrelevant documents.
To find NDCG, calculate
Cumulative gain
Discounted cumulative gain
Ideal discounted cumulative gain
Normalised discounted cumulative gain
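The steps above can be sketched in code; this assumes graded relevance scores in retrieved order and the common log2(rank + 1) discount (the example scores are hypothetical):

```python
import math

def dcg(relevances):
    """Discounted cumulative gain: each gain divided by log2(rank + 1),
    summed over the ranking (with cumulative gain as the undiscounted sum)."""
    return sum(rel / math.log2(rank + 1)
               for rank, rel in enumerate(relevances, start=1))

def ndcg(relevances):
    """DCG of the actual ranking normalised by the ideal DCG,
    i.e. the DCG of the same scores sorted best-first."""
    ideal = dcg(sorted(relevances, reverse=True))
    return dcg(relevances) / ideal if ideal > 0 else 0.0

# Graded relevance of the top 5 retrieved documents, in retrieved order
rels = [3, 2, 3, 0, 1]
print(round(ndcg(rels), 3))  # ≈ 0.972
```

An already-ideal ranking gives NDCG = 1.0, so the metric lies in [0, 1].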