08 Evaluation Flashcards
why do we need to evaluate
- economic reasons
- how effective is the solution
- scientific progress
- is the method better than competitors'
- verification
- verify performance
what do we need to evaluate
- efficiency - how fast is it
- coverage - how many pages are indexed
- presentation - how much effort is required of the user
- effectiveness - how correct are the results
what is the IR experimental set up
maintain a test collection of documents, queries, and relevance assessments (the ground truth)
- measures of performance, e.g. precision and recall
- systems to compare, e.g. TF vs TF-IDF ranking
- experimental design
what are the assumptions for the evaluation
system provides a ranked list after searching the query
- a better system will provide a better ranked list
- a better ranked list generally satisfies the users
what is precision
retrieved docs that are relevant / all retrieved docs
what is recall
retrieved docs that are relevant / all relevant docs
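The two definitions above can be sketched directly as set operations. A minimal Python sketch (the document IDs below are made-up examples, not from any real collection):

```python
def precision(retrieved, relevant):
    """Fraction of retrieved documents that are relevant."""
    retrieved, relevant = set(retrieved), set(relevant)
    return len(retrieved & relevant) / len(retrieved)

def recall(retrieved, relevant):
    """Fraction of all relevant documents that were retrieved."""
    retrieved, relevant = set(retrieved), set(relevant)
    return len(retrieved & relevant) / len(relevant)

retrieved = ["d1", "d2", "d3", "d4"]   # hypothetical result set
relevant = ["d1", "d3", "d5"]          # hypothetical ground truth
print(precision(retrieved, relevant))  # 2/4 = 0.5
print(recall(retrieved, relevant))     # 2/3 ≈ 0.667
```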
ranking effectiveness
- how many to rank? eg. top 1, 3, 5?
- at a fixed rank R, higher precision implies higher recall (both count the same relevant documents in the top R)
what are the 3 methods of summarising ranking
- calculate recall and precision at fixed rank positions
- calculate precision at standard recall levels from 0.0 to 1.0 (requires interpolation)
- average precision - averaging the precision values from the rank positions where a relevant document was retrieved
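The third method (average precision) can be sketched as follows; the ranking and relevance judgments are made-up examples:

```python
def average_precision(ranking, relevant):
    """Average the precision values at the rank positions where
    a relevant document appears; unretrieved relevant documents
    contribute a precision of zero."""
    relevant = set(relevant)
    hits, precision_sum = 0, 0.0
    for rank, doc in enumerate(ranking, start=1):
        if doc in relevant:
            hits += 1
            precision_sum += hits / rank
    return precision_sum / len(relevant)

ranking = ["d3", "d7", "d1", "d9", "d5"]  # hypothetical ranked list
relevant = ["d1", "d3", "d5"]             # hypothetical ground truth
# relevant docs at ranks 1, 3, 5 -> precisions 1/1, 2/3, 3/5
print(average_precision(ranking, relevant))  # (1 + 2/3 + 3/5) / 3 ≈ 0.756
```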
what is mean average precision (MAP)
summarise rankings from multiple queries by taking the mean of the per-query average precision values
- assume user is interested in finding many relevant documents for each query
- requires many relevance judgments in test collection
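MAP is just the mean of the per-query average precision values. A minimal sketch, with two hypothetical queries as input:

```python
def mean_average_precision(results):
    """MAP: mean of per-query average precision.
    `results` is a list of (ranking, relevant_docs) pairs."""
    average_precisions = []
    for ranking, relevant in results:
        relevant = set(relevant)
        hits, precision_sum = 0, 0.0
        for rank, doc in enumerate(ranking, start=1):
            if doc in relevant:
                hits += 1
                precision_sum += hits / rank
        average_precisions.append(precision_sum / len(relevant))
    return sum(average_precisions) / len(average_precisions)

# two made-up queries: one perfect ranking, one with the single
# relevant document at rank 2
results = [(["d1", "d2"], ["d1"]), (["d2", "d1"], ["d1"])]
print(mean_average_precision(results))  # (1.0 + 0.5) / 2 = 0.75
```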
recall precision graphs
- raw recall-precision points vary too much across queries to show a clear pattern; the curves must be interpolated and averaged before systems can be compared
interpolation
defines precision at any standard recall level R as the maximum precision observed in any recall-precision point at that or a higher recall level
- turns the curve into a step function
- average curves are drawn by joining the average precision values at the standard recall levels
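The interpolation rule above (take the maximum precision at any equal-or-higher recall) can be sketched like this; the recall-precision points are made-up examples:

```python
def interpolated_precision(points, recall_level):
    """Interpolated P(R): the maximum precision observed at any
    recall level >= R, giving a non-increasing step function."""
    return max((p for r, p in points if r >= recall_level), default=0.0)

# hypothetical (recall, precision) points from one query's ranking
points = [(0.2, 1.0), (0.4, 0.67), (0.6, 0.5), (0.8, 0.44), (1.0, 0.5)]
levels = [i / 10 for i in range(11)]  # standard recall levels 0.0 .. 1.0
curve = [interpolated_precision(points, lv) for lv in levels]
print(curve[0])   # at recall 0.0: max of all precisions = 1.0
print(curve[9])   # at recall 0.9: only the (1.0, 0.5) point counts = 0.5
```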
…