Information Retrieval Flashcards
Information Retrieval Systems:
Overview
- Simpler model than RDBMS
- Info organized as a collection of documents
- Documents are unstructured, generally with no schema
- Locate relevant documents based on user input such as keywords
- Can be used on textual descriptions that are provided with non-textual data
- Example: Images with descriptions
- Web Search engines are the most popular form of IR systems
Information Retrieval Systems:
Differences with RDBMS
- IR Systems organize information into unstructured documents
- Whereas database systems deal with structured data
- IR Systems don’t deal with transactional updates
- IR Systems deal with some querying issues not generally addressed by database systems
- Approximate searching by keywords
- Ranking retrieved answers
- by estimating degree of relevance
Information Retrieval Systems:
Querying issues that
database systems do not deal with
- Approximate Searching by keywords
- Ranking of retrieved answers by estimating degree of relevance
IR Systems:
Terminology
- Full Text Retrieval
- Keywords
- Keyword Queries
- Term
- Intent
- Query
- Result
Keyword Search
- In Full Text Retrieval, all the words in each document are considered to be keywords
- The word “term” refers to the words in a document
- IR Systems typically allow query expressions
- Formed using keywords and logical connectives
- and, or, not
- Ranking of documents on the basis of estimated relevance to a query is critical
IR Systems:
Returned Results
- In relational databases, there is generally no ranking on returned results
- In contrast, when dealing with Keyword Queries, many documents may match the query
- A Google search can return millions of results
- Results must be ranked so that the most relevant tuples are the first thing a user sees
- Need to determine what the user is looking for
Keyword Queries:
Analysis Components
- Intent
- What the user is actually looking for
- Also referred to as information need
- Query
- What the user actually submits to express their intent
- Result
- The documents that match the query and are ranked to match the intent as much as possible
Static Ranking
- Methods of ranking where the ranking does not change over time, unless the underlying data changes
- Good start to ranking based on entered keywords
- Popular Methods:
- TF-IDF
- BM25
- PageRank
- HITS
Static Ranking Methods:
TF-IDF
Basic Idea
Term Frequency - Inverse Document Frequency
- Used to determine how relevant a document is given keywords
- The more frequent a given keyword is in a document, the higher the ranking
- Simple ratio of how many times the keyword appears over how many total terms in document
- Uses a logarithmic scale so that very frequent terms do not become too “important”
- Was used for a while in early web searching
- Yahoo, MSN, AOL, lots of search engines
Static Ranking Methods:
TF-IDF
Basic Formula
Term Frequency - Inverse Document Frequency
TF(d,t) = log(1 + n(d,t) / n(d))
- n(d) is the number of terms in document d
- n(d,t) is the number of times term t occurs in document d
Static Ranking Methods:
TF-IDF:
Calculating Actual Relevance
relevance(d,Q) = SUM_{t in Q} [ TF(d,t) / n(t) ]
n(t) is the number of documents containing the term t
Dividing by n(t) lowers the score of very frequent terms,
making the rarer terms in the query more important
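A minimal sketch of the two formulas above in Python, assuming a toy corpus represented as lists of lowercase tokens (the function and variable names are illustrative, not from the slides):
```python
import math

def tf(doc_tokens, term):
    # TF(d,t) = log(1 + n(d,t) / n(d))
    n_d = len(doc_tokens)                 # number of terms in the document
    n_dt = doc_tokens.count(term)         # occurrences of the term in the document
    return math.log(1 + n_dt / n_d)

def relevance(doc_tokens, query_terms, corpus):
    # relevance(d,Q) = sum over t in Q of TF(d,t) / n(t)
    score = 0.0
    for t in query_terms:
        n_t = sum(1 for d in corpus if t in d)   # documents containing t
        if n_t:
            score += tf(doc_tokens, t) / n_t
    return score

corpus = [["hello", "world"], ["hello", "ben"], ["world", "of", "databases"]]
print(relevance(corpus[0], ["hello", "world"], corpus))
```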
Ranking:
Why is it important to rank returned results?
- Users do not want to sort through potentially millions of answers
- Most Google users never go to the second page
- The percentage of users visiting beyond the second page (3rd, 4th, etc.) decreases rapidly with the number of pages
- More likely to rewrite queries instead of searching through second page and beyond
- Users are short on time, have little domain knowledge and don’t want to scroll forever
- Ranking is a method to decode the user’s intent from their queries
Static Ranking Methods:
TF-IDF:
Other methods to expand TF-IDF
- Add more weight to important areas of document
- Beginning, title, sections
- Reduce weight of certain terms
- Words occurring late in the document
- What else did we already cover that should be reduced?
- Proximity
- Terms close together are weighted higher
- Use N-grams
Static Ranking Methods:
BM25
Basics
- More effective compared to TF-IDF
- Uses constants to scale certain values
- Considers the frequency of a term in the query
- Used for massive text documents
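A minimal sketch of the common Okapi BM25 scoring form with the usual k1 and b constants; the exact variant and constant values used in practice vary, so treat this as illustrative rather than a definitive implementation:
```python
import math

def bm25_score(query_terms, doc_tokens, corpus, k1=1.5, b=0.75):
    # Okapi BM25: sum over query terms of
    #   IDF(t) * f(t,d) * (k1 + 1) / (f(t,d) + k1 * (1 - b + b * |d| / avgdl))
    N = len(corpus)
    avgdl = sum(len(d) for d in corpus) / N       # average document length
    score = 0.0
    for t in query_terms:
        n_t = sum(1 for d in corpus if t in d)    # documents containing t
        idf = math.log((N - n_t + 0.5) / (n_t + 0.5) + 1)
        f_td = doc_tokens.count(t)                # term frequency in the document
        denom = f_td + k1 * (1 - b + b * len(doc_tokens) / avgdl)
        score += idf * f_td * (k1 + 1) / denom
    return score

corpus = [["hello", "world"], ["hello", "ben"], ["world", "of", "databases"]]
print(bm25_score(["hello", "world"], corpus[0], corpus))
```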
Ranking Web Pages
- Web pages are really just a collection of documents online
- Can search them just like local collections of documents
- Extra things to consider:
- When ranking, what web page should be first
- Many search engines used to use something like TF-IDF
- TF-IDF can fail for many reasons
- Somewhat easy to manipulate
- Google revolutionized web page ranking with an algorithm called PageRank
Page Rank:
Overview
- Ranking algorithm developed by Google to solve the problems plaguing web search
- Simple Idea:
- Uses the links between pages to “vote” on the importance of a page
- If a page is popular, its “vote” counts for more
- It is recursive - earlier rankings affect future rankings
Page Rank:
Formula
R(u) = c * SUM_{v in B_u} [ R(v) / N_v ]
- R(u) = rank of web page u
- B_u = set of pages pointing to page u
- F_u = set of pages u points to
- N_u = |F_u| = number of links from u
Page Rank:
Recursion
- Since a page’s ranking depends on the rankings of the pages that link to it, the computation is recursive: a page’s earlier ranking affects its later ranking
- The formula is iterated until the change in rank from one iteration to the next is under some predefined threshold
- Epsilon is usually used as the name for this threshold
- Dangling links are ignored until PageRank has been calculated once
- The original paper explicitly states “…computation of PageRank is fairly straightforward if we ignore issues of scale”
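A minimal sketch of the iterative computation described above, assuming the link graph is a dict mapping each page to the pages it points to. Here c is treated as a normalization factor that keeps the total rank constant, as in the simplified formulation; damping and dangling-link handling are omitted, and the names are illustrative:
```python
def pagerank(out_links, epsilon=1e-6, max_iters=100):
    # out_links: {page: [pages it points to]}; assumes no dangling pages
    pages = list(out_links)
    rank = {p: 1.0 / len(pages) for p in pages}
    for _ in range(max_iters):
        new_rank = {}
        for u in pages:
            # R(u) = c * sum over v in B_u of R(v) / N_v
            backlinks = [v for v in pages if u in out_links[v]]
            new_rank[u] = sum(rank[v] / len(out_links[v]) for v in backlinks)
        # c: normalize so the total rank stays constant (sums to 1)
        total = sum(new_rank.values()) or 1.0
        new_rank = {p: r / total for p, r in new_rank.items()}
        # iterate until the change from one iteration to the next is under epsilon
        if sum(abs(new_rank[p] - rank[p]) for p in pages) < epsilon:
            return new_rank
        rank = new_rank
    return rank

print(pagerank({"a": ["b"], "b": ["a", "c"], "c": ["a"]}))
```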
Page Rank:
Dangling Links
Pages that “go nowhere”.
They do not have links to other pages.
Ignored until PageRank has been calculated once, since other pages need to be recursively iterated.
HITS:
Overview
Hypertext Induced Topics Search (HITS)
- Developed by Jon Kleinberg in 1998
- Distinguishes types of queries:
- Specific Query - one that has the Scarcity Problem
- Broad Query - one that has the Abundance Problem
- Some pages have special roles:
- Authorities
- Hubs
HITS:
Authorities
Pages that have many incoming links from Hubs
- Recognized as providing significant, trustworthy, and useful information on some topic
- Has some “degree”: the number of pages that point to it
- Comparing the degree allows us to distinguish between authorities
HITS:
Hubs
Pages that point to many other relevant pages
- Can act as an “index”
- Generally point to lots of authorities
- For example, the course page is a Hub
HITS:
How Authorities and Hubs work together
Hubs point to many authorities
Authorities are referenced by many Hubs
Together, they create a bipartite graph,
essentially delineating different topics
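A minimal sketch of the mutually reinforcing hub/authority updates, in one common formulation (normalization and convergence details vary; names are illustrative):
```python
import math

def hits(out_links, iterations=20):
    # out_links: {page: [pages it points to]}
    pages = list(out_links)
    hub = {p: 1.0 for p in pages}
    auth = {p: 1.0 for p in pages}
    for _ in range(iterations):
        # authority score: sum of hub scores of the pages pointing to it
        for p in pages:
            auth[p] = sum(hub[v] for v in pages if p in out_links[v])
        # hub score: sum of authority scores of the pages it points to
        for p in pages:
            hub[p] = sum(auth[q] for q in out_links[p])
        # normalize so the scores do not grow without bound
        a_norm = math.sqrt(sum(x * x for x in auth.values())) or 1.0
        h_norm = math.sqrt(sum(x * x for x in hub.values())) or 1.0
        auth = {p: auth[p] / a_norm for p in pages}
        hub = {p: hub[p] / h_norm for p in pages}
    return hub, auth

print(hits({"a": ["b", "c"], "b": ["c"], "c": []}))
```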
Information Retrieval:
Determining Success
- When results are returned to the user, how do we determine whether they got the correct results or not?
- Multiple methods can be used to analyze this, depending on the data returned and the queries that were issued
- Also used to determine how well a database retrieval system is performing
- Specifically effectiveness, not efficiency
IR Analysis:
Efficiency vs Effectiveness
- Efficiency is primarily concerned with time
- Query Latency vs Query Throughput
- Indexing, query optimization, etc
- Effectiveness is concerned with accuracy
- Usually need some kind of feedback from the user
Effectiveness:
Determining Feedback
- Generally get feedback from the user
- Implicit
- Explicit
- Pseudo Explicit
Explicit feedback is stated directly by the user
- Relevance ratings, thumbs up/down, etc
Implicit feedback is inferred from user behavior
- Clicks, view time, etc
Pseudo feedback is drawn from the data itself
- Add to the search query a term that is frequent in the top results
Effectiveness:
Common Metrics
There are some well defined metrics:
- Precision
- Recall
- Relevance
- F-Measure (or F-Score)
- Reciprocal Rank
- Mean Reciprocal Rank(MRR)
- Normalized Discounted Cumulative Gain (NDCG)
Effectiveness:
Usefulness of Metrics
- Can be used as learning signals:
- Tune the system to maximize a given metric
- Reinforce strategies that improve it
- Can also be used simply to measure how effective the system is
Effectiveness:
Definitions:
False Negative
False Positive
False Negative
Some relevant documents may not be retrieved
False Positive
Some irrelevant documents may be retrieved
Effectiveness:
Precision
The ratio of relevant (desired) tuples returned
to the total number of tuples returned to the user
- P@K is used for K returned results, or per page
- Example:
- If IR retrieved the top 10 results and 5 of them are relevant to the user
- Then precision is 5/10 , or 1/2
- Max precision may not always be 1
Effectiveness:
Recall
Recall
is the ratio of relevant tuples returned to the user
vs the number of relevant tuples in the database
- Example:
- If IR returned the top 10 results
- 5 of them are relevant
- but there are 20 relevant tuples in the database
- Recall is 5/20, or 1/4
- Can be trivially increased by returning more documents,
- But this can decrease precision
- May not always be possible to get recall of 1
Effectiveness:
Precision vs Recall
- Recall can be trivially increased
- Simply return more documents
- Returning more documents can cause precision to decrease, so there is a tradeoff
- The converse is also true:
- Returning a single relevant tuple gives 100% precision, but this gives terrible recall
- The F-Measure method provides a way to combine both Precision and Recall
Effectiveness:
F-Measure
F-Measure
- Also called F-Score
- Formula for combining the Precision and Recall metrics
- Balances the trade-off between the two metrics
Effectiveness:
F-Measure:
Formula
Applies a constant, alpha (equivalently beta),
as a weight to balance Precision and Recall:
F = 1 / ( alpha/P + (1 - alpha)/R ) = (1 + beta^2) * P * R / ( beta^2 * P + R )
- F1 Measure:
- alpha = 0.5 -> beta = 1
- F1 = 2 * P * R / (P + R), perfectly balanced between Precision and Recall
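A minimal sketch computing Precision, Recall, and F1 for one query, assuming the set of relevant documents is known (names are illustrative):
```python
def precision_recall_f1(retrieved, relevant):
    # retrieved: list of returned doc ids; relevant: set of all relevant doc ids
    retrieved_relevant = [d for d in retrieved if d in relevant]
    precision = len(retrieved_relevant) / len(retrieved) if retrieved else 0.0
    recall = len(retrieved_relevant) / len(relevant) if relevant else 0.0
    # F1: harmonic mean of precision and recall (alpha = 0.5, beta = 1)
    f1 = (2 * precision * recall / (precision + recall)
          if precision + recall else 0.0)
    return precision, recall, f1

# 10 results returned, 5 of them relevant, 20 relevant docs in the collection
retrieved = [f"d{i}" for i in range(10)]
relevant = {f"d{i}" for i in range(0, 40, 2)}    # d0, d2, ..., d38 -> 20 docs
print(precision_recall_f1(retrieved, relevant))  # (0.5, 0.25, 0.333...)
```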

Effectiveness:
Precision and Recall
Over Time
- Both metrics can be used to evaluate the effectiveness of retrieval systems over time
- The system is usually queried with more than a single intent
- Take the total precision/recall and divide by the number of interactions
- Gives an average
- Good for tracking if the system is improving over time or if some change in the system worked
Effectiveness:
Order
Precision, Recall and F-Measure don’t take into account order.
However, order of the results matters.
- Users usually only look at the first few results
- Need a way to measure satisfaction that also takes order into account
- Metrics:
- Reciprocal Rank(RR)
- Mean Reciprocal Rank(MRR)
Effectiveness:
Reciprocal Rank (RR)
Reciprocal Rank (RR)
The inverse of the position of the first relevant document:
RR = 1 / pos_r
Used when we care about the position of the document in the returned results
Effectiveness:
Mean Reciprocal Rank (MRR)
Mean Reciprocal Rank (MRR)
The average of the inverse of the position for the first relevant document
Useful for multiple interactions, especially when there is only a single relevant result.
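A minimal sketch of RR and MRR over several queries, assuming each query's results are a ranked list and the relevant documents are known (illustrative names):
```python
def reciprocal_rank(ranked_results, relevant):
    # RR = 1 / position of the first relevant document (0 if none retrieved)
    for pos, doc in enumerate(ranked_results, start=1):
        if doc in relevant:
            return 1.0 / pos
    return 0.0

def mean_reciprocal_rank(queries):
    # queries: list of (ranked_results, relevant_set) pairs
    return sum(reciprocal_rank(r, rel) for r, rel in queries) / len(queries)

queries = [(["d3", "d1", "d2"], {"d1"}),   # first relevant at position 2 -> 1/2
           (["d5", "d6", "d7"], {"d7"})]   # first relevant at position 3 -> 1/3
print(mean_reciprocal_rank(queries))       # (1/2 + 1/3) / 2, about 0.417
```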

Effectiveness:
Relevance of Document
- How well a document matches what a user is actually looking for
- Some documents may be related, but not actually what they are looking for
- Basic: Relevance Judgement Scores
- Files that are created/learned by the system over time
- Rank documents from 0-4 on a given intent
- Expensive and biased
- Best to use a simple binary model, yes or no (click or no click)
- More Advanced: NDCG
Effectiveness:
Considerations/Factors that determine overall effectiveness
- Precision
- Single Query
- Over time
- Recall
- Single Query
- Over time
- Order
- Relevance of document
Effectiveness:
Normalized Discounted Cumulative Gain
(NDCG)
Normalized Discounted Cumulative Gain (NDCG)
A measure of the relevance of a document
- Ranges between 0 and 1
- Normalized from DCG
- DCG gives a larger score for documents with high relevance
- But, DCG is unbounded
- Normalized by dividing DCG by IDCG
- NDCG = DCG / IDCG
Effectiveness:
Discounted Cumulative Gain (DCG)
and IDCG
Discounted Cumulative Gain (DCG)
Gives a larger score for documents with high relevance.
Ideal DCG (IDCG) is calculated with the tuples ordered in the optimal order based on relevance. Used for normalizing DCG.
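A minimal sketch of DCG, IDCG, and NDCG using the common rel_i / log2(i + 1) discounting; other discounting variants exist, so treat the exact formula as illustrative:
```python
import math

def dcg(relevances):
    # DCG = sum over positions i of rel_i / log2(i + 1), positions starting at 1
    return sum(rel / math.log2(i + 1) for i, rel in enumerate(relevances, start=1))

def ndcg(relevances):
    # Normalize by the DCG of the ideal (descending-relevance) ordering
    ideal = dcg(sorted(relevances, reverse=True))
    return dcg(relevances) / ideal if ideal else 0.0

# Relevance judgements (0-4) for the returned results, in returned order
print(ndcg([3, 2, 0, 1, 4]))   # between 0 and 1; 1.0 only for the ideal order
```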

Ranking Optimization:
Keyword Analysis
Methods/Considerations
- Stopping
- Stemming
- Proximity/Clustering
- N-Grams
- Synonyms
- Homonyms
Ranking Optimization:
Stopping
Removes extremely common terms
from the retrieval method or index
- Some terms are so common that they are not useful:
- “the” “and” “a” , etc
- The most frequently occurring words make up the majority of the size of the documents
- Uses Zipf’s Law to determine what words count as “stop words”
- Results in fewer terms to search over
- More accurate keyword searches
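A minimal sketch of stop word removal; the stop list here is a tiny illustrative subset (real systems use a much larger list, e.g. derived from collection frequencies):
```python
STOP_WORDS = {"the", "and", "a", "of", "to", "in", "is"}   # tiny illustrative list

def remove_stop_words(tokens):
    # Drop extremely common terms before indexing or querying
    return [t for t in tokens if t.lower() not in STOP_WORDS]

print(remove_stop_words("The cat and the dog".split()))    # ['cat', 'dog']
```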
Ranking Optimization:
Stemming
Reduces some words to their “stem” prior to searching
- User’s keyword queries may not be an exact match, but rather some variant of the word
- Plurals, past tense, suffixes, prefixes, etc
- Stemming removes prefixes and suffixes
- Can reduce the size of the inverted index
- May use Table Lookup to find minimum form of different terms
- NLTK library does this
- Not always effective
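A minimal sketch using NLTK's Porter stemmer (assuming the nltk package is installed; a table-lookup lemmatizer would be used for the minimum-form approach mentioned above):
```python
from nltk.stem import PorterStemmer   # requires: pip install nltk

stemmer = PorterStemmer()
words = ["searching", "searched", "searches", "retrieval", "universities"]
# Reduce each word to its stem before indexing or matching
print([stemmer.stem(w) for w in words])
# e.g. ['search', 'search', 'search', 'retriev', 'univers']
```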
Ranking Optimization:
Proximity
- Idea:
- Keywords that are in the query might appear close to each other in the document
- The closer they are to each other, the more important the document is
- Vice Versa
- Can also consider the order of the keywords
- One way to take advantage of this is the use of N-Grams
Ranking Optimization:
N-Grams
- Basic Idea: Clustering characters together
- Not too useful for this context
- Better: Clustering words together
- Given: “Hello World. Hello Ben”
- 2-Grams:
- “Hello World”, “World. Hello”, “Hello Ben”
- 3-Grams:
- “Hello World. Hello”, “World. Hello Ben”
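A minimal sketch of word-level n-gram generation over a token list (punctuation handling is simplified here; names are illustrative):
```python
def word_ngrams(tokens, n):
    # Slide a window of size n over the token list
    return [" ".join(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]

tokens = "Hello World. Hello Ben".split()
print(word_ngrams(tokens, 2))   # ['Hello World.', 'World. Hello', 'Hello Ben']
print(word_ngrams(tokens, 3))   # ['Hello World. Hello', 'World. Hello Ben']
```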
Ranking Optimization:
Synonyms and Homonyms
Synonyms
Consider synonyms for some terms the user includes in the query:
- Example: Document: “My instructor” , Query: “teacher”
- Expand query to include both terms “teacher or instructor”
- Expanding too much can be problematic: many terms have synonyms that are not appropriate to the user’s intent
Homonyms
Query may include a homonym of the term the user actually intended.
Can also be tricky.