Vector Space Models in Information Retrieval and for Word Meaning Flashcards
1
Q
Vector Space Models
A
Assume that meaning can be modeled by context
Represent the topic by counting how often each word occurs
Word counts represented as vectors
Similarity in meaning as proximity in space
2
Q
Applications of vector space models
A
- Web search
- Automatic thesaurus creation
3
Q
Euclidean distance
A
distance (P, Q)
= sqrt (sum from 1 to n (Pi-Qi)^2)
Does not normalize the number of counts
4
Q
Cosine similarity
A
Look at the angle between the vectors
Built-in normalization