L16 - Word Embedding Flashcards
What issue does Word Embedding solve?
Topic Modellings inability to work effectively on shorter texts.
Give a high level explanation of the concept of Word Embedding…
Converts words into numeric vectors. Similar words can then be represented by the same numeric value within the Word Embedding.
Give the step by step process of how it works…
- Apply one hot vector transformation to all words in the corpus.
- Input vectors into a 2D neural-network. Input and output vectors will alwars represent 1 word has a complexity of 2V + N (V is input vecotrs, N is hidden layer)
- Input vectors are fed to activation function along with their associated weight.
- Each input vector now corresponds with a value in Word Embedding (middle vector). Similar words are given similar or same value. Thus, next word can be predicted.
What are 2 adaptations to improve the Word Embedding model?
Continuous Bag of Words Model - As opposed to a single vector for each input, use a set of vectors (set of words) to establish context for a more accurate output.
Skip-gram model - Enables the prediction of more than 1 word.
What is the goal of Word Embedding?
We want to establish a low dimension Word Embedding (weight vector) which enables the prediction of next words.
What are the 2 main properties of Word Embeddings?
- Numerical Vectors - Represent words as numerical vectors.
- Vector Space Relationships - In the mathematical vector space, closeness of vectors represent the similarities of the words the vectors represent.
What is the main issue of Word Embedding? What is the solution to this?
Polysemy - Some words can be viable for multiple embeddings due to multiple meanings depending on the context. For example, Queen could be royalty, music and cards. If the text contains all these themes, no clear relationship can be established between the word and document.
Solution : Sense Embedding
How does Sense Embedding solve the Polysemy issue of Word Embedding?
Construct embeddings for each sense that the word can be used in.
Feed the model with many labeled examples in which the word is used.
The model can then derive more detailed context of when the word is used and it’s associated data via the appropriate embedding.
What does sentence / document embedding mean?
The closer proximity of sentence or document vectors implies that they are more similar.
What are some applications of Word Embedding?
Automatic translation
Text summarisation
Clustering similar texts