MS cheat cheet Flashcards
1
Q
Give all text Analytics methods on the cheat sheet
A
- Extract N-gram features from text.
- Feature Hashing
- Preprocess Text
- Word2Vector
2
Q
Explain Extract N-gram features from text
A
Creates a dictionary of n-grams
from a column of free text
3
Q
Explain Feature Hashing
A
Converts text data to integer
encoded features using the
Vowpal Wabbit library
4
Q
Explain Preprocess Text
A
Performs cleaning operations on text,
like removal of stop-words, case
normalization
5
Q
Explain Word2Vector
A
Converts words to values for use in
NLP tasks, like recommender, named
entity recognition, machine
translation