Midterm Review Flashcards
(104 cards)
What does NLP stand for?
Natural Language Processing
What are the two main categories of tasks in NLP?
- Pre-neural
- Post-neural and pre-LLM
- Post-LLM
What is the purpose of sequence tagging/classification tasks?
They mostly serve as features for downstream applications
What is the role of Part of Speech Tagging in NLP?
Widely adopted as features in ML models
What does morphology tagging involve?
Lemmatization and reducing words to their simplest form
What is Dependency Parsing used for?
Understanding sentence structure and semantic disambiguation
What does Semantic Role Labeling (SRL) identify?
Identifies predicates and roles of elements in sentences
What is Named Entity Recognition (NER)?
Identifies all named entities in text that belong to certain types
What type of models are Markov Chains?
Chains of states with transition probabilities
What is a language model?
A model that predicts the next word/token in a sequence
What is the objective of a language model?
Compute the probability of a sentence or sequence of words
Define perplexity in the context of language models.
Inverse probability of test text normalized by the number of words
What are some common classification tasks in NLP?
- Document Classification
- Spam Detection
- Sentiment Analysis
- Textual Entailment
What is the goal of generative classifiers?
To model P(x, y) = P(x | y) P(y)
What is the main advantage of discriminative classifiers?
Few assumptions and more direct modeling
What is the function of the softmax function in multi-class logistic regression?
Converts logits into probabilities that sum to 1
Fill in the blank: In Naive Bayes, you estimate P(x|c) and P(c) for classification based on _______.
features
What is the purpose of TF-IDF in text representation?
To weigh the importance of a term in a document relative to the corpus
What is Word2Vec used for?
To train a classifier to predict word embeddings
What is the concept behind the skip-gram architecture in Word2Vec?
Uses nearby words to predict the target word
What is the primary function of a neural unit in a neural network?
To carry information and produce a non-linear activation value
What is the significance of using non-linear activation functions in neural networks?
They enable the network to approximate complex functions
What does RNN stand for?
Recurrent Neural Network
What is the purpose of training an RNN?
To handle sequences and maintain context from previous inputs