lesson_13_flashcards
What is an embedding in machine learning?
A mapping of objects (e.g., words, nodes, images) into vectors in a continuous vector space, where proximity indicates similarity.
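A minimal sketch of the idea using a PyTorch lookup table; the vocabulary size, dimension, and object IDs below are arbitrary placeholders:

```python
# Minimal sketch: mapping object IDs to vectors with a lookup table.
# The vocabulary size (1000) and dimension (64) are arbitrary choices.
import torch
import torch.nn as nn

embedding = nn.Embedding(num_embeddings=1000, embedding_dim=64)
ids = torch.tensor([3, 17])          # two hypothetical object IDs
vecs = embedding(ids)                # shape: (2, 64)

# Proximity (here cosine similarity) indicates how similar the objects are.
similarity = torch.cosine_similarity(vecs[0], vecs[1], dim=0)
print(similarity.item())
```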
What are word embeddings?
Vector representations of words that capture semantic and syntactic relationships, learned from co-occurrence in text data.
What is the distributional hypothesis in NLP?
The idea that words appearing in similar contexts tend to have similar meanings, forming the basis of word embeddings.
What is Word2Vec?
A neural embedding model that predicts context words given a target word (skip-gram) or target word given context words (CBOW).
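A toy skip-gram sketch, not the reference Word2Vec implementation; the corpus, window size, dimensions, and training loop are illustrative assumptions:

```python
# Toy skip-gram: given a target word, predict each word in its context window.
import torch
import torch.nn as nn

corpus = "the quick brown fox jumps over the lazy dog".split()
vocab = {w: i for i, w in enumerate(sorted(set(corpus)))}
window = 2

pairs = []  # (target, context) index pairs
for i, w in enumerate(corpus):
    for j in range(max(0, i - window), min(len(corpus), i + window + 1)):
        if j != i:
            pairs.append((vocab[w], vocab[corpus[j]]))

dim = 16
in_emb = nn.Embedding(len(vocab), dim)   # target-word vectors (the embeddings)
out_proj = nn.Linear(dim, len(vocab))    # scores over the context vocabulary
opt = torch.optim.Adam(list(in_emb.parameters()) + list(out_proj.parameters()), lr=0.01)
loss_fn = nn.CrossEntropyLoss()

for epoch in range(50):
    targets = torch.tensor([t for t, _ in pairs])
    contexts = torch.tensor([c for _, c in pairs])
    loss = loss_fn(out_proj(in_emb(targets)), contexts)
    opt.zero_grad(); loss.backward(); opt.step()

word_vectors = in_emb.weight.detach()    # learned word embeddings
```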
What are graph embeddings?
Learned vector representations of nodes in a graph that encode structural and relational properties for downstream tasks.
What is negative sampling in Word2Vec?
A training trick that replaces the full softmax: for each true (target, context) pair, a small number of random "negative" words are sampled, and the model learns to score the true pair above them, making training far more efficient.
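A sketch of the negative-sampling objective for one positive pair; negatives are drawn uniformly here, whereas Word2Vec samples from a smoothed unigram distribution, and all sizes are assumptions:

```python
# Score one true (target, context) pair against k randomly sampled negatives.
import torch
import torch.nn.functional as F

vocab_size, dim, k = 5000, 64, 5
target_vecs = torch.randn(vocab_size, dim, requires_grad=True)
context_vecs = torch.randn(vocab_size, dim, requires_grad=True)

def neg_sampling_loss(target_id, context_id):
    t = target_vecs[target_id]
    pos = context_vecs[context_id]
    neg_ids = torch.randint(0, vocab_size, (k,))   # uniform here; Word2Vec uses
    neg = context_vecs[neg_ids]                    # a unigram^0.75 distribution
    pos_term = F.logsigmoid(t @ pos)               # pull the true pair together
    neg_term = F.logsigmoid(-(neg @ t)).sum()      # push sampled negatives apart
    return -(pos_term + neg_term)

loss = neg_sampling_loss(10, 42)
loss.backward()
```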
What are hierarchical embeddings?
Representations in hyperbolic space capturing hierarchical relationships, requiring fewer dimensions compared to Euclidean space.
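A sketch of the Poincaré-ball distance commonly used for such hyperbolic embeddings; the two example points are arbitrary:

```python
# Poincaré-ball distance: points live inside the unit ball, and points near the
# boundary typically represent more specific concepts in the hierarchy.
import torch

def poincare_distance(u, v, eps=1e-9):
    sq_u = (u * u).sum()
    sq_v = (v * v).sum()
    sq_diff = ((u - v) ** 2).sum()
    x = 1 + 2 * sq_diff / ((1 - sq_u) * (1 - sq_v) + eps)
    return torch.acosh(x)

u = torch.tensor([0.1, 0.2])   # near the origin: a general concept
v = torch.tensor([0.7, 0.6])   # near the boundary: a specific concept
print(poincare_distance(u, v))
```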
What is fairness in embeddings?
Ensuring embeddings do not amplify or perpetuate biases present in the training data, as seen in cases like gender-biased word analogies.
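One simple, illustrative bias probe: project word vectors onto a gender direction. The `vectors` dictionary of pretrained embeddings is an assumption, not provided here:

```python
# Probe gender bias by projecting occupation vectors onto the (he - she) direction.
import numpy as np

def cosine(a, b):
    return a @ b / (np.linalg.norm(a) * np.linalg.norm(b))

def gender_bias(word, vectors):
    direction = vectors["he"] - vectors["she"]
    return cosine(vectors[word], direction)   # > 0 leans "he", < 0 leans "she"

# e.g. compare gender_bias("programmer", vectors) vs. gender_bias("homemaker", vectors)
```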
What is PyTorch BigGraph?
A scalable framework for training embeddings on large graphs with billions of nodes and edges, using techniques like partitioning.
What is intrinsic evaluation of embeddings?
Evaluation based on internal properties, such as nearest neighbor quality or analogy tasks, to assess semantic and syntactic relationships.
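A sketch of the classic analogy test (king - man + woman ≈ queen), assuming a `vectors` dict mapping words to numpy arrays:

```python
# Analogy test: return the nearest words to (b - a + c) by cosine similarity.
import numpy as np

def analogy(a, b, c, vectors, topn=1):
    query = vectors[b] - vectors[a] + vectors[c]
    scores = {
        w: query @ v / (np.linalg.norm(query) * np.linalg.norm(v))
        for w, v in vectors.items() if w not in (a, b, c)
    }
    return sorted(scores, key=scores.get, reverse=True)[:topn]

# analogy("man", "king", "woman", vectors)  ->  expected: ["queen"]
```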
What is extrinsic evaluation of embeddings?
Assessment based on performance in downstream tasks like classification, clustering, or recommendation.
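A sketch of extrinsic evaluation with scikit-learn: embeddings (random stand-ins here) are used as features for a downstream classifier and judged by its accuracy:

```python
# Feed embeddings to a downstream classifier and report held-out accuracy.
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

X = np.random.randn(200, 32)            # stand-in for learned embeddings
y = np.random.randint(0, 2, size=200)   # downstream task labels
X_tr, X_te, y_tr, y_te = train_test_split(X, y, test_size=0.25, random_state=0)

clf = LogisticRegression(max_iter=1000).fit(X_tr, y_tr)
print("downstream accuracy:", clf.score(X_te, y_te))
```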
What is matrix factorization in graph embeddings?
A method to decompose the adjacency matrix of a graph into low-dimensional latent factors, representing nodes as embeddings.
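A sketch using truncated SVD on a toy 4-node adjacency matrix; real systems factorize far larger, often reweighted, matrices:

```python
# Factorize the adjacency matrix into low-dimensional node embeddings.
import numpy as np

A = np.array([[0, 1, 1, 0],
              [1, 0, 1, 0],
              [1, 1, 0, 1],
              [0, 0, 1, 0]], dtype=float)

U, S, Vt = np.linalg.svd(A)
d = 2                                         # embedding dimension
node_embeddings = U[:, :d] * np.sqrt(S[:d])   # one row per node
print(node_embeddings)
```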
What is hierarchical softmax in Word2Vec?
A technique for efficiently computing word probabilities over large vocabularies: words are leaves of a binary tree, and a word's probability is the product of binary decisions along its path, reducing the cost from O(|V|) to O(log |V|).
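A sketch over a 4-word toy tree; the tree layout and vectors are illustrative assumptions, and the word probabilities still sum to 1:

```python
# Hierarchical softmax: a word's probability is a product of sigmoid decisions
# taken at the internal nodes along its path in a binary tree.
import torch

dim = 8
# Path for each word: (internal_node_id, direction), direction = +1 left, -1 right.
paths = {
    "cat": [(0, +1), (1, +1)],
    "dog": [(0, +1), (1, -1)],
    "car": [(0, -1), (2, +1)],
    "bus": [(0, -1), (2, -1)],
}
node_vecs = torch.randn(3, dim)   # one vector per internal node
hidden = torch.randn(dim)         # context representation of the input word

def word_prob(word):
    p = torch.tensor(1.0)
    for node, direction in paths[word]:
        p = p * torch.sigmoid(direction * (node_vecs[node] @ hidden))
    return p

print(sum(word_prob(w) for w in paths))   # probabilities over the tree sum to 1
```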
What is contextual word embedding?
Embeddings produced by models such as ELMo and BERT, which generate a word's representation dynamically from its surrounding sentence, so the same word can receive different vectors in different contexts.
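A sketch with the Hugging Face Transformers library (assumes the library and the bert-base-uncased weights are available): the same word gets different vectors in different sentences:

```python
# Extract the contextual vector for one word and compare it across sentences.
import torch
from transformers import AutoModel, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
model = AutoModel.from_pretrained("bert-base-uncased")

def token_vector(sentence, word):
    inputs = tokenizer(sentence, return_tensors="pt")
    with torch.no_grad():
        hidden = model(**inputs).last_hidden_state[0]
    tokens = tokenizer.convert_ids_to_tokens(inputs["input_ids"][0])
    return hidden[tokens.index(word)]

v1 = token_vector("i sat by the river bank", "bank")
v2 = token_vector("i deposited cash at the bank", "bank")
print(torch.cosine_similarity(v1, v2, dim=0))   # < 1: context changes the vector
```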
How do embeddings enable recommendation systems?
By representing users and items in the same space, embeddings help predict preferences and recommend similar items.
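A sketch of scoring items for a user by dot product in a shared space; the embedding matrices here are random stand-ins for learned ones:

```python
# Rank items for a user by dot product between user and item embeddings.
import torch

n_users, n_items, dim = 100, 500, 32
user_emb = torch.randn(n_users, dim)
item_emb = torch.randn(n_items, dim)

user_id = 7
scores = item_emb @ user_emb[user_id]          # one score per item
top_items = torch.topk(scores, k=5).indices    # recommend the 5 best matches
print(top_items)
```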