Lecture 13 Flashcards

1
Q

What is clustering in machine learning?

A

An unsupervised learning method used to group similar data points together.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is k-means clustering?

A

An iterative algorithm that assigns data points to k clusters based on their similarity.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

How does k-means clustering determine the number of clusters?

A

The number of clusters (k) is an input parameter chosen by the user.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is a disadvantage of k-means clustering?

A

It can get stuck in local minima and is sensitive to outliers.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is the silhouette coefficient?

A

A measure between -1 and 1 that evaluates clustering quality by comparing intra-cluster and inter-cluster similarity.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is an artificial neural network (ANN)?

A

A computational model inspired by biological neural networks that learns patterns from data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is a perceptron?

A

A type of artificial neuron that makes binary classifications using a weighted sum of inputs and an activation function.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is the purpose of an activation function in a neural network?

A

To introduce non-linearity into the model, allowing it to learn complex patterns.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is the sigmoid activation function?

A

A function that maps input values to a range between 0 and 1, often used in binary classification.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is a feedforward neural network?

A

A type of neural network where data moves in one direction from input to output without cycles.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is backpropagation?

A

A learning algorithm that adjusts neural network weights by computing errors and propagating them backward.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is a loss function in neural networks?

A

A function that measures the difference between the predicted output and the actual output.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is gradient descent?

A

An optimization algorithm used to minimize the loss function by updating model parameters iteratively.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is stochastic gradient descent (SGD)?

A

A variant of gradient descent that updates model parameters using a single sample or a small batch at a time.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is a deep neural network?

A

A neural network with multiple hidden layers that can learn complex representations.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is the difference between supervised and unsupervised learning?

A

Supervised learning uses labeled data, while unsupervised learning finds patterns in unlabeled data.

17
Q

What is a multilayer perceptron (MLP)?

A

A type of feedforward neural network with multiple hidden layers.

18
Q

What is softmax activation used for?

A

It converts logits into probabilities for multiclass classification.

19
Q

What is a generative pre-trained transformer (GPT)?

A

A deep learning model trained on large datasets to generate human-like text.

20
Q

How is GPT trained?

A

Through unsupervised pre-training on large text corpora followed by supervised fine-tuning.

21
Q

What is beam search in GPT models?

A

A search algorithm that finds the most likely sequence of words when generating text.

22
Q

Why do neural networks require large amounts of data?

A

They have many parameters that need to be trained to generalize well to unseen data.

23
Q

What is overfitting in neural networks?

A

When a model learns patterns specific to the training data and fails to generalize to new data.

24
Q

What is the purpose of model selection in clustering?

A

To choose the best clustering model based on the data and the intended application.

25
Q

Why is choosing the right number of clusters (k) challenging?

A

There is no universal method for selecting k, and different values can lead to different interpretations.