Question 1

What is model class selection in machine learning?

Accepted Answer

The process of choosing the family of models (the hypothesis space) to fit, for example linear functions or polynomials of a given degree, as appropriate for the task (classification, regression, or clustering).

Question 2

What is the purpose of a hyperplane in classification?

Accepted Answer

A hyperplane is a decision boundary that separates classes in feature space; its dimension is one less than that of the space (a line in 2D, a plane in 3D).

Question 3

What is the maximum margin hyperplane?

Accepted Answer

The separating hyperplane that maximizes the margin, i.e. the distance to the nearest training points of each class; it is the decision boundary learned by Support Vector Machines.

Question 4

Why is feature selection important in machine learning?

Accepted Answer

Keeping relevant features and dropping irrelevant or redundant ones can improve model accuracy and reduce overfitting.

Question 5

What is overfitting?

Accepted Answer

When a model is too complex and fits the training data too closely, including its noise, leading to poor generalization on new data.

Question 6

What is underfitting?

Accepted Answer

When a model is too simple and fails to capture patterns in the training data, leading to poor performance on both training data and new data.

Question 7

What is a design matrix?

Accepted Answer

A matrix with one row per example and one column per feature; columns can be transformed or added (e.g. polynomial terms) to improve model performance.
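As a small illustration, here is a sketch that builds a design matrix from a single input feature by adding polynomial columns. The function name `design_matrix` and the choice of polynomial terms are assumptions made for this example, not something the card specifies.

```python
def design_matrix(xs, degree):
    """Return one row per example with columns [1, x, x**2, ..., x**degree]."""
    return [[x ** d for d in range(degree + 1)] for x in xs]


# Three examples, expanded to a constant, a linear, and a squared column.
X = design_matrix([1.0, 2.0, 3.0], degree=2)
for row in X:
    print(row)
```

Each row still describes one example; only the set of columns has changed.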

Question 8

What is the role of mean squared error (MSE) in regression?

Accepted Answer

MSE measures the average squared difference between predicted and actual values; it serves as the loss function that regression training minimizes.
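A minimal sketch of the computation, written in plain Python; the function name `mean_squared_error` and the sample values are illustrative assumptions.

```python
def mean_squared_error(y_true, y_pred):
    """Average of the squared differences between targets and predictions."""
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("inputs must be non-empty and of equal length")
    return sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)


# Errors are 1, 0, and -2, so the squared errors are 1, 0, and 4.
mse = mean_squared_error([3.0, 5.0, 7.0], [2.0, 5.0, 9.0])
print(mse)
```

Squaring makes large errors count disproportionately, which is why a single bad prediction can dominate the loss.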

Question 9

What is Occam’s Razor in model selection?

Accepted Answer

The principle that, among models that fit the data adequately, the simplest should be preferred.

Question 10

Why do we use high-dimensional feature spaces?

Accepted Answer

Mapping data into a higher-dimensional feature space can make classes that are not linearly separable in the original space linearly separable, improving classification performance.
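To make this concrete, here is a sketch with made-up 1D data in which the positive class lies on both sides of the negative class, so no single threshold on x separates them. The feature map `lift`, the weights, and the data points are all assumptions chosen for illustration.

```python
def lift(x):
    """Map a 1D input to the 2D feature vector (x, x^2)."""
    return (x, x * x)


def predict(features, w, b):
    """Linear classifier: +1 if w . features + b > 0, else -1."""
    score = sum(wi * fi for wi, fi in zip(w, features)) + b
    return 1 if score > 0 else -1


xs = [-2.0, -0.5, 0.5, 2.0]
labels = [1, -1, -1, 1]  # outer points positive, inner points negative

# In the lifted space the hyperplane x^2 - 1 = 0 separates the classes.
w, b = (0.0, 1.0), -1.0
predictions = [predict(lift(x), w, b) for x in xs]
print(predictions == labels)
```

The classifier is still linear; only the space it operates in has changed.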

Question 11

What is gradient descent?

Accepted Answer

An optimization algorithm that searches for a minimum of a function by iteratively updating parameters in the direction of the negative gradient, scaled by a learning rate.
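A minimal sketch of the update loop, applied to the one-variable function f(x) = (x - 3)^2. The function, starting point, learning rate, and step count are assumptions picked so the result is easy to check.

```python
def gradient_descent(grad, x0, learning_rate=0.1, steps=100):
    """Repeatedly step against the gradient, starting from x0."""
    x = x0
    for _ in range(steps):
        x -= learning_rate * grad(x)
    return x


# f(x) = (x - 3)^2 has gradient 2 * (x - 3) and its minimum at x = 3.
x_min = gradient_descent(lambda x: 2 * (x - 3), x0=0.0)
print(round(x_min, 6))
```

With this learning rate the distance to the minimum shrinks by a factor of 0.8 per step; a rate that is too large would make the iterates diverge instead.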

Question 12

What is the difference between local and global minima?

Accepted Answer

A local minimum is a point where the function is no higher than at all nearby points, while a global minimum is a point where the function takes its lowest value over the entire domain.

Question 13

How does stochastic gradient descent (SGD) differ from regular gradient descent?

Accepted Answer

Regular (batch) gradient descent computes the gradient over the entire training set for each update; SGD uses a single random data point or a small batch, making each update much cheaper but noisier.
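The sketch below fits a one-parameter model y = w * x using single-example updates. The data, learning rate, and seed are assumptions; the data is deliberately noise-free (generated with w = 2) so the outcome is easy to verify.

```python
import random


def sgd_fit(xs, ys, learning_rate=0.01, steps=1000, seed=0):
    """Fit y = w * x by updating w from one randomly chosen example per step."""
    rng = random.Random(seed)
    w = 0.0
    for _ in range(steps):
        i = rng.randrange(len(xs))
        # Gradient of the single-example loss (w * x_i - y_i)^2 with respect to w.
        grad = 2 * xs[i] * (w * xs[i] - ys[i])
        w -= learning_rate * grad
    return w


w = sgd_fit([1.0, 2.0, 3.0, 4.0], [2.0, 4.0, 6.0, 8.0])
print(round(w, 4))
```

Each step touches one example rather than all four, which is the source of both the speed and the noise on real, noisy data.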

Question 14

What is convex optimization?

Accepted Answer

A type of optimization problem in which the objective (loss) function and the feasible region are convex, so every local minimum is also a global minimum, making it easier to solve.

Question 15

How do we prevent a model from getting stuck in local minima?

Accepted Answer

By using techniques such as momentum, learning rate schedules, the noise of stochastic updates, or restarting gradient descent from several different starting points and keeping the best result.
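As one illustration of trying different starting points, the sketch below runs plain gradient descent from two starts on a non-convex function and keeps the better result. The function f(x) = x^4 - 3x^2 + x, the starting points, and the step settings are assumptions chosen for the example.

```python
def f(x):
    """Non-convex function with a local minimum (x > 0) and a global one (x < 0)."""
    return x ** 4 - 3 * x ** 2 + x


def grad_f(x):
    return 4 * x ** 3 - 6 * x + 1


def descend(x0, learning_rate=0.01, steps=1000):
    """Plain gradient descent from a single starting point."""
    x = x0
    for _ in range(steps):
        x -= learning_rate * grad_f(x)
    return x


# Run from several starting points and keep the one with the lowest f value.
candidates = [descend(x0) for x0 in (-2.0, 2.0)]
best_x = min(candidates, key=f)
print(round(best_x, 3), round(f(best_x), 3))
```

The run started at 2.0 settles in the shallower minimum; only the comparison across restarts reveals that a lower one exists.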

Lecture 12 Flashcards

(25 cards)