Lecture 12 Flashcards

1
Q

What is model class selection in machine learning?

A

The process of choosing the appropriate model type for a given problem, such as classification, regression, or clustering.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is the purpose of a hyperplane in classification?

A

A hyperplane separates different classes in a feature space, with its dimension being one less than the number of features.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is the maximum margin hyperplane?

A

A hyperplane that maximizes the margin between different classes, used in Support Vector Machines.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Why is feature selection important in machine learning?

A

Choosing relevant features improves model accuracy and reduces overfitting.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is overfitting?

A

When a model is too complex and fits the training data too well, leading to poor generalization on new data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is underfitting?

A

When a model is too simple and fails to capture patterns in the training data, leading to poor performance.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is a design matrix?

A

A matrix representing all features of a dataset, allowing transformations for improved model performance.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is the role of mean squared error (MSE) in regression?

A

MSE measures the average squared difference between predicted and actual values, serving as a loss function.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is Occam’s Razor in model selection?

A

The principle that the simplest model that fits the data should be preferred.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Why do we use high-dimensional feature spaces?

A

Transforming data into higher dimensions can make it linearly separable, improving classification performance.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is gradient descent?

A

An optimization algorithm used to find the minimum of a function by iteratively updating parameters.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is the difference between local and global minima?

A

A local minimum is a point where the function is lower than nearby points, while a global minimum is the lowest possible value.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

How does stochastic gradient descent (SGD) differ from regular gradient descent?

A

SGD updates parameters using a single random data point or a small batch, making it faster but less stable.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is convex optimization?

A

A type of optimization problem where the loss function has a single global minimum, making it easier to solve.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

How do we prevent a model from getting stuck in local minima?

A

By using techniques like momentum, learning rate adjustments, or trying different starting points in gradient descent.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is the difference between classification and regression?

A

Classification predicts categorical labels, while regression predicts continuous values.

17
Q

How do we evaluate a regression model?

A

Using metrics like mean squared error (MSE) to measure prediction accuracy.

18
Q

What is the purpose of a tangent line in optimization?

A

It represents the slope of a function at a point, helping determine the direction for optimization.

19
Q

Why is feature engineering important in machine learning?

A

It helps create informative features that improve model accuracy and efficiency.

20
Q

What is the bias-variance tradeoff?

A

The balance between underfitting (high bias) and overfitting (high variance) in a model.

21
Q

What does it mean for a model to be consistent?

A

A model is consistent if it accurately represents the observed data and generalizes well to new examples.

22
Q

What is a loss function in machine learning?

A

A function that measures how far a model’s predictions are from actual values, guiding optimization.

23
Q

How can increasing the number of features affect model performance?

A

It can improve accuracy but may also lead to overfitting if not managed properly.

24
Q

What is the main advantage of using higher-dimensional spaces in classification?

A

It can make previously inseparable data linearly separable.

25
Q

What is the fundamental goal of machine learning model selection?

A

To find the simplest model that generalizes well to unseen data while maintaining accuracy.