4 Flashcards

1
Q

Which LR training method can you use for sets with millions of features

A

Stochastic or Mini Batch

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Which Training algorithms suffer from poor scaling

A

Gradient Descent will be slow

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Can GD get stuck in local minima

A

Yes - Stochastic may help has it could bounce out

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Do all GD algorithms end up in the same place

A

Yes - however stochastic and mini batch will never reach optima

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

When using gradient descent - if the validation error starts to rise whats going on and how can it be fixeD

A

The model is over fitting and training should be stopped

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Should mini-batch GD be stopped when the validation error goes up

A

No - it may just be a random occurrence it could continue to fall

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Which GD algorithm will reach a good solution the fastest

A

Stochastic however it will never converge perfectly

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

If there is a large gab between training and validation errors for polynomial regression whats happening and how can it be solved

A

Over fitting

Can be fixed by reducing polynomial degree, regularize the model, more training data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

If training and validation error are equal for ridge regression and both high does the model suffer from high bias or variance- should you increase the regularization hyper parameter or reduce it

A

Under fitting meaning a high bias. Try reduce the regularization hyper parameter

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Why would you want to use:
Ridge Regression instead of linear
Lasso Instead of Ridge
Elastic net instead of Lasso

A

Regularized models generally perform better

Automatically performs feature selection

Lasso may be erratic - Elastics with a l1 ratio close to one stops this

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

To classify images as indoor or outdoor and day or night should you implement two logistic regression classifiers or one Softmax Regression Classifier

A

These are not exclusive classes (could be multiple) so two logistic Regression is suitable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly