Exam2 Flashcards

1
Q

What is machine learning

A

Training a model on known data to predict generalized outcomes on unseen data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are the 3 parts of the Machine learning roadmap

A

dimension reduction, if no –> Have responses, if yes –> predicting numeric

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

supervised vs unsupervised

A

supervised: the data is labeled
unsupervised: the data is NOT labeled

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

what is classification

A

given an input, categorizing it:
(ie duck or fruit, cat or dog)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Regression

A

given an x value what is the y value

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

unsupervised learning example

A

clustering: grouping objects based on similarity or differences

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Supervised learning example:

A

regression or classification

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

classification vs regression

A

regression predicts continuous values and classification predicts discrete labels

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

what is a decision tree

A

tree structure, splits based on features and identifications happen at leaf nodes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Learning Road Map

A
  1. pre processing
  2. model selection
  3. model training
  4. model evaluation
  5. model deployment
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

SVM

A

find a line to chop the data in half

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

characteristics of SVM

A
  • robust to noise
  • overfitting is handled by maximizing margin
  • handles irrelevant attributes better than many techniques
  • difficult to handle missing values
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

what is generalization error

A

test error - training error. how well the model generalizes well to new data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

How to improve a model that has poor generalization error

A

More Training data or simplifying the model

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

NN Params

A

Neurons per layer
number of hidden layers

How well did you know this?
1
Not at all
2
3
4
5
Perfectly