lecture 7 machine learning bigger picture Flashcards

1
Q

how many types of machine learning methods are there ?

A

two types supervised learning and unsupervised learning

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

what is supervised learning ?

A

learning rules that describes input/output relationship

two forms- regression and classification

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

what is unsupervised learning ?

A

learning rules that describes input only

two dimension reduction and clustering.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

is it true regssion belongs to supervised learning ?

A

T

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

is it true classification belongs to supervised learning ?

A

T

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

does reduction belong to supervised learning ?

A

no

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

does clustering belong to supervised learning ?

A

no

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

what regression model ?

A

task of fitting model to training data. The learned model can be used to make predictions about continuous valued output

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

what is classification ?

A

instead of continuous valued output. Classification outputs a prediction from a discrete set of values, collapsed and standing

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

what is clustering ?

A

identifies structure in data by grouping samples that share same characteristics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

what is dimension reduction ?

A

if you have data set with a lot of features and you want to reduce the number of features while having most of data original variability. Dimenssion reduction using principal component analysis helps reduce the dimensionality of the dataset by transforming data into fewer dimenssion that still captures most of the variance

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What are machine learning typical steps ?

A

This is done through
-Data collection ( available sources, your own collection )
-feature design (manual versus automated )
-model training
-model validation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

what is training data set?

A

the set of data on which actual training takes place. Bigger set and model learns from. Anlogy would be example/homework problems

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

what is validation set ?

A

the set of data is used during the training phase of the model to provide unbiased evaluation of the models performance and to fine tune the model. i helps select model. Analogy would be sample/mock exam.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

what is test data set ?

A

after model has been trained fully, use test to assess/report your model performance . Analogy actual exam

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

How do divide the data into test, validation and training?

A

we split data randomly

17
Q

what is training error ?

A

training error measures how well a model fits the training data

18
Q

what is validation error

A

help in the model selection and tuning process during development

19
Q

what is test error ?

A

measure how well a model generalizes to new data. In machine learning we only care about test error.

20
Q

Training error is high for low complexity/ less flexiable model i.e underfitting (T/F)

21
Q

test error intially goes down, but eventually increases ( overfitting ). T/F

22
Q

In supervised learning, one can have only one input feature ? (prof)

23
Q

Pandas is imported in your program
To read .csv file
– To see some rows of imported data on your screen
– To get some basic info about the data on your
screen
– All of the above

A

– All of the above

24
Q

Clustering is an example of supervised
learning
– True or False

25
Q

Regression is an example of supervised
learning
– True or False

26
Q

Classification is an unsupervised learning
– True or False

27
Q

Test error does not depend on the model
complexity at all
– False or True

28
Q

Training error always increases as the model
complexity increases
– False or True

29
Q

Our objective to have the model
– Over fit
– Under fit
– High complexity
– None of the above