Week 1 Flashcards

1
Q

Name the 3 learners inputs

A

Domain Set, Label Set, Training Data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Define Domain set

A

An arbitrary set of objects we wish to label

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Define Label Set

A

Assign number to label

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Define Training Data

A

Input that the learner has access to

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Measure of success

A

error of a classifier, the probability that it does not predict the correct label

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Define overfitting

A

When hypothesis fits the training data too well

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Inductive bias

A

bias toward a particular set of predictors

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Confidence parameter

A

probability of getting nonrepresentative sample

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Accuracy parameter

A

quality of prediction

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

ERM

A

Empirical Risk Minimuzation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

PAC

A

Probability Approximately Correct

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

K-NN

A

K Nearest Neighbor

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

k-NN rule summary

A

assumption that things that look alike must be alike

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Why do we need parameters for linkage based clustering algorithms

A

if kept going, it would eventually result in trivial clustering in one large cluster

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Name the two parameters of clustering needed

A

Measure or define the disance between clusters, and determine when to stop merging

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Three types of linkage clustering

A

Single Linkage, Average Linkage, Max linkage

17
Q

k-means objective function

A

uses centroid disctance