Chp 6 Geron Flashcards

1
Q

Leaf node

A

Has no child nodes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Node samples

A

Counts how many training instances it applies to

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Node value

A

Tells you how many training instances of each class this node applies to

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Node Gini

A

Measures its impurity
Pure=Gini=0

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Binary trees

A

Nonleaf nodes, always 2 children

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Id3

A

Produce decision tree nodes with more than 2 children

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

P

A

Set of problems that can be solved in polynomial time

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

NP

A

Set of problems whose solutions can be verified in polynomial time

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

NP-hard

A

A problem to which any NP problem can be reduced in polynomial time

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

NP-complete

A

Both NP and NP-hard

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Information gain

A

Reduction of entropy

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Nonparametric model

A

Number of parameters is not determined prior to training, model structure is free to stick closely to the data —overfitting

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Parametric model

A

Pre-determined number of parameters, degree of freedom is limited, reducing risk of overfitting, but increasing risk of under fitting

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Regularization

A

Restricting decisions tree’s freedom during training
Helps to avoid overfitting

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Min_samples_spilt

A

Minimum umber of samples a node effect it can be spilt

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Min_samples_leaf

A

Minimum number of samples a leaf node must have

17
Q

Min_weight_fraction_leaf

A

Minimum number of samples a leaf node must have but expressed as a fraction of the total number of weighted instances

18
Q

Max_leaf_nodes

A

Max number of leaf nodes

19
Q

max_features

A

Max number of features that are evaluated for splitting at each node