ch. 1 Flashcards

1
Q

define: training set

A

data used to tune parameters

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

define: test set

A

data used to test accuracy

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

define: generalization

A

correctly predict outcome of new input data, according to the expected outcome

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

define: preprocessing/feature extraction

A

processing applied to input data to facilitate learning by reducing variability

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

define: classification

A

assigning inputs to a finite number of discrete categories

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

define: regression

A

assign input to one or more continuous variable(s)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

define: supervised learning

A

training data is comprised of inputs with corresponding outputs

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

define: clustering

A

(in unsupervised learning) goal of distinguishing groups with similar properties

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

define: density estimation

A

(unsupervised learning) goal of finding distribution of data in input space

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

define: reinforcement learning

A

maximize reward

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

subject: “Here the learning algorithm is not given examples of optimal outputs, but must instead discover them by a process of trial and error. Typically there is a sequence of states and actions in which the learning algorithm is interacting with its environment. In many cases, the current action not only affects the immediate reward but also has an impact on the reward at all subsequent time steps.”

A

reinforcement learning

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

vocable: reinforcement learning problem in which reward is attributed to all steps in successful outcome

A

credit assignment problem

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

(reinforcement learning) define: exploration

A

system tries new actions to see their efficacy

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

(reinforcement learning) define: exploitation

A

system uses action that it knows yield high reward

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

vocable: system accurately predicts training data, but not test data

A

over-fitting

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

what happens to severity of over-fitting as the size of the data set increases

A

decreases

17
Q

“By adopting a ___ approach, the over-fitting problem can be avoided. We shall see that there is no difficulty from a ___ perspective in employing models for which the number of parameters greatly exceeds the number of data points.”

A

Bayesian

18
Q

“in a Bayesian model the effective number of parameters adapts automatically to the ___”

A

size of the data set

19
Q

what is the goal of regularization

A

minimize over-fitting (in a non-Bayesian model)

20
Q

what is the goal of regularization

A

minimize over-fitting (in a non-Bayesian model)

21
Q

how does regularization work

A

adds penalty term to error function to discourage coefficients from reaching extremely large values

22
Q

define: shrinkage

A

method that reduces the value of coefficients

23
Q

shrinkage in context of neural-networks

A

weight decay