chapter 8 Flashcards

1
Q

statistical inference

A

a collection of methods that deal with drawing conclusions from data that are prone to random variation
central idea: probabilistic/statistical model: view data as realisations of random variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

statistical model

A

gives the probability distribution under which we assume the data have been generated. Typically this model contains unknown parameters

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Frequentist approach

A

parameters are considered fixed quantities

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Bayesian approach

A

parameters are assigned a probability distribution that quantifies information we have about them

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

undefrit models

A

experience high bias—they give inaccurate results for both the training data and test set

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

overfit models

A

experience high variance—they give accurate results for the training set but not for the test set

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

bias

A

average distance between predictions and the truth

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

bias-variance trade off

A

high bias ⇒ underfiting on training data
high variance⇒overfitting on training data
low bias ⇒ adapt well on training data
low variance ⇒ generalise well on unseen data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

high bias + high variance

A

high average training error, inconsistency in prediction

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

high bias + low variance

A

high average training error, consistent

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

low bias + high variance

A

low average training error, inconsistency in prediction

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

low bias + low variance

A

low average training error, consistent

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

decision rule

A

reject H0 for large values of T; so if T ≥ c. T € K := [c, ♾️)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

acceptance region

A

the set of values t, for which H0 is accepted, so K^c

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

rejection region

A

the set of vales t, for which H0 is rejected, so K

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

critical value

A

the threshold value c of t between rejecting and not rejecting H0

17
Q

Type I Error

A

the probability of wrongly rejecting H0

18
Q

Type 2 Error

A

the probability of wrongly accepting H0

19
Q

significance level, α

A

the maximum allowable type 1 error

20
Q

Power of a test

A

the probability of rejecting the null hypothesis, given it is false
Power = 1 - P(Type II Error) = 1 - β

21
Q

small α →

A

more difficult to reject H0; Type II errors will become common, decreasing power

22
Q

large α →

A

Type II errors will become less common, increasing power

23
Q

P-value

A

the lowest significance level that results in rejecting the null hypothesis