Questions #2 Flashcards

1
Q

True or false : Putting too many parameters into a model results in overfitting the model

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

True or false : The best model should provide the most information

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Can you tell me the definition of Information

A

The information of an outcome is defined as the decrease in uncertainty from observing the outcome

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What are the 3 properties for a measure of uncertainty?

A
  1. Continuity : Should be a continuous function of the parameters of the distribution
  2. Additivity
  3. Monotonicity
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What are the unique measure of uncertainty that satisfies the 3 properties ?

A

Information entropy

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is the definition of cross-entropy?

A

Cross-entropy is a measure of uncertainty of using a different distribution with event probabilities q, to estimate a distribution with the same events with probabilities p

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

True or false : Cross-entropy is symmetric

A

False

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

True or false : Using a low-entropy distribution to predict a high entropy distribution is worst than the opposite

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

True or false : The Kullback-Leibler Divergence grows as the esitmate moves away from the true distribution

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

True or false : The Kullback-Leibler Divergence is symmetric

A

False

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Define the Deviance formula

A

-2 times the loglikelihood

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

True or false : The lower the deviance, the better

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Tell me the steps to calculate LPPD

A
  1. At each point, take the average of the sample
  2. Log the average
  3. Sum all the logs over all points
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

True or false : Deviance is measure of predictive accurary, not of truth

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

True or false : When doing cross-validation, to truly use deviance or lppd as measure of accuracy, it should be calculated on the test data

A

True, because the deviance will be lower on the training data when we add parameters, even if they are not relevant

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Complete : One way to avoid overfitting is to use ..

A

regularizing prior

17
Q

What is the regularizing prior?

A

A regularizing prior is one that contains information. The more informationit has, the stronger the regularization

18
Q

True or false : A regularizing prior is skeptical of the information; the stronger the regularization, the more data is needed to overwhelm it

A

True

19
Q

True or false : LOOCV does not require lots of computer runs

A

False. It requires a lot of computer runs

20
Q

Can you tell me an alternative to LOOCV

A

Pareto-Smoothed importance sampling cross-validation (PSIS)

21
Q

Can you tell me an alternative to LOOCV which is not PSIS

A

Information criteria

22
Q

Define the formula of the AIC with the deviance

A

Use the D of the training data

23
Q

When we are calculating AIC, we are using the deviance of the training data. AIC can be calculated with the deviance of the test if : (3)

A
  1. The prior is flat
  2. The posterior is approximately multivariate normal
  3. The size of the same is much greater than the number of parameters
24
Q

True or false : WAIC and PSIS are similar for ordinary linear models

A

TRUE

25
Q

Cross-validation and PSIS have higher variance as estimators of divergence, but WAIC has higher bias

A

True

26
Q

Large differences between WAIC and PSIS imply one of them is unreliable

A

True

27
Q

WAIC identifies highly influential observations, unlike PSIS

A

False, c’est le contraire

28
Q

True or false : When doing the information criteria, it is important that each model has the same number of observations

A

True

29
Q

True or false : Normal distribution has a heavier tail than the student distribution

A

False. Student has a heavier tail