Questions #2 Flashcards by Jean-Philippe Chagnon

True or false : Putting too many parameters into a model results in overfitting the model

True

How well did you know this?

Not at all

Perfectly

True or false : The best model should provide the most information

True

How well did you know this?

Not at all

Perfectly

Can you tell me the definition of Information

The information of an outcome is defined as the decrease in uncertainty from observing the outcome

How well did you know this?

Not at all

Perfectly

What are the 3 properties for a measure of uncertainty?

Continuity : Should be a continuous function of the parameters of the distribution
Additivity
Monotonicity

How well did you know this?

Not at all

Perfectly

What are the unique measure of uncertainty that satisfies the 3 properties ?

Information entropy

How well did you know this?

Not at all

Perfectly

What is the definition of cross-entropy?

Cross-entropy is a measure of uncertainty of using a different distribution with event probabilities q, to estimate a distribution with the same events with probabilities p

How well did you know this?

Not at all

Perfectly

True or false : Cross-entropy is symmetric

False

How well did you know this?

Not at all

Perfectly

True or false : Using a low-entropy distribution to predict a high entropy distribution is worst than the opposite

True

How well did you know this?

Not at all

Perfectly

True or false : The Kullback-Leibler Divergence grows as the esitmate moves away from the true distribution

True

How well did you know this?

Not at all

Perfectly

True or false : The Kullback-Leibler Divergence is symmetric

False

How well did you know this?

Not at all

Perfectly

Define the Deviance formula

-2 times the loglikelihood

How well did you know this?

Not at all

Perfectly

True or false : The lower the deviance, the better

True

How well did you know this?

Not at all

Perfectly

Tell me the steps to calculate LPPD

At each point, take the average of the sample
Log the average
Sum all the logs over all points

How well did you know this?

Not at all

Perfectly

True or false : Deviance is measure of predictive accurary, not of truth

True

How well did you know this?

Not at all

Perfectly

True or false : When doing cross-validation, to truly use deviance or lppd as measure of accuracy, it should be calculated on the test data

True, because the deviance will be lower on the training data when we add parameters, even if they are not relevant

How well did you know this?

Not at all

Perfectly

Complete : One way to avoid overfitting is to use ..

Study These Flashcards

regularizing prior

What is the regularizing prior?

Study These Flashcards

A regularizing prior is one that contains information. The more informationit has, the stronger the regularization

True or false : A regularizing prior is skeptical of the information; the stronger the regularization, the more data is needed to overwhelm it

Study These Flashcards

True

True or false : LOOCV does not require lots of computer runs

Study These Flashcards

False. It requires a lot of computer runs

Can you tell me an alternative to LOOCV

Study These Flashcards

Pareto-Smoothed importance sampling cross-validation (PSIS)

Can you tell me an alternative to LOOCV which is not PSIS

Study These Flashcards

Information criteria

Define the formula of the AIC with the deviance

Study These Flashcards

Use the D of the training data

When we are calculating AIC, we are using the deviance of the training data. AIC can be calculated with the deviance of the test if : (3)

Study These Flashcards

The prior is flat
The posterior is approximately multivariate normal
The size of the same is much greater than the number of parameters

True or false : WAIC and PSIS are similar for ordinary linear models

Study These Flashcards

TRUE

Cross-validation and PSIS have higher variance as estimators of divergence, but WAIC has higher bias

True

Large differences between WAIC and PSIS imply one of them is unreliable

True

WAIC identifies highly influential observations, unlike PSIS

False, c'est le contraire

True or false : When doing the information criteria, it is important that each model has the same number of observations

True

True or false : Normal distribution has a heavier tail than the student distribution

False. Student has a heavier tail

Questions #2 Flashcards

(29 cards)