Questions #1 Flashcards

1
Q

True or false : Grid approximation is a technique to approximate the posterior distribution

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Tell the steps for the grid approximation

A
  1. We select points of theta for which we want to calculate grid
  2. For all the points, we do prior * likelihood
  3. We sum all the points to have the total
  4. You divide each point to the sum
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

True or false : Grid approximation is not very good tool when there is more than 1 parameter

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Can you tell me the definition of a 95% confidence interval

A
  1. A 95 % confidence interval would contain the true value 95% of the time (No probability)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Can you tell me the definition of a 95% percentile interval / probability interval

A

A 95% percentile interval would contain the true value with a 95% probability

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

True or false : HDPI is the best interval we can have

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Tell the 3 disadvantages with the HDPI

A
  1. Harder to compute
  2. Harder to understand
  3. It has greater simulation variance
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

True or false : If the priors are flat, the results of the bayesian methods will be similar to maximum likelihood

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

True or false : We use flat priors when we are uncertain about the parameter values

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

True or false : In a model with 1 predictor, if the estimates for the intercept and coefficient of the predictor are highly correlated, centering the predictor may resolve the problem

A

true

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What are the 2 advantages of standardizing a predictor

A
  1. It may make interpretation easier. Adding 1 to a standardized variable is equivalent to adding 1 standard deviation. But if we want to interpret on the natural scale, more difficult
  2. Glitches that occur with large numbers are avoided. Standardizing is very useful when variables are squared or raised to a power
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is the meaning of centering?

A
  1. Centering consists of substracting the mean of the predictor from the predictor. After centering, the estimate is easier to interpret.
  2. Before centering, intercept = average value of the outcome when the predictors is at its average value
  3. After centering, intercept = average outcome
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Explain to me when we see a spurious relationship

A

When an irrelevant predictor is correlated to a relevant predictor, we can see a spurious relationship between the response and the irrelevant predictor

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Explain to me the purpose of a predictor residual plot

A
  1. Run a regression with all predictors except for 1
  2. Then regress the response on the residuals
  3. Show the relationship between the response and the omitted predictor after removing the effect of the other predictors
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Explain to me what is a counterfactual plot

A
  1. In a counterfactual plot, all predictors except for 1 are constant. The plot shows how the response reacts to change in one predictor
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

True or false : For cases where more than 1 variable has significant impact on the response, counterfactual plot may not be useful

A

true

17
Q

Explain to me what is a posterior prediction plot

A

Show the predicted vs observed values of the response

18
Q

What is a masked relationship?

A

When a relationship between a predictor and the response is not significant when it is the only predictor in the regression, but is significant when there is another predictor in the regression. Happening when both predictors are correlated to each other, but one is positively correlated with the response, and the other one is negatively correlated with the response

19
Q

True or false : Predictors that are too highly correlated result in a good model

A

False. It result in a poor model

20
Q

Complete : Generally, correlation with absolute value higher than … will lead to poor result

A

0.9

21
Q

True or false : Multicollinearity is part of a family of problems called non-identifiability

A

True

22
Q

How do we solve the asymmetry problem for the categorical variables in the bayesian regression?

A

By putting an indice on the intercept

23
Q

True or false : When predictors are multicollinear, the model makes poor predictions

A

False. The model still make good predictions

24
Q

True or false : Estimates of coefficients for multicollinear models have high variance

A

True

25
Q

True or false : The coefficients of predictors with positive correlation have negative correlation

A

True