L.9 - Bayesian ANOVA Flashcards

1
Q

Predictive quality

A

how well did the model/parameter predict the data?
- use predictive quality to update knowledge about the world, and use that knowledge to make new predictions

2
Q

general knowledge about Bayesian statistics

A
  • prior beliefs about parameters
  • prior beliefs about hypotheses
  • fully embrace each hypothesis, rather than only “reject” or “fail to reject”
3
Q

Bayesian Parameter Estimation

A
4
Q

what is a statistical model?

A
  • makes statements about which values of a parameter are likely (predicts one or more values)
  • through probability distribution
5
Q

what does image 1 show?

A
  • statistical models of probability distributions
  • they put all probability mass on one point each
  • points: point models, point hypotheses
  • make a statement about what the data will be equal to (Sarah: 0.5, Paul: 0.8)
6
Q

what is theta in statistical models?

A
  • the parameter of the statistical model
  • the value that the model indicates as the hypothesized result of the experiment
7
Q

what does image 2 show?

A
  • likely outcomes of data under the initial models
  • sampling distribution (Sarah’s model here = null hypothesis)
  • how likely each outcome is under each model
    > e.g. in Sarah’s model P(5) = 0.25, in Paul’s model P(5) = 0.04

try to understand the models by looking at the image

8
Q

how can we compute a one-sided or two-sided (frequentist) test by looking at the models?

A

(image 2)
- one-sided test: sum the probabilities of getting 8, 9 or 10 heads
- two-sided test: sum the probabilities of getting 0, 1, 2, 8, 9 or 10 heads
- we look at Sarah’s model (it corresponds to the null model)
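The two sums above can be checked with a minimal Python sketch (not from the lecture), assuming Sarah’s fair-coin model with theta = 0.5 and n = 10 flips:

```python
from math import comb

n, theta0 = 10, 0.5   # Sarah's (null) model: a fair coin

def pmf(k):
    """Probability of exactly k heads in n flips under theta0."""
    return comb(n, k) * theta0**k * (1 - theta0)**(n - k)

# one-sided: P(X >= 8) = P(8) + P(9) + P(10)
p_one = sum(pmf(k) for k in (8, 9, 10))
# two-sided: also add the mirror outcomes 0, 1, 2
p_two = p_one + sum(pmf(k) for k in (0, 1, 2))

print(round(p_one, 4))  # 0.0547 (= 56/1024)
print(round(p_two, 4))  # 0.1094 (= 112/1024)
```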

9
Q

Uniform models

A
  • models where the bets are more spread out
  • Alex’s model: all values of theta are equally likely (a probability distribution)
  • this is called a uniform distribution
  • see image 3
10
Q

One-sided models

A
  • only posits values on one side of the distribution
  • still relatively specific prediction
  • in Betty’s model, coin is biased towards heads
  • more specific than Alex’s model, but less specific than Sarah’s model
  • see image 4
11
Q

what are these models called in the Bayesian framework?

A
  • prior distributions
  • every model has assigned prior probabilities to all values of theta
  • in bayesian statistics, we update these prior distributions
  • some models “learn” more or less than others, depending on what they predict
    > e.g. Paul’s model will learn less than Betty’s model (better to start with a uniform distribution)
12
Q

which distribution is best for a binomial test?

A
  • the Beta distribution
  • it has a domain of [0,1] (same domain as a proportion)
13
Q

what is the likelihood function?

A
  • tells us how likely we are to get 8 heads out of 10, for different values of theta
  • likelihoods are based on the binomial formula
    > calculate the probability of observing 8/10 heads if theta equals any given value (e.g. 0.5)
  • k = 8, n = 10
  • see image 5, and try to understand the formula
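A minimal sketch of this likelihood function (k = 8, n = 10), evaluated at Sarah’s and Paul’s point values:

```python
from math import comb

n, k = 10, 8  # observed: 8 heads in 10 flips

def likelihood(theta):
    """Binomial likelihood of the observed 8/10 heads for a given theta."""
    return comb(n, k) * theta**k * (1 - theta)**(n - k)

print(round(likelihood(0.5), 3))  # 0.044 - Sarah's point value
print(round(likelihood(0.8), 3))  # 0.302 - Paul's point value
```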
14
Q

! important things to remember about the likelihood function

A

! NOT a probability distribution! (surface area does not sum to 1)
> we cannot make probabilistic statements (we do that only with prior and posterior distributions)
! same function regardless of the model

15
Q

how can we use the likelihood function?

A
  • we can use this function to see which values of theta are a good match for our observed data
  • if the likelihood is high, that value of theta matches the data well (it predicted the data well)
  • we want to reward values of theta that predicted the data well
    > we give them a boost in plausibility
16
Q

how can we determine which values should receive an increase or decrease in plausibility after observing the data?

A

through marginal likelihood

17
Q

Marginal Likelihood → P(data)

A
  • dependent on the prior model
  • the average likelihood across all the values predicted by the model
  • a single value per model
  • it takes all values in the model and computes their average likelihood
  • see image 6
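For Alex’s uniform model, this average can be approximated numerically; a sketch (analytically the value is exactly 1/11):

```python
from math import comb

n, k = 10, 8  # observed: 8 heads in 10 flips

def likelihood(theta):
    return comb(n, k) * theta**k * (1 - theta)**(n - k)

# Marginal likelihood under Alex's uniform prior: the average
# likelihood over all theta values in [0, 1], equally weighted.
grid = [i / 10000 for i in range(10001)]
marginal = sum(likelihood(t) for t in grid) / len(grid)

print(round(marginal, 3))  # 0.091 (analytically 1/11)
```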
18
Q

what does the marginal likelihood tell us?

A
  • on average, how well did the prior model predict the data?
  • which values were predicted better/worse than average?
    > in image 6, it shows how well Alex’s model predicted the data
19
Q

how does the marginal likelihood differ across the different models?

A
  • Alex: average over all the values between 0 and 1
  • Betty: average over all the values between 0.5 and 1
  • Sarah: likelihood at 0.5 only
  • Paul: likelihood at 0.8 only
20
Q

what are the marginal likelihoods in the models?
(look at the images)

A
  • the yellow bars represent the marginal likelihood
  • give us the probability of observing 8/10 heads if this model is the right model
    ! how likely are data under this model, on average, across all values in that model
21
Q

how do we use the marginal likelihood (m.l.) to determine what values get a boost in plausibility?

A
  • in the graph of the marginal likelihood, the values of theta whose likelihood lies above the average m.l. get a boost in plausibility
  • values worse than average receive a penalty in plausibility
22
Q

for which values of theta is the likelihood higher than the marginal likelihood?

A
  • see picture 7
23
Q

posterior distribution - what can we see from the graph?

A
  • see image 8
  • the posterior distribution is a lot higher than the prior distribution for values that predicted the data better than average
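Under Alex’s uniform prior (which is a Beta(1, 1) distribution), the posterior follows from conjugate updating; a minimal sketch:

```python
# Beta-binomial conjugacy: a Beta(a, b) prior plus k heads and
# n - k tails gives a Beta(a + k, b + n - k) posterior.
a, b = 1, 1    # Alex's uniform prior is Beta(1, 1)
k, n = 8, 10   # observed data: 8 heads in 10 flips

a_post, b_post = a + k, b + (n - k)
post_mean = a_post / (a_post + b_post)

print(a_post, b_post)       # 9 3 -> posterior is Beta(9, 3)
print(round(post_mean, 2))  # 0.75
```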
24
Q

posterior distribution - things to remember!

A
  • probability distribution
  • we can use this to make probabilistic statements about a parameter
25
Q

Central credible interval

A
  • take the middlemost 95% of the posterior distribution
  • the credible interval tells us how likely it is to contain the true value
26
Q

what is the credible interval in Alex’s model? what does it mean?

A
  • from 0.48 to 0.94
  • if Alex’s model is the true model, there is a 95% probability that theta is between 0.48 and 0.94
  • this is a probabilistic statement about the true value of the parameter

! conditional on the model we are working with !
(a different credible interval for each different prior distribution)
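The 0.48–0.94 interval can be reproduced from the Beta(9, 3) posterior; a sketch assuming Alex’s uniform prior and the 8/10 heads data:

```python
from scipy.stats import beta

# Posterior under Alex's uniform prior after 8 heads in 10 flips
lo, hi = beta.interval(0.95, 9, 3)  # central (middlemost) 95%

print(round(lo, 2), round(hi, 2))  # 0.48 0.94
```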

27
Q

sensitivity analysis - robustness check

A
  • does the credible interval change much if I tweak my prior distribution?
  • how robust is my conclusion across different prior distributions?
28
Q

Take home messages (p.1)

A
  • Bayesians quantify uncertainty through distributions
  • the more peaked the distribution, the lower the uncertainty
    > but those very certain models don’t learn very well
  • incoming information continually updates our knowledge
    > today’s posterior is tomorrow’s prior
29
Q

Bayesian Hypothesis Testing

A
30
Q

H0 and Ha in Bayesian statistics

A
  • H0: null hypothesis; no difference (theta = 0.5)
  • Ha: considers multiple values of theta

> just two different models with marginal likelihoods

31
Q

What is the formula for Bayes’ theorem in H0 and Ha terms?

A
  • see image 9
  • P(Ha|data) / P(H0|data) = [P(data|Ha) / P(data|H0)] × [P(Ha) / P(H0)]
    > posterior odds = Bayes factor × prior odds
  • through this formula, we update our beliefs about the hypotheses (which one is more likely?)
32
Q

What is the Bayes Factor? (BF)

A
  • the updating factor to go from prior beliefs about the hypotheses to posterior beliefs
  • a ratio of marginal likelihoods (compare the m.l. of Ha to the m.l. of H0)
  • a single number that quantifies the evidence in favour of one hypothesis over the other
33
Q

how can we calculate the Bayes factor?

A
  • the ratio between the probability of the data under all the values of the alternative hypothesis and the probability of the data under all the values of the null hypothesis
  • BF = P(data|Ha) / P(data|H0)
34
Q

what does a BF10 of 3 mean?

A

BF10 = 3
- 1: Ha
- 0: H0
= the data are 3 times more likely under the alternative model than under the null model

! the greater the BF, the more evidence we have in favour of one hypothesis over the other

35
Q
  • how can we compute the BF in Sarah’s vs Alex’s model?
  • how do we interpret the BF that we obtain?
A

1a) how likely is getting 8 heads, on average, under all the values in Sarah’s model?
> 0.04
1b) how likely is getting 8 heads, on average, under all the values in Alex’s model?
> 0.09
2) we calculate the ratio of these marginal likelihoods
> BFsa: 0.04/0.09 = 0.44
→ the data are 0.44 times as likely under Sarah’s model as under Alex’s model
> BFas: 1/0.44 = 2.25 (or 0.09/0.04)
→ the data are 2.25 times more likely under Alex’s model than under Sarah’s model
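These marginal likelihoods can be checked numerically with a sketch; note the card’s 0.44 and 2.25 come from the rounded values 0.04 and 0.09 — the unrounded ratios are about 0.48 and 2.07:

```python
from math import comb

n, k = 10, 8  # observed: 8 heads in 10 flips

def likelihood(theta):
    return comb(n, k) * theta**k * (1 - theta)**(n - k)

# Sarah's point model: marginal likelihood = likelihood at 0.5
ml_sarah = likelihood(0.5)

# Alex's uniform model: average likelihood over a theta grid
grid = [i / 10000 for i in range(10001)]
ml_alex = sum(likelihood(t) for t in grid) / len(grid)

print(round(ml_sarah, 2), round(ml_alex, 2))  # 0.04 0.09
print(round(ml_sarah / ml_alex, 2))           # BFsa ~0.48
print(round(ml_alex / ml_sarah, 2))           # BFas ~2.07
```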

36
Q

What are the rules to interpret Bayes factors?

A

BF → evidence

1 - 3 → anecdotal
3 - 10 → moderate
10 - 30 → strong
30 - 100 → very strong
>100 → extreme
! these are just guidelines; small differences between BFs are not important
! usually, we look at BFs over 10 (Johnny’s opinion)
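The guideline categories above are easy to encode; a sketch, assuming the BF is expressed so that it is ≥ 1 (i.e. in favour of the better-predicting hypothesis):

```python
def evidence_label(bf):
    """Map a Bayes factor (>= 1) onto the guideline evidence categories."""
    if bf > 100:
        return "extreme"
    if bf > 30:
        return "very strong"
    if bf > 10:
        return "strong"
    if bf > 3:
        return "moderate"
    return "anecdotal"

print(evidence_label(2.25))  # anecdotal
print(evidence_label(8))     # moderate
print(evidence_label(150))   # extreme
```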

37
Q

how can BFs be represented?

A
  • see image 10
  • try to understand the graphs!
38
Q

what are the advantages of the Bayes factor?

A
  • it is a continuous assessment of evidence in favour of one or the other hypothesis
    > no black and white reasoning about statistical significance (such as in frequentist stats)
  • it allows us to monitor the BF as we gather data
    > not possible with frequentist stats
  • differentiates between the evidence of absence and the absence of evidence
39
Q

Bayesian ANOVA

A
39
Q

Evidence of absence vs Absence of evidence

A
  • evidence of absence: data supports H0
  • absence of evidence: data are not informative (BF close to 1)
40
Q

ANOVA in regression formula
- what does each value represent?

A
  • yi = b0 + b1·xi
  • yi: observed variable
  • b0: intercept
  • b1: group difference (regression weight)
  • xi: indicates whether we are predicting for the control (xi = 0) or the experimental (xi = 1) condition
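A sketch of this dummy coding (the numbers are hypothetical, just to show why b1 is exactly the group difference):

```python
# x = 0 codes the control condition, x = 1 the experimental condition,
# so b0 is the control-group mean and b1 is the group difference.
b0, b1 = 5.0, 1.5  # hypothetical intercept and group difference

def predict(x):
    return b0 + b1 * x

print(predict(0))  # 5.0 -> control-group prediction
print(predict(1))  # 6.5 -> experimental-group prediction
```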
41
Q

what does b1 represent?

A
  • group difference
  • it is the parameter of interest
  • if b1=0 → no group difference
42
Q

what are the null and alternative hypotheses in terms of the ANOVA regression formula?

A
  • H0: b1=0
    > H0: there is no difference between conditions
  • Ha: b1≠0
    > there is a difference between conditions
43
Q

what are the models to test one independent variable with two conditions?

A
  • see image 11
  • different from frequentist statistics: here we specify a distribution for the alternative model
    → we specify a probability distribution for b1
44
Q

what are the domains of Beta and b1?

A
  • Beta: [0, 1]
  • b1: (-∞, ∞)
    → we have to use a distribution that matches the domain of b1
45
Q

what distribution can we use?
- what is it? when is it used?

A
  • the Cauchy distribution
    > used as the prior distribution
    > a t-distribution with a single degree of freedom
    > conventionally used when talking about a difference in means
  • see image 12
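The identity “Cauchy = t-distribution with one degree of freedom” can be verified directly with scipy:

```python
from scipy.stats import cauchy, t

# The standard Cauchy density is identical to a t density with df = 1
for x in (-2.0, 0.0, 0.7, 3.0):
    assert abs(cauchy.pdf(x) - t.pdf(x, df=1)) < 1e-12

print("Cauchy(0, 1) == t(df = 1)")
```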
46
Q

what does the Cauchy distribution allow?

A
  • it allows b1 to take any value
  • this matters when comparing it to the null model (which fixes b1 at 0)
  • see image 12
47
Q

What do both models show?
(image 12)

A
  • both the null and the alternative model make predictions of how the data would look if that model were right
  • each model has its own marginal likelihood
    > what is the average likelihood of the observed data under this model?
48
Q

what is the marginal likelihood for the alternative model?

A
  • we can calculate the m.l. in the same way as with the binomial test, but now we condition on different values of our regression weight b1
  • see image 13
49
Q

what does the likelihood in the graph show?
(see image 13)

A
  • the data are most likely for b1 values close to 0.8-0.9
  • the data are very unlikely for b1 values close to 0
50
Q

what does the marginal likelihood for our alternative model (M1) and null model (M0) show?
(see image 13&14)

A

! the average likelihood of the data across all the values in a specific model
> dependent on the model we have
- in M0, the marginal likelihood equals the likelihood at b1 = 0, because that is the only value the null model predicts
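A simplified sketch of this comparison — NOT JASP’s exact computation: it assumes a single observed standardized effect with a known standard error, and both numbers are hypothetical, not the beer data:

```python
from scipy.stats import norm, cauchy
from scipy.integrate import quad

b1_hat, se = 0.85, 0.3  # hypothetical observed effect and standard error

def likelihood(b1):
    """How likely is the observed effect if the true effect is b1?"""
    return norm.pdf(b1_hat, loc=b1, scale=se)

# M0 fixes b1 at 0, so its marginal likelihood is just likelihood(0)
ml_null = likelihood(0.0)

# M1 places a Cauchy(0, 0.707) prior on b1: average the likelihood
# over all b1 values, weighted by that prior
ml_alt, _ = quad(lambda b1: likelihood(b1) * cauchy.pdf(b1, scale=0.707),
                 -30, 30)

print(round(ml_alt / ml_null, 2))  # BF10 for this toy example
```

Here the data sit far from b1 = 0, so the averaged likelihood under M1 beats the null’s likelihood at zero and BF10 comes out above 1.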

51
Q

how can we interpret marginal likelihoods?

A
  • the likelihood around b1 = 0 is incredibly low
  • the marginal likelihood of M0 is therefore also very low (the only b1 value it predicts is zero)
    ! we must compare the marginal likelihoods of two models to interpret them
52
Q

how do we calculate the BF based on marginal likelihood?

A
  • P(data|M1) / P(data|M0)
  • see image 15
53
Q

how can we calculate a Bayesian ANOVA with 2 independent variables and 2 groups each?
- what are the models?

A
  • see picture 16
  • tastiness = b0 + b1·alcoholic + b2·correct
  • M0: no effect of alcohol on tastiness ratings and no effect of being correct on tastiness ratings
  • Ma: model with a main effect of alcohol (b1)
  • Mc: model with a main effect of correctness (b2)
  • Ma+c: model with intercept b0 and both main effects
    ! this is how we compute a factorial ANOVA
54
Q

how does Bayesian factorial ANOVA differ from frequentist factorial ANOVA?

A
  • Bayesian factorial ANOVA constructs 4 models and calculates how well each model predicted the data, across all values in that specific model
55
Q

how does each model differ in factorial vs one-way Bayesian ANOVA?

A
  • now each model has two prior distributions (one per predictor)
    > in this case, each model has a prior distribution for alcohol, and one for correctness
  • see image 17, 18 & 19
56
Q

JASP- Bayesian paired-sample t-test

A
  1. Bayesian paired-sample t-test
    > to see whether there is a difference between the alcoholic vs non-alcoholic ratings
    - see image 20
57
Q

how would the distribution change if we have a one-sided alternative hypothesis?

A
  • for Ha: alcoholic beer is tastier
    > see image 21
  • for Ha: non-alcoholic beer is tastier
    > see image 22
    > now there is evidence in favour of the null hypothesis (the alternative model does worse than the null)
    !! the side of the alternative hypothesis matters !!
58
Q

Robustness check

A
  • we can assess the robustness of the test with “Robustness check” under “Plots”
  • e.g. would I get a completely different result if I had used a prior width of 1 instead of 0.7?
    - see image 23
59
Q

Bayesian Repeated Measures ANOVA

A
  • basically a paired-samples t-test
  • comparing two within-subject groups
    > the alcohol content of the beer is the repeated measure
    > we add correctness as a between-subject variable
    = we get different types of Bayes factors
  • see image 24
60
Q

How can we interpret the BF10 in the table?

A
  • BF10: compares one model to another model
  • it compares each model specified in a row to the model that predicted the data best (highest marginal likelihood)
  • JASP puts the model with the highest marginal likelihood in the first row
  • the BF10 in the first row is always 1 (the model is compared with itself)
  • see image 25
61
Q

In image 25, how can we interpret the results?

A
  • the “Alcoholic + correctness” model predicts the data best
    > therefore its BF is 1 (that model is compared to itself)
  • the data are 0.8 times as likely under the “Alcoholic” model as under the model with two main effects
  • in the last rows, we can see that the last two models predicted the data much worse than the first model
62
Q

“compare to null” option

A
  • under the “order” option in JASP
  • through this option, the table is re-ordered
    > the null model is in the first row
    > each BF compares the model in that row to the null model
63
Q

model averaging

A
  • instead of quantifying evidence in favour of individual models, we can look at the quality of prediction of all models containing an effect, and compare those to the models without that effect
  • click the “effects” option in JASP to get the “analysis of effects” table
    > we get every effect in our design (e.g. alcoholic and correctness) with its BFincl
  • see picture 26
64
Q

what are BFincl?

A
  • inclusion Bayes factors
  • they quantify the evidence in favour of including a specific effect
  • last column in the table
  • they compare groups of models with the effect to groups of models without it
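A sketch of this comparison with hypothetical marginal likelihoods for the four models (assuming equal prior probabilities for every model, so the inclusion BF reduces to a ratio of average marginal likelihoods):

```python
# Hypothetical marginal likelihoods for the four beer-tasting models
ml = {
    "null": 0.002,
    "alcohol": 0.20,
    "correct": 0.003,
    "alcohol+correct": 0.25,
}

def bf_incl(effect):
    """Inclusion BF: average m.l. of models with vs without the effect."""
    with_e = [v for name, v in ml.items() if effect in name]
    without = [v for name, v in ml.items() if effect not in name]
    return (sum(with_e) / len(with_e)) / (sum(without) / len(without))

print(round(bf_incl("alcohol"), 1))  # 90.0 -> strong evidence for alcohol
print(round(bf_incl("correct"), 2))  # 1.25 -> close to 1: not informative
```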
65
Q

how do you interpret BFincl in our example of beer tasting?

A
  • e.g. compare all models that include alcohol with all models that don’t include alcohol
  • the data are 100,000 times more likely under the models with alcohol included
  • models with correctness → absence of evidence (BF close to 1)
65
Q

information button

A
  • blue “i” button
  • describes all settings and output
  • you can use it in the exam, but should still practice so that you don’t waste time
66
Q

what is the main pitfall of all the analyses we have done so far?

A

!! we must pay attention to interaction effects !!
- under “models”, hold CTRL on the keyboard, select the two components, then drag the interaction into “model terms”
- see image 27

67
Q

how do we interpret the interaction effect in our example?

A
  • no big difference for the alcoholic beers whether people are correct or not
  • big difference for the non-alcoholic beers whether people are correct or not
    > people rate beers differently based on whether they correctly identify them
    = when incorrect, no difference in rating between alcoholic and non-alcoholic beers
    = when correct, there is a huge difference between non-alcoholic and alcoholic beers
    ! we can also flip the graph to make this clearer
67
Q

how can we plot interaction effects in JASP?

A
  • in “descriptives”:
    > put “alcoholic” on the horizontal axis
    > add the credible interval
    > put “correctness” under separate lines
  • see image 28
    ! always visualize your data !
67
Q

Interaction effects in our example

A
  • the interaction model is now in the first row, which means that it predicts the data best
68
Q

Posterior, graphs

A
  • see image 29 & 30