Bayesian Statistics Flashcards

1
Q

Give two reasons for why statistics is needed

A
  • To make inferences on the population

- Assessing the uncertainty of statements

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is conditional probability

A

Chance of one event occurring given that another event has occurred

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What does P(0) signify? Give a term for this

A

The plausibility of a certain value of 0 before seeing the statistics ( prior beliefs)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What does P(data|0)/P(data) signify?Give a term for this

A

How well did this value of 0 predict the data, compared to all other values of theta? (predictive updating factor)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What does P(0|data) signify? Give a term for this

A

The plausibility of 0 being equal to a particular value after seeing the data (Posterior beliefs)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Give 3 characteristics regarding beta distributions (range, shape, when values= 1)

A
  • Ranges from 1-0
  • Shape is determined by two values a and b
  • if a and b equal 1 it is uniform
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What does a flat prior distribution reflect? (a=1 b=1)

A

A prior distribution that reflects the
belief that all values of the proportion
are equally plausible, a priori, we call
this an uninformative prior

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What does a normally distributed prior distribution reflect? (a=5 b=5)

A
A prior distribution that reflects
the belief that values close to 0.5
are more plausible (i.e., there are
equal number of left/right-handed
people), a priori
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What does a right skewed prior distribution reflect? (a=2 b=6)

A
A prior distribution that reflects
the belief that values below 0.5
are more plausible (i.e., there are
more right-handed people), a
priori
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What statistic is used in this situation?

A

The observed proportion

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

How do we obtain the x% central credible interval?

A

We take x% of the most central posterior mass and see which 2 points are the thresholds

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What three flaws are presented in Bayesian Statistics?

A
  1. p-values measured against a sample (fixed size) statistic with some stopping intention changes with change in intention and sample size. i.e If two persons work on the same data and have different stopping intention, they may get two different p- values for the same data, which is undesirable.
    2- Confidence Interval (C.I) like p-value depends heavily on the sample size. This makes the stopping potential absolutely absurd since no matter how many persons perform the tests on the same data, the results should be consistent.
    3- Confidence Intervals (C.I) are not probability distributions therefore they do not provide the most probable value for a parameter and the most probable values.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What are parameters?

A

factors in the models affecting the observed data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is meant by the term models

A

Models are the mathematical formulation of the observed events

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is signified by P(data)?

A

P(D) is the evidence. This is the probability of data as determined by summing (or integrating) across all possible values of θ, weighted by how strongly we believe in those particular values of θ

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What two models contribute to the posterior belief? P(0/D)

A

The prior belief P(0) and the likelihood function P(D|θ)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

What mathematical function is used to represent the prior beliefs?

A

Beta distribution

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

What is denoted by p(h1)/p(Ho)

A

Prior beliefs about hypothesis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

show the mathematical notation for predictive updating factor in bayesian hypothesis testing

A

P(data|H1)/P(data|Ho)

20
Q

show the mathematical notation for posterior beliefs in bayesian hypothesis testing

A

P(H1|data)/P(Ho|data)

21
Q

What do prior odds specify?

A

How plausible a hypothesis is, relative to another hypothesis, before seeing the data

22
Q

What function does the predictive updating factor serve in bayesian hypothesis testing?

A

Displaying how well the alternative or null hypothesis predicted the data

23
Q

how do you calculate the savage dickey ratio

A

Take the ratio of the prior density and the posterior density at point of testing

24
Q

What is bayes factor BF10

A

The predictive updating factor of bayesian hypothesis testing

25
Q

What does it mean if BF10=20?

A

The data is 20 times more likely under H1 than H0

26
Q

State BF10=1/20 in a different way and what does this mean?

A

BF01=20; data is 20 times more likely under H0 than H1

27
Q

Give a general interpretation of bayes factor scores

A
1-3; anecdotal
3-10; Moderate
10-30; Strong
30-100; Very strong
>100; Extreme
28
Q

In two sided hypothesis testing the marginal likelihood will be _______

A

Lower

29
Q

What name is given to The concept of rewarding a more specific model, while it predicted the data
equally well

A

Parsimony

30
Q

What is the relative nature of the bayes factor?

A

A high Bayes factor does not mean the hypothesis is true, it just means it predicted the data better (or
less poorly) than the other hypothesis

31
Q

The prior distribution has an a=2 and b=6. New data comes in with 23 correct and 7 incorrect results. What is the new data for the distribution

A

a=25 b=13

32
Q

What domain is the prior distribution in for a difference in means

A

-infinity, infinity

33
Q

What is meant y a stretched beta distribution

A

the same as Beta, but then stretched to the domain [-1, 1]

34
Q

What happens to the likelihood distribution if the sample size increases?

A

It grows narrower

35
Q

What else does the sample size effect?

A

The correlation increases with the sample size

36
Q

What are differences between groups categorized by in psychology?

A

Delta (d); a standardised difference between groups

37
Q

What is the null hypothesis for a bayesian t test?

A

H: d=0

38
Q

What is the formula for a bayesian t test

A

P(d|data)= P(d)(P(data|d)/P(data)

39
Q

What is a cauchy distribution and when do we use it?

A

The Cauchy distribution is governed by a single shape parameter that determines how wide it is, we use it when the parameter is between infinite and -infinite. It is a t distribution with df=1.

40
Q

What type of prior distribution for a bayesian t test is usually used as an uninformative priori?

A

when 50% of the values of δ are located

between -0.707 and 0.707

41
Q

What priori do we usually use for a two sided alternate hypothesis?

A

A prior distribution that reflects
the belief that 50% of the values
of δ are located between -1.5 and
1.5 , a priori

42
Q

What priori do we usually use for a one sided alternate hypothesis?

A
A prior distribution that reflects
the belief that only positive values
of δ are possible and that 50% of
the values of δ are between 0
and 0.707, a priori
43
Q

What is meant by the Bayes factor

robustness check, or sensitivity analysis?

A

Exploring what would have happened if we had chosen a different value for the prior width.

44
Q

What does pre registration help prevent?

A

presenting exploratory research as confirmatory

research: exploratory research is used to generate theories and hypotheses; confirmatory research is used to test those

45
Q

What is conjugacy?

A

Using a beta distribution as a prior and a binomial likelihood to produce a posterior beta distribution.