Week 6 SCM (Bayes) Flashcards
what is bayesian statistics vs. frequentist statistics
- bayesian is a way of understanding data based on the odds of various outcomes
- the frequentist approach looks at the probability of different data under the null hypothesis (pvalues).
what is a latent process?
the name given to an unseen (random) process that generates observed data
what is a latent variable?
a variable that is not observed but rather inferred through a mathematical process
for example a population mean can be a latent variable that is inferred through the sampling of a population and analysis of the sample mean
what is bayesian inference?
the process of making inferences about a latent variable based on observed data, e.g estimating a population mean from the sample mean
how to estimate posterior distribution using bayesian inferences
- if we dont have specific information for the prior we can use a distribution as the prior such as a uniform distribution
- these priors can be uninformative or weakly informative
- its implies that all values are equally likely
what is the MAP (maximum a posterieori) estimate?
- the value in the distribution in which the probability is the highest
what does a 95% credible interval tell us?
- there is a 95% probability that the probability value of an outcome falls between these two values
- it can be used to show that the probability of an outcome is significantly above 0 suggesting that the outcome is likely to happen
how is it common to generate credible intervals?
by sampling from the posterior distribution and then computing quantiles from the samples
whats the bayes factor?
the bayes factor characterises the relative likelyhood of the data under two different hypothesis’s
what is the hypothesis test for the frequentist and bayesian approaches?
Frequentist test= p values
Bayesian= Bayes Factor
What is the estimation of uncertainty for the frequentist and bayesian approaches
Frequentist= maximum likelyhood estimate with confidence intervals
Bayesian= posterior distribution with highest density interval
where is the marginal likelyhood in the bayes formula
the marginal likelyhood is on the denominator of the equation
(the numerator of the equation includes the prior and the likelyhood)
what is the bayes factor formula
BF= P (data | H1) / P (data | H2)
how do you interpret bayes factors
> 150 is very strong significance
20-150 is strong
3-20 is positive
1-3 is not really worth a mention
why is the bayes factor better for assesing evidence for the null hypothesis
- because the bayes factor is comparing evidence for the two hypotheses it allows us to assess whether theres evidence for the two hypotheses
- we cant do this with standard null hypothesis testing because it starts with the assumption that the null is true