10) Bayesian learning in practise Flashcards
What is the pdf of the beta distribution
What are the parameters of the prior known as
Hyperparameters, to distinguish them from the parameters of the likelihood function
In terms of mean parameterization Beta(πβ, πβ), how are the prior concentration parameter and the prior mean parameter set
- π0 = πΌ1 + πΌ2
- π0 = πΌ1/π0
What are the prior mean and prior variance for the beta-binomial model
What is the posterior distribution for the beta-binomial model
What are the posterior updates in the beta-binomial model
What does pseudodata mean in the context of Bayesian inference
Pseudodata refers to treating the prior distribution as if it represents additional data points. Specifically:
* Ξ±1 andΞ±2 act as pseudocounts that influence both the posterior mean and variance
* These pseudocounts influence the posterior distribution similarly to how real observed data does.
* The prior adds βvirtualβ observations, stabilizing estimates when actual data is sparse
What is shrinkage intensity in the Beta-Binomial model
Shrinkage intensity, Ξ», is a factor in the Beta-Binomial model that determines the weight of the prior mean in the posterior mean calculation
What does the shrinkage intensity indicate
This factor indicates how much the prior information influences the posterior mean compared to the observed data
When Ξ» =0: The posterior mean is equal to the ML estimate
When Ξ»β1: The posterior mean corresponds closely to the prior mean (π0)
What is shrinkage
The adjustment of the ML estimate towards the prior mean is called βshrinkage,β as π^ ML is βshrunkβ towards π0 which is often the target mean
What is a conjugate prior
If the prior and posterior belong to the same distributional family the prior is a conjugate prior
Why are conjugate priors useful
- Conjugate priors are computationally convenient
- They allow Bayesian learning by only updating prior parameters
- This avoids complex calculations
What happens to the posterior mean and variance in the Beta-Binomial model as the sample size n becomes very large
What is the Bayesian Central Limit Theorem