Week 7 Goodness of fit tests Flashcards
what is the most widely used goodness of fit test
chi-squared test
what is the probability distribution of chi-squared equation
P(X^2; N) = 2^-(N/2) / Γ(N/2) X^(N-2)e^-(X^2/2)
N is the number of degrees of freedom
what is one possible interpretation of the chi-squared tests
if you conduct a large number of fits with an appropriate model to independent, each with N degrees of freedom then the chi-squared probability corresponds to the fraction of fits that result in a chi-squared value greater than the threshold
what is the most important criteria when selecting a specific model for best fit
its meaningfulness in terms of its physics interpretation
what is the Akaike Information Criterion equation
AIC = -2ln[L(x I ahat, M)] + 2k
ahat is the parameter that maximises the likelihood for model M which has k parameters
what is the Bayesian Information Criterion equation
BIC = -2ln[L(x I ahat, M)] +kln[n]
a is the parameter that maximises the likelihood for model M which has k parameters and n datasets
why is BIC better than AIC
only BIC is guaranteed to identify the true model among a set of candidates
what are the AIC and BIC chi-squared equations respectively
AIC = X^2 + 2k BIC = X^2 +kln[n]
when we have to compare two datasets what can we do if we know there deviation values
we compare the difference between two values with the combined uncertainty of the datasets
what is the Kolmogorov-Smirnov KS test used for
comparing a distribution that is fully defined of data events to a given distribution
what are the two equations associated with the KS test
D = max I cum(x)-cum(P) I d = Dsqrt(N) D is the largest difference between two distributions d is the scaled difference N is the number of trials