Statistics Flashcards
P-value
probability of rejecting H0 with the value of the test statistic obtained from the data given H0 is true (small p value is preferred)
(if p value is small then it’s saying that the observation is unlikely to happen, so the data is statistically significant enough to reject the null hypothesis)
Confidence interval
measures the degree of uncertainty or certainty in sampling method, and is the range of plausible values for an unknown parameter (finding test statistic)
Odds ratio
describes the strength of the association between two events, ad/bc (the ratio of the odds of A in the presence of B and the ratio of the odds of A in the absence of B. p1/1-p1 / p2/1-p2)
Z distribution
normal distribution when variance is known with a large sample size
t distribution
normal distribution when variance is unknown with a small sample size
power
probability of avoiding a type 2 error (when the type 2 error is to accept a false hypothesis), (1 - p(type 11 error)
Type 1 error
h0 is rejected but h0 true
Type 2 error
h0 is accepted but h0 false
Unbiased
expected value of the estimator is the parameter (when the estimator is an estimate of the parameter)
MSE
variance(T) + bias(T)^2
consistent
statistic tends to the parameter as n increases
Method of moments
equating E(t) = mu and solving in terms of parameter, can also do for variance if sample has multiple parameters
MLE
method for choosing the ‘best’ parameter which maximises the probability that the parameter produces the sample
Sufficient
statistic is sufficient if it contains all the information about the parameter in it
Neyman-Fisher factorisation theorem
the likelihood can be factorised in terms of h(x) and g(t, theta) where t is a sufficient statistic
Cram´er-Rao lower bound.
smallest the variance of any unbiased estimator can become is 1/I(θ)