unit 2 - chapter 8 - confidence intervals Flashcards
we want…
Want full information not partial but we aren’t working with full information
sampling, samples and statistics
Risk and uncertainty
CI are an assignment of risk quantifying risk and the risk attached to the uncertainty of any statistic
Risk =/= injurious - if can quantify the risk – which CI can do– you cna make decisions with some sense of awareness
confidence intervals
CI are an assignment of risk quantifying risk and the risk attached to the uncertainty of any statistic
We want the sample to represent the population so
we take a good sample
the confidence interval gives us…
CI gives us CONTEXT
For interpreting our single statistic about how good is our sample about mew’s that would be consistent with you data
X bar taking a selfie and mew shows on the phone… sample is a snapshot of population average
the confidence interval equation
mew = zbar +- z(a/2) (s/square root of n)
X bar = sample mean
Z (a/2) = confidence level coefficient (z score)
(S / square root of n) = standard error of the mean
reporting a confidence interval
- CI [LB,UB]
- LB =< u =< UB
the confidence interval equation and the law of large numbers
- As n increases, the observed stat moves closer to the true parameter
- As n increases, variation within a sample decreases
- Smaller samples are identified with greater (variation) uncertainty
comparison of z distibutions
- profile: symmetrical/bell curve
- shape: statistic
- mean: mew = 0
- std deviation: s = 1
comparison of t distibutions
- profile: symmetrical/bell curve
- shape: bell curve
- mean: change (n)
- std deviation: => 1
t distributions
sampler (smaller?) distribution
Smaller n
If n = 30 then t and z are pretty close
T distribution accounts for the greater uncertainty associated with small samples (n decreases variation increases)
what distribution should we use?
sample size vs do you know sigma
sample <30 and ? sigma
t
sample >30 and sigma
z
sample < 30 and ? sigma
z
sample > 30 and sigma
z
when do we use a t distribution
when we do not know sigma AND the sample size is less than 30
t distribution and confidence interval
- Bigger value then t distribution is wider and wider confidence intervals as opposed to the z distribution
- don’t Want this because more risk is with smaller distribution
- don’t Want bigger confidence interval
never compromise vs trade offs
never compromise
1. Small sample
2. High confidence
3. Tight interval
trade-offs
- Built into the formula
- Ideally we want these 3 things, but we cant have them all realistically…