normal distribution and estimation Flashcards
differences for symmetric and asymmetric quantitative data
symmetric use mean and SD (bell shaped ND)
asymmetric use median min/max and Q1 and Q3
what is the sample and the population
sample is the data we use
pop. is theoretical group of all people we could have sampled (and from where the sample came from)
most samples bias but try remain representative
random sample is where each member of pop. has equal chance of being selected
subjective inference
using data from sample we can infer qualities for the pop. as a whole
N(u, o^2)
u-mean
o-SD
o^2 - variance
n(not N) - number in sample
check normality
symmetric box-lot, mean roughly equals median
mean+- 2 SD contains most of data
SD of sample means is known as the standard error
SE=pop.SD/(square root of n)
estimated SE uses sample SD
what is the SD
measure of variability of data
confidence interval
how well population mean is estimated
95% of sample means are +- 1.96 SE of population mean
so if sample mean (xbar) is within +-1.96SE of population mean then sample mean (xbar)+-1.96SE contains population mean(u)
95%confidence interval
will contain pop.mean with 95% confidence
confidence interval assumes large sample
> 30