Chapter 2: Hypothesis testing Flashcards
1
Q
- Parameter and statistic
- Statistical Inference
- Pop vs Sampling distribution
- Sampling bias and variability
A
- A parameter is a characteristic of the population; its true value is usually not known (mu, sigma, N). A statistic is a characteristic of a sample and is used to estimate the parameter (x-bar, s, n).
- Using information from a sample to draw conclusions about the wider population.
- The population distribution is the distribution of values of a variable among all individuals in the population. The sampling distribution is the distribution of values taken by a statistic in all possible SAMPLES of the same SIZE from the same POPULATION.
- Bias: how close the CENTER of the sampling distribution is to the true parameter (accuracy); Variability: how widely SPREAD the sampling distribution is (precision). Sampling bias can be eliminated by RANDOM sampling; sampling variability is reduced by choosing a larger SAMPLE. Variability does not depend on the population size as long as the population is at least 100 times larger than the sample.
2
Q
- Mean of sampling distribution of sample mean
- SD of sampling distribution of sample mean
- CENTRAL LIMIT THEOREM (CLT)
A
- Sample means are less variable and more nearly normally distributed than individual observations. The mean of the sampling distribution of x-bar equals the population mean, so x-bar is an unbiased estimator of mu: mu(x-bar) = mu
- The SD of the sampling distribution is SMALLER than the population SD: sigma(x-bar) = sigma/sqrt(n)
- The larger the sample size, the closer the sampling distribution of x-bar is to a normal distribution, irrespective of the population distribution, as long as the population has a finite SD. For large n, the SAMPLING DISTRIBUTION OF THE SAMPLE MEAN X-BAR is approximately normal: N(mu, sigma/sqrt(n))
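The three answers above can be checked with a small simulation. This is only a sketch: the exponential population (mu = 1, sigma = 1), the sample size n = 25 and the helper name simulate_sample_means are assumptions chosen for illustration, not part of the cards.

```python
import random
import statistics
from math import sqrt

def simulate_sample_means(n, num_samples=5000, seed=0):
    """Return the means of num_samples random samples of size n drawn from
    an exponential population (mu = 1, sigma = 1), which is strongly skewed."""
    rng = random.Random(seed)
    return [
        statistics.fmean([rng.expovariate(1.0) for _ in range(n)])
        for _ in range(num_samples)
    ]

n = 25
means = simulate_sample_means(n)
center = statistics.fmean(means)   # compare with mu = 1
spread = statistics.stdev(means)   # compare with sigma/sqrt(n)
theory = 1.0 / sqrt(n)             # sigma/sqrt(n) = 0.2

print(f"mean of x-bar: {center:.3f}  (mu = 1)")
print(f"SD of x-bar:   {spread:.3f}  (sigma/sqrt(n) = {theory:.3f})")
```

The simulated mean of x-bar should land near mu and its SD near sigma/sqrt(n), even though the population itself is far from normal.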
3
Q
How large a sample size is required for the sampling distribution of x-bar to be approximately normal?
A
More observations are required if the population distribution is far from normal, e.g. strongly skewed.
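One way to see this is to compare the skewness of simulated sample means from a symmetric and a skewed population. The sketch below assumes a uniform and an exponential population and sample sizes of 5 and 50; the helpers skewness and sample_means are hypothetical names introduced here.

```python
import random
import statistics

def skewness(values):
    """Sample skewness: average cubed z-score (near 0 for a symmetric shape)."""
    m = statistics.fmean(values)
    s = statistics.pstdev(values)
    return statistics.fmean([((v - m) / s) ** 3 for v in values])

def sample_means(draw, n, num_samples=5000):
    """Means of num_samples samples of size n; draw() returns one observation."""
    return [statistics.fmean([draw() for _ in range(n)]) for _ in range(num_samples)]

rng = random.Random(1)

def draw_expo():
    """One observation from a skewed (exponential) population."""
    return rng.expovariate(1.0)

skew_uniform = skewness(sample_means(rng.random, 5))     # symmetric population, n = 5
skew_expo = skewness(sample_means(draw_expo, 5))         # skewed population, n = 5
skew_expo_large = skewness(sample_means(draw_expo, 50))  # skewed population, n = 50

print(f"uniform, n=5:      {skew_uniform:.2f}")
print(f"exponential, n=5:  {skew_expo:.2f}")
print(f"exponential, n=50: {skew_expo_large:.2f}")
```

At n = 5 the means from the symmetric population already look close to symmetric, while the means from the skewed population still show clear skew that shrinks only as n grows.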
4
Q
Estimating with confidence
A
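The 68-95-99.7 rule says the probability is about 0.95 that x-bar will be within 2 SDs of mu, where the SD is that of the sampling distribution, sigma(x-bar) = sigma/sqrt(n). Equivalently, mu is within 2 sigma(x-bar) of x-bar.
About 95% OF ALL SAMPLES give an interval from x-bar - 2 sigma(x-bar) to x-bar + 2 sigma(x-bar) that CONTAINS the true mu.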
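The "95% of all samples" statement can be checked by repeated sampling. This is a sketch under stated assumptions: a normal population with mu = 50 and sigma = 10, samples of size 16, and a hypothetical helper named coverage_rate.

```python
import random
import statistics
from math import sqrt

def coverage_rate(mu=50.0, sigma=10.0, n=16, num_samples=4000, seed=2):
    """Fraction of samples whose interval x-bar +/- 2*sigma/sqrt(n) contains mu."""
    rng = random.Random(seed)
    se = sigma / sqrt(n)          # SD of the sampling distribution of x-bar
    hits = 0
    for _ in range(num_samples):
        xbar = statistics.fmean([rng.gauss(mu, sigma) for _ in range(n)])
        if xbar - 2 * se <= mu <= xbar + 2 * se:
            hits += 1
    return hits / num_samples

rate = coverage_rate()
print(f"capture rate: {rate:.3f}")
```

The printed rate should be close to 0.95. Each individual interval either contains mu or does not; the 95% describes the method over many samples.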
5
Q
CONFIDENCE INTERVALS
A
CI = ESTIMATE +/- MARGIN OF ERROR
Example: if sigma(x-bar) = 4.5, the margin of error is 2 x 4.5 = 9, so about 95% of all samples will capture mu in the interval x-bar +/- 9
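The arithmetic on this card can be written as a small function. The value sigma(x-bar) = 4.5 comes from the card; the sample mean of 100 and the function name confidence_interval are hypothetical, added only to make the example concrete.

```python
def confidence_interval(estimate, sd_of_estimate, multiplier=2.0):
    """Return (lower, upper) = estimate -/+ multiplier * sd_of_estimate."""
    margin = multiplier * sd_of_estimate   # margin of error
    return estimate - margin, estimate + margin

sigma_xbar = 4.5   # SD of the sampling distribution, from the card
xbar = 100.0       # hypothetical sample mean, not from the source

low, high = confidence_interval(xbar, sigma_xbar)
print(low, high)   # 91.0 109.0
```

The multiplier 2 is the rounded value from the 68-95-99.7 rule; a different confidence level would use a different multiplier.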
6
Q
CONFIDENCE LEVEL (C)
A
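The overall capture rate if the method is used many times, i.e. the probability that the method produces an interval that captures the unknown parameter.
C lies between 0 and 1 (e.g. C = 0.95 for a 95% confidence interval).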
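For a normal sampling distribution, the confidence level C determines the multiplier applied to the SD of the estimate. The sketch below assumes the standard normal model and uses Python's statistics.NormalDist; the name critical_value is a hypothetical helper, not something defined on the cards.

```python
from statistics import NormalDist

def critical_value(confidence_level):
    """z* such that the central area under N(0, 1) between -z* and z* equals C."""
    if not 0.0 < confidence_level < 1.0:
        raise ValueError("confidence level C must be strictly between 0 and 1")
    return NormalDist().inv_cdf((1.0 + confidence_level) / 2.0)

for c in (0.90, 0.95, 0.99):
    print(f"C = {c:.2f}  z* = {critical_value(c):.3f}")
```

For C = 0.95 the multiplier is about 1.96, which the 68-95-99.7 rule rounds to 2. A higher C gives a larger multiplier and therefore a wider interval.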