sampling distribution and estimation Flashcards
describe the sample mean ?
it is an estimate
a random variable drawn from the population
name the different types of distribution ?
- normal
- binomial
- chi squared
- f distribution
- t distribution
describe a normally distributed sample
- sample size of n
- sample mean xbar = sum of x / n
- variance = sigma(sd)^2 / n
- population mean is mew
- z transformation = sample mean - popmean / st,e
- standard error = root ( sigma^2 /n)
what is central limit theorem ?
the theory that as long as n > 25, you can use the methods you would use on normally distributed data to analyse it.
what is ?
a. an unbiased estimator ?
b. a biased estimator ?
a. an estimator ( for the mean ) that = the true unknown parameter
b. doesnt equal the true value
(mean strays from the centre of the distribution)
for a normally distributed sample, how do you estimate the population variance ?
the sample variance is an unbiased estimator so we calculate this:
s^2 = sum of ( xi - sampmean)^2 / n - 1 = sigma^2
what is high versus low precision ?
high precision = low variance
low precision = high variance
the steeper the distribution the lower the variance and the more preferred
do we prefer data to be precise or unbiased ?
generally we prefer unbiased data
for a sample with either N>25 or norm dist, how do you construct a confidence interval ?
first find the critical values for the size interval you need
transform these into critical values of the sampling mean
c L = sample mean - critval . sigma/root n
c U = “ but + not -
for a sample with either N>25 or norm dis, how do you estimate a population proportion ?
- sample size n
- sample prop p
- r no of successes
- pop prop = pie
we us binomial because there are only two options for outcomes, there fore the sample size must be > 25
p = r/n p = mean variance = pie (1-pie) / n
cU/L = p +/- Zcv.standardd (which is root( (p.(1-p)) / n)
what is the t distribution ?
it is very similar to normal distribution but it has one parameter V - degrees of freedom
how do you calculate confidence intervals or anything with t distribution ?
you calculated the same and normal but with different cvs.
v = n-1
t = sample mean - population mean / root (sample variance /n )
how do you conduct a hypothesis test ?
1) formulate the null and alternative hypotheses
2) chose the level of significance of the test
3) look up the correct critical values and set rejection region
4) calculate the test statistic
5) compares test stat to re region and make decision about hypotheses acceptance
how do you calculate the test statistic for
a. std normal
b. t dist
a. z = x - population mean / standard deviation
b. t = sample mean - population mean / root (sample variance /n )
what is type || error probability ?
the probability that we fail to reject the null hypothesis even when its true
the prob is equal to the area under the H1 is true distribution, to the left of the upper cv of the H0 is true distribution