Lecture 2 Flashcards
what are 2 measures of dispersion?
how are they related to each other?
- variance
- standard deviation
standard deviation = sqrt (variance)
what are the two different types of estimators?
how do they differ?
- sample statistics - latin letters
- population statistics - greek letter
> sample statistics are random variables, they vary from sample to sample
> population statistics are parameters, they have a fixed (but mostly unkown) value
estimation of population variance
> when is estimator of population variance biased?
> when not?
population variance estimator
> is unbiased if population mean is known
> is biased if population mean is unkown, thus based on sample mean
> sample mean varies per sample, estimator of population variance mostly too small
if population mean is unknown
> solution for systematic underestimation of population variance?
underestimation of population variance due to unkown population mean
> divide by n-1 instead of n
what is the general principle behind the degrees of freedom?
whenever we use an estimated parameter, we lose a degree of freedom
how big is the total area below the curve of a standard normal distribution?
1
what are the one sided 5% cutoff values?
two sided 5% cutoff value?
one side cutoff 5%: 1.65
two sided cutoff 5%: 1.96
what is the SEM
SEM: standard error of the mean
> indication of reliability of m as an estimator of mu
what are the 3 assumptions of the central limit theorem
central limit theorem
- the mean of the distribution equals the population mean
- the standard deviation of the distribution of means equals SEM
- n increases, the shape of the sampling distribution of means approaches that of the normal distribution, regardless the shape of the population distribution
hypotheses always relate to …?
hypothese always relate to population statistics
what are 3 influences on the t test statistic?
t is more extreme as
- effect (m1-m2) is more extreme
- spread (s1-s2) is smaller
- n is greater
when does the t distribution approximate the z distribution?
only for large sample sizes
solution: student t distribution (william gosset)
how to treat matched pair designs with refard to analysis?
matched pair designs are classified along with repeated measure designs
> paired data