Midterm (also look at 1 and 2) Flashcards
z-scores
Tell us how many SDs a score is above or below the mean
z = deviation/ standard deviation
Standard normal distribution
- Perfectly symmetrical
- Bell-shaped: high probability in the center and lower probability in tails
- Area under it is = to 1; collectively all scores have a proportion = to 1 (percent = 100)
- Mean, median, mode are at center of distribution
- The tails (away from center) are asymptotic
- Defined by mathematical formula- a model for data of -
many variables - Areas under it are known
Probability
For a random event, a ratio- a number of ways the event can occur divided by the total number of possible events (sample space)
p (x) = f (x) / sample space
Properties of sample probabilities
Total range from 0 to 1 - 0 means event is impossible - 1 means the event is certain Can never be negative - Either possible or impossible
What is the difference between relative frequency and probability?
probability helps us PREDICT what could happen (whereas rf tells us what has already happened)
Probability Distributions
Goal: interest in knowing the probability of each possible outcome of a random variable
Sum of probabilities is 1
Also interested in:
Expected values- mean or avg. expected outcome of a random variable
The variance and standard deviation of a probability distribution- avg (typical) distance of outcomes from the expected value
2 goals of inferential statistics
Make unbiased estimates
Assess error made when making estimates
— Sampling Dist. does this
Sampling goal
obtain a subset of population that represents population
use random sampling
- Each element in pop. has = chance of being chosen
- Each selection is independent of other selections
sampling error
estimation of parameters involves this
Sampling Distribution
Distribution of a STATISTIC derived from all possible samples of a given size
Can imagine it but can never see it (because we don’t have all of the data)
But it is useful for:
- Establishing the rationale for estimation of population parameters from statistics
- Assessing the amount of error we are likely to make when using a statistic to estimate a parameter
Unbiased estimator
A statistic, over all possible samples of a given size, has a mean value = to the pop. parameter
Proven that:
M is an unbiased estimator of μ
- in the long run overestimates are offset by underestimates
Standard error
We could determine the typical amount of error we could expect to make by calculating SD of relevant sampling distribution
the SD of the sampling distribution
standard error of the mean
standard error for an estimate of the mean
Sm is an unbiased estimator of the typical amount of error made when using M to estimate μ
how do we get an unbiased estimator of pop. variance and SD?
By dividing degrees of freedom (n - 1) instead of sample size (n)
Degrees of freedom
the # of ways something is free to vary (the # of independent pieces of info needed to determine a statistic
For measures of variability (s2 and s), df = n –1, because once all but one deviation from the mean is known, the last one is set (is no longer free to vary)