Stats - Week 2 Flashcards
Describing Distributions we look at what?
Measures of shape (Kurtosis and Skewness), central tendency (mean, median, and mode), “spread” or variation (range and variance & standard deviation)
Distribution shapes!
Normal distribution, Positive skew (left), Negative skew (right), Leptokurtic(Positive kurtosis), Platykurtic (Negative kurtosis)
Measures of central tendency =
= estimate the “center” of our data. Mode, Median, and Mean
MODE:
most frequent score in a distribution. A distribution can be bimodal or multimodal.
MEDIAN:
Middle score or 50th percentile
Arrange the scores in ascending order
Median = middle score if # of scores is odd
(average of middle 2 scores if # of scores is even)
MEAN:
Arithmetic average of the scores in a distribution.
The symbol for the mean of a population is omega;
The symbol for the mean of a sample is. (SUM) Mu is population and sample is x-bar
Which is most influenced by the skew: mean, median, or mode?
Mean
Measures of Variation Defined
The more variation in your data, the less precisely you can estimate the population’s location (e.g., mean) from the sample information.
Measures of variation are?
highest score minus lowest score(ie. Data hours of day spent on phone = 3, 4, 6, 7 so 7-3 = range of 4) and sum of squared errors “sum of squares” (gives the total deviation from the mean)(take every data point and subtract from mean and then square it to get rid of negative and then add all together. Want all numbers to be positive so we can actually see a variance.)
Range
(highest score minus lowest score)(ie. Data hours of day spent on phone = 3, 4, 6, 7 so 7-3 = range of 4)
sum of squared errors “sum of squares”
gives the total deviation from the mean
(ie. take every data point and subtract from mean and then square it to get rid of negative and then add all together. Want all numbers to be positive so we can actually see a variance.)
Variance
the average of the sum of squared deviations.
◦Is always a positive number
◦Accentuates the extreme differences
Standard Deviation
the standard deviation of a random variable, sample, statistical population, data set, or probability distribution is the square root of its variance.
Why? A measure of variance that is expressed in the same unit of measurement of the original data. Used more!
Z-scores (or “standard scores)
Tell how many standard deviations a raw score is from the mean.
e.g., z = 1.96 means 1.96 SDs above the sample mean.
I.e., deviations from the mean in SD units.
Any standardized variable has a mean = 0 and SD (& variance) = 1
(Does NOT mean variable is normally distributed)
Permits a standard way to compare across scores / measures
Z scores allow us to determine probabilities.E.g., the probability of a randomly selected student passing a class.
Null Hypothesis “significance Testing” Steps
Step 1: State the hypothesis.
Step 2: Set the criterion for rejecting the null hypothesis.
Step 3: Compute the test statistic.
Step 4: Decide whether to reject the null hypothesis.