DECK 3: UNIT 1 part B (descriptive stats) Flashcards
When can you round?
AT THE VERY END!!! (keep at least 3 digits until end!)
What is a standard deviation?
average (typical) distance to the mean (about). It is how far you expect a random value to be away from the middle.
What is a Z score?
The number of standard deviaiton away from the mean
For information purposes, which gives LEAST… stem-leaf, histogram or box-whisker?
Box/Whisker, BE CAREFUL. you really don’t know how things are distributed. The box and whisker and fish tank give a very GENERAL look.
What is the mode?
the peaks of a histogram (the humps). or with categorical data, the most popular category
What are the percentiles for Q1, med, and Q3?
25, 50 and 75
How do students often mix up IQR and St. Dev
They INCORRECTLY think that Q1 is 1sd below the mean and Q3 is 1sd above the mean. THIS IS NOT TRUE!!! Q1 is only .67 sd above the mean and Q2 is .67 below
Does the IQR capture 68% of the data?
NO. it catches the middle 50%.
What percentile is the median (aka Q2)?
50th
What percent of the data is between Q1 and Q3?
50%
If the mean is above the median, the distribution may be
skewed right… the mean follows the tail
Another name for “skewed right” is
positively skewed
How many SD wide is the IQR in a normal distribution?
NOT 2!!!! Think about it. The middle 68% is 2 sd wide, since the IQR is only the middlest 50% it must be less than 2. try [invnorm(.75)] x2. You find that it is only 1.35 SD wide if the distribution is nearly normal.
What symbols do we use for population mean and sample mean?
Mu for population mean, xbar for sample mean.
What symbols do we use for population standard deviation and sample standard deviation?
Sigma for population and s for sample.
What percent of the data is above Q3?
25%
What percent of the data is below the median?
50%
What is the difference between categorical VARIABLES and categorical DATA?
The Variable is the overall category. Like “EYE COLOR”. The data is the actual measurement from the subjects. Like “blue, brown, blue”
How do you find percentiles and make a boxplot from OGIVE?
Go across till you hit the curve and then STRAIGHT DOWN!
Can numbers be CATEGORICAL?
sure. Zip codes, sports jersey numbers, telephone numbers, social security nunmbers, area codes… these are categorical.
what is the emperical rule?
mean 68-95-99.7 yeah!
When drawing a normal model, what are the PERCENTILES from left to right?
2.5, 16, 50, 84, 97.5
are any populations actually normal?
no, nothing is normal, just normalish. The only normal thing is the model we use.
If the distribution is skewed (or outliers/not symmetric) what would you use for center and spread statistics?
Median (center) and IQR (spread)