Unit 1 Flashcards
statistics
science of collecting, organizing, analyzing and interpreting data to make decisions
data
information coming from observations, counts, measurements, or responses
quantitative
consists of numerical measurements and counts (quantity)
qualitative
consists of attributes, labels, and non-numeric entries (qualities someone has)
2 types of data sets
population and sample
population
consists of all outcomes, responses, measurements, or counts of interests
sample
subset of population (smaller group from the population)
frequency tables
shows the number of individuals in each “class”/category
frequency (f)
the number of data entries in each class
sample size (n)
the sum of the frequencies
relative frequency
portion/% of data that falls in each class.
Rel. frequency: frequency/sample size
rel. freq. always has a sum of one
cumulative freq
sum of the frequency in that class and all previous classes
mean
average
(mu): population mean
fancy n: sample size for proportion
x bar: sample mean (sum of x/n)
n: sample size for sample
median
value that lies in the middle of a data set (least to greatest)
mode
the number that occurs most often
–> if there are 2 modes, it is known as bimodal
range
difference between the max and min val.
outliers
the number that is far removed from the data set
fractile
numbers that are partitioned or divided into equal parts of an ordered data sets (ex: median)
quartile
divides the data set into 4 equal parts
Q1: first quartile (1/4th of the data)
Q2: second quartile (1/2th of the data, median)
Q3: third quartile (3/4th of the data)
InterQuartile Range (IQR)
measure of variation that gives the range of the middle portion (about 1/2) of the data
IQR: Q3-Q1
How to find outliers
S1) find Q1 and Q3
S2) find IQR (Q3-Q1)
S3) multiply IQR by 1 1/2 (1.5)
S4)
a) Q1-IQR(1.5): number below this are outliers
b) Q3+IQR(1.5): numbers above this are outliers
deviation
difference between the entry (x) and the mean (mu, x bar)
–> the sum of the deviation is always 0
sum of the square (SS sub x)
each deviation is squared. Then added up.