Lecture 3 Flashcards
Central tendency
the tendency of data to cluster, or center, about certain numerical values
variability
the spread of the data
mean
the sum of observed values in a data set divided by the number of observations
median
middle number when the measurements are arranged in ascending (or descending) order; if the number of observations is odd, then the sample median is the observed value exactly in the middle of the ordered list; if the number of observations is even, then the sample median is the number halfway between the two middle observed values in the ordered list
mode
the most frequently occurring data element
x with line over it
sample mean
mu symbol
population mean
M
sample median
n with long tail
population median
the sample mean is often use to estimate…?
the population mean
the accuracy of using the sample mean to estimate the population mean depends on?
size of the sample and variability of data
right skew?
typically the median is less than the mean
if the data set is symmetric then?
the mean equals the median
if the data set is skewed to the left then?
typically the mean is less than the median
when do choose mode?
when calculating measure of center for the qualitative variable
when to choose mean?
variable is quantitative with symmetric distribution
when to choose median?
quantitative variable with skewed distribution
range
the distance between the largest measurement and the smallest measurement in a data set
sample standard deviation
formula is minusing each value with the sample mean and squaring that and then dividing that by n-1 and square root the whole thing
s^2 symbol
sample variance
sigma^2 symbol
population variance
s symbol
sample standard deviation
sigma symbol
population standard deviation
formula for sample variance
same as standard deviation but without the square root