numerical summary of measures Flashcards
frequency distributions from continuous data are defined by
two types of descriptors, also known as parameters:
-central tendency
-dispersion
Defined as the value used to represent the center or the middle of a set of data values.
central tendency
it locates observations on a measurement scale
central tendency
Describes the spread of values in a given data set
dispersion
Suggests how widely spread out the observations are
dispersion
the mean (x-bar) is used in
quantitative and dichotomous data
- the mean provides a single number that summarizes the data, making it easier to understand and compare datasets
it is the average value, or the sum of all the observed values divided by the total number of observations
mean
the middle observation of the ordered data
median
the most commonly observed value
mode
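A minimal sketch of the three measures above, using Python's standard `statistics` module. The `scores` list is hypothetical data made up for illustration.

```python
import statistics

# Hypothetical sample data, chosen only for illustration.
scores = [2, 3, 3, 5, 7, 10]

mean_value = statistics.mean(scores)      # sum of values / number of values
median_value = statistics.median(scores)  # middle of the sorted values
mode_value = statistics.mode(scores)      # most commonly observed value

print(mean_value, median_value, mode_value)
```

With an even number of observations, the median is the average of the two middle values (here 3 and 5).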
how can the mean misrepresent the data
an extreme value pulls the mean toward it, so the mean will not fall in the middle of the data
how do you find the mean, median and mode in Excel
mean: =AVERAGE
median: =MEDIAN
mode: =MODE
measures of dispersion are also known as
measures of spread
what can you infer if your measures of spread are
dispersed:
clumped:
dispersed: scattered, the values are not near each other
clumped: the values are near the average value
it is a statistical measurement of the spread between numbers; it measures how far each number in the set is from the average
variance
it is the average amount of variability in the dataset; it tells how far, on average, each value lies from the mean
standard deviation
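The two cards above can be worked through by hand. This sketch assumes the population formulas (dividing by the number of values) and uses a hypothetical `values` list.

```python
import math

# Hypothetical data, chosen only for illustration.
values = [2, 4, 4, 4, 5, 5, 7, 9]

n = len(values)
mean_value = sum(values) / n

# Variance: average of the squared distances from the mean.
squared_deviations = [(x - mean_value) ** 2 for x in values]
population_variance = sum(squared_deviations) / n

# Standard deviation: square root of the variance, in the data's own units.
population_sd = math.sqrt(population_variance)

print(population_variance, population_sd)
```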
the difference between the observed value and the expected value
mean deviation
mean deviation is also known as
deviation in statistics
it is the average deviation of the data points from the mean, median or mode of the data set
mean deviation
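A small sketch of the mean deviation, assuming it is taken around the mean and uses absolute differences (so positive and negative deviations do not cancel out). The `values` list is hypothetical.

```python
# Hypothetical data, chosen only for illustration.
values = [2, 4, 4, 4, 5, 5, 7, 9]

center = sum(values) / len(values)  # the mean is used as the center here

# Absolute distance of each data point from the center.
absolute_deviations = [abs(x - center) for x in values]

mean_deviation = sum(absolute_deviations) / len(values)

print(mean_deviation)
```

The same steps apply around the median or mode by changing how `center` is computed.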
how to find the range
subtract the lowest value from the highest value
*used only for small data sets
how can the range be misinterpreted
a single large outlier can inflate it, because only the most extreme values are taken into account
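To illustrate how one outlier changes the range, here is a short sketch with hypothetical numbers. The helper name `value_range` is made up for this example.

```python
def value_range(data):
    """Range = highest value minus lowest value."""
    return max(data) - min(data)

# Hypothetical data, chosen only for illustration.
typical = [12, 15, 17, 18, 20]
with_outlier = typical + [95]  # one extreme value added

range_typical = value_range(typical)
range_with_outlier = value_range(with_outlier)

print(range_typical, range_with_outlier)
```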
N-1
degrees of freedom
*to have room for error
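As a rough illustration of N-1, the sketch below compares dividing by N with dividing by N-1, assuming the usual sample variance formula. The data are hypothetical. `statistics.pvariance` divides by N and `statistics.variance` divides by N-1.

```python
import statistics

# Hypothetical data, chosen only for illustration.
values = [2, 4, 4, 4, 5, 5, 7, 9]

population_variance = statistics.pvariance(values)  # divides by N
sample_variance = statistics.variance(values)       # divides by N - 1

print(population_variance, sample_variance)
```

Dividing by N-1 gives a slightly larger result, which is the "room for error" when the data are only a sample.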
standardizes the spread
coefficient of variation
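A minimal sketch of the coefficient of variation, assuming it is computed as the standard deviation divided by the mean and shown as a percentage. The population standard deviation is used here as an assumption (some courses use the sample one), and the data are hypothetical.

```python
import statistics

# Hypothetical data, chosen only for illustration.
values = [2, 4, 4, 4, 5, 5, 7, 9]

mean_value = statistics.mean(values)
sd_value = statistics.pstdev(values)  # population SD (assumption)

# Spread expressed relative to the mean, as a percentage.
cv_percent = sd_value / mean_value * 100

print(cv_percent)
```

Because the result has no units, it can be used to compare the spread of datasets measured on different scales.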
values that split sorted data or a probability distribution into equal parts
quantiles
a statistical term that describes a division of observations into four defined intervals based on the values of the data and how they compare to the entire set of observations
quartiles
in descriptive statistics, the range of a data set is the size of the narrowest interval which contains all the data
range and IQR
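A short sketch of quartiles, range and IQR using `statistics.quantiles` from the Python standard library. The data are hypothetical, and the default "exclusive" method is assumed; other tools (including Excel) may use slightly different quartile rules.

```python
import statistics

# Hypothetical data, chosen only for illustration.
data = [1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11]

# n=4 returns the three cut points that split the data into four parts.
q1, q2, q3 = statistics.quantiles(data, n=4)

data_range = max(data) - min(data)  # width of the whole data set
iqr = q3 - q1                       # width of the middle half of the data

print(q1, q2, q3, data_range, iqr)
```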
which measure of central tendency is considered robust to extreme values
median
extreme values are also known as outliers.
the median ignores the extremes; even if you change the most extreme values dramatically, the median often stays the same or changes very little.
the median remains stable despite an extreme value, while the mean is heavily affected
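The point about robustness can be checked with a quick sketch. The numbers are hypothetical: the largest value is replaced with a much larger one, and the mean and median are compared before and after.

```python
import statistics

# Hypothetical data, chosen only for illustration.
original = [10, 12, 13, 15, 16]
with_extreme = [10, 12, 13, 15, 160]  # largest value made extreme

mean_before = statistics.mean(original)
mean_after = statistics.mean(with_extreme)

median_before = statistics.median(original)
median_after = statistics.median(with_extreme)

print(mean_before, mean_after)      # the mean shifts a lot
print(median_before, median_after)  # the median does not move
```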