Numerical Descriptive Data Flashcards
allow you to characterize your data into several properties
Descriptive Measures
the three commonly used measures under descriptive study are:
- measures of the center
- measures of the location, - - measures of dispersion
is a single value that represents the center point of a dataset. It helps to summarize the data and describe its main characteristics
Measure of Central Tendency
also known as AVERAGE and the most used measure of the center
Mean
Also known as Mean
AVERAGE
to compute the mean?
simply add all the values in the given data set and divide the summation by the count or number of values
is the symbol of mean and is also called x-bar
x̄
is the score found in the middle of an arranged data set
Median
to find for the median?
simply arrange the scores or data set either in ascending or descending order then use the formula Median = (n+1/2)^th member
is the symbol of median and is also called x-tilde
x̃
if the number of values in the data set is even
arrange the scores or data set either in ascending or descending order then get the average of the two middlemost scores
the most frequently occurring score in the data set
Mode
is the symbol of mode and is also called x-hat
x̂
to determine the mode?
you might again order the scores and then count the frequency of each value
can mode not be unique?
Yes
A distribution may also contain (?) or more modes (?)
- bimodal
- multimodal
or dispersion measure how spread out a set of data is
Measure of Variation
when the data set obtained a large value of measure of variation,
then the values are said to be scattered (high variability)
when the data set obtained a small value of measure of variation
then the values are said to be clustered or closed to one another (low variability)
is the difference in the maximum and minimum values of a data set
Range
the range is ________ but it is very much affected by __________
easy to calculate
extreme values
is the expectation of the squared deviation of a random variable from its mean, and it informally measures how far a set of (random) numbers are spread out from their mean
Variance
is the most common measure of variation. It provides a numerical measure of the overall amount of variation in a data set and can be used to determine whether a particular data value is close to or far from the mean
Standard Deviation
if the number belong to a population, in symbols a deviation is
X - μ
For sample data, in symbols a deviation is
x - x̄
represents the population standard deviation
the greek letter σ (sigma, lowercase)
represent the sample standard deviation
the lower-case letter s
Variance and Standard Deviation?
- to calculate the standard deviation, we need to calculate the variance first
- the variance is the average of the squares of the deviations
- deviation of values for a for a sample is (X-Mean), while deviation of values for a population is (x-μ)
STEPS IN COMPUTING THE VARIANCE AND STANDARD DEVIATION
- compute the mean
- subtract each of the data from the mean
- square the deviations, then find the total sum
- apply the formula to compute for the variance
- then take its square root to get the standard deviation
Measure of Central Tendency
- Mean
- Median
- Mode
Measure of Variation
- Range
- Variance
- Standard Deviation