7.2: Means and Variance Flashcards
What is measures of central tendency? What can it be used to represent?
Measures of central tendency identify the centre, or average, of a data set.
It can be used to represent the typical, or expected, value in the data set.
How is population mean computed?
Population mean = sum of all observed values in the population / number of observations in the population.
What is sample mean and how is it computed?
Sample mean = sum of all the values in a sample of a population / number of observations in the sample.
Sample mean is used to make inferences about the population mean.
Define arithmetic mean and provide examples. What is it used to measure and what properties does it have (4) ?
Arithmetic mean is the sum of the observation values divided by the number of observations.
Examples of arithmetic means are population mean and sample mean.
It is the most widely used measure of central tendency and has the following properties:
- All interval and ratio data sets have an arithmetic mean.
- All data values are considered and included in the arithmetic mean computation.
- A data set has only one arithmetic mean such that the arithmetic mean is unique.
- The sum of the deviations of each observation in the data set from the mean is always zero.
What is the sum of mean deviations?
Sum of mean deviations = zero
What is weighted mean and how is it computed?
Weighted mean recognizes that different observations may have a disproportionate influence on the mean.
The weighted mean of a set of numbers is computed as the sum of observed value multiplied by its corresponding weight.
Define median and why it is important.
Median is the midpoint of a data set when the data is arranged in ascending or descending order.
Median is important because the arithmetic mean can be affected by outliers and when this happens, median is a better measure of central tendency than the mean because it is not affected by extreme values that may actually be the result of errors in the data.
Define mode, unimodal, bimodal, and trimodal.
Mode is the value that occurs the most frequently in a data set.
A data set may have more than one mode or no mode.
Unimodal: one value that appears frequently
Bimodal: two values that appear frequently
Trimodal: three values that appear frequently.
Define geometric mean and how it is computed. How is the computation of the geometric mean for a returns data set different?
Geometric mean is often used when calculated investment returns over multiple periods or when measuring compound growth rates.
Geometric mean is computed as follows:
G = (X1 x X2 x …. x Xn)^(1/n)
Geometric mean for a returns data set is computed as follows:
1 + Rg = [(1 + R1) x (1 + R2) x … x (1 + Rn)]^(1/n)
where Rg is the geometric mean return and Rn is the return for period t.
What is the relationship between geometric mean and arithmetic mean?
Equal when there is no variability in the observations (observations are equal). Geometric mean is always less than or equal to the arithmetic mean and the difference increases as the dispersion of the observations increases.
When is harmonic mean used and how is it computed?
Harmonic mean is used for certain computations, such as the average cost of shares purchased over time.
Harmonic mean = number of observations / sum of 1 over value of observation
What is the relationship between harmonic mean, arithmetic mean, and geometric mean?
When values are not equal: harmonic mean < geometric mean < arithmetic mean
Define quantile.
Quantile is the general term for a value at or below which a stated proportion of the data in a distribution lies.
Examples of quantiles include:
- Quartiles (the distribution is divided into quarters)
- Quintiles (the distribution is divided into fifths)
- Decile (the distribution is divided into tenths)
- Percentiles (the distribution is divided into hundredths)
How is the position of the observation at a given percentile?
Ly = (n +1)(y/100)
where y is the given percentile and n is the data points sorted in ascending order.
What are the two measures of location?
Measures of location are quantiles and measures of central tendency.