Organizing, Visualizing and Describing Data Flashcards
Calculate the Arithmetic mean (average)
Dividing the sum of all values in a data set by the number of values
Calculate the Sample mean
Dividing the sum of all samples by the number of samples
Calculate and explain the Geometric mean
Why are there differences between the arithmetic and geometric mean?
The geometric mean is a way to find the average of a set of numbers that are multiplied together.
In simple terms, it is used to calc the effective rate per period of the holding period return.
To calculate the geometric mean, you need to follow these steps:
- IF there are negative numbers (%), turn all numbers into a proportion (N+1)
- Multiply all the numbers together.
- Take the nth root of the product, where n is the total number of values.
- IF you have turned numbers into a proportion, -1 from the result
Due to variability. If all the values were the same, the means would match. The higher variability, the higher the difference in the two means.
Calculate the Harmonic mean
The Harmonic mean tells you the average price you would pay for a share of stock over multiple periods if you invested the same amount each period.
To calculate the harmonic mean, you need to follow these steps:
Add up the reciprocals of all the numbers.
Divide the total number of values by the sum obtained in step 1.
Let’s use an example to understand how the harmonic mean works. Suppose we have three numbers: 2, 4, and 8. To find the harmonic mean, we’ll first find the reciprocals of these numbers and then add them together:
1/2 + 1/4 + 1/8 = 7/8
Next, we divide the total number of values (which is 3 in this case) by the sum of the reciprocals:
3 ÷ (7/8) = 24/7 ≈ 3.43
- Divide the sum of the reciprocals by the number of reciprocals.
Define the uses for the various means
Arithmetic mean. Estimate the next observation, expected value of a distribution.
Geometric mean. Compound rate of returns over multiple periods.
Trimmed mean. Estimate the mean without the effects of a given percentage of outliers.
Winsorized mean. Decrease the effect of outliers on the mean. a 95% winzorised mean will substitute the 2.5 % lowest and highest outliers for the 97.5 and 2.5 value
Harmonic mean. Calculate the average share cost from periodic purchases in a fixed dollar amount (same monetary amount)
Calculate the Weighted mean
Calculating the weighted average involves:
- Multiplying each data point by its weight and summing those products.
- Then sum the weights for all data points.
- Finally, divide the weight*value products by the sum of the weight
EG Portfolio, Stocks are 50% and the return is 12%
0.5 x 12 To get the weight for stocks. Then sum to all other areas (bonds etc - that is the answer)
What is a Quantile? and What are the different Quantiles
Quantile is the general term for a value at or below which a stated proportion of the data in a distribution lies. Examples of quantiles include the following:
Quartile. The distribution is divided into quarters.
Quintile. The distribution is divided into fifths.
Decile. The distribution is divided into tenths.
Percentile. The distribution is divided into hundredths (percents).
Quantiles can also be expressed as percentiles
The difference between the 1st and 2nd quantile is the interquartile range.
Calculate the position of the observation at a given percentile
What do you need to remember
- Ly=(n+1)y/100
- For results such as 8.4
- count to the 8th value
- subtract the 8th value from the 9th value and times it by .4
Sum the two figures above
n = number of observations
y = quantile / percentile
Calculate ‘Range’
Maximum Value - Minimum Value
Calculate the Mean of Absolute Deviation
The mean absolute deviation (MAD) is the average of the absolute values of the deviations of individual observations from the arithmetic mean.
- Calculate the mean
- Subtract all values (absolute values) from the mean
- Sum those values together
- Divide by the number of observations
aka: Find the average of differences from the mean
Calculate Sample Variance
I think this is the same for population variance too?
- Calculate the mean
- Subtract the mean from the values e.g 11 - mean
- Take the sum of the squared differences from the mean
- Divide by n-1
e.g:
10, 15, 20
n = 3
Mean = 15
(10-15)2 + (15-15)2 + (20,15)2 =
25 + 0 + 25 = 50
50 / 2 = 25
Calculate sample standard deviation
It is the Square Root of the Sample Variance
Explain how issues arise comparing two measures of dispersion
A direct comparison between two or more measures of dispersion may be difficult. For instance, suppose you are comparing the annual returns distribution for retail stocks with a mean of 8% and an annual returns distribution for a real estate portfolio with a mean of 16%. A direct comparison between the dispersion of the two distributions is not meaningful because of the relatively large difference in their means. To make a meaningful comparison, a relative measure of dispersion must be used. Relative dispersion is the amount of variability in a distribution relative to a reference point or benchmark. Relative dispersion is commonly measured with the coefficient of variation (CV)
Calculate relative dispersion / Coefficient of variation
How do you interpret the results?
Standard Deviation of X
/
Average Value of X
Higher value = Higher dispersion (risk)
Calculate and interpret target downside deviation
The calculation is the same as standard deviation, but you ONLY include values below the ‘target’ value
Remember to still include all values in the division (e.g n-1)