Quant - Stat. Measures of Asset Returns Flashcards
How do you calculate the arithmetic mean?
The arithmetic mean is the sum of the observations divided by the number of observations.
What are three “measures of central tendency” and what is the purpose of a “measure of central tendency”?
Measures of central tendency include the mean, the median and the mode, and specify where data are centered.
How do you calculate the median value?
The median is the value of the middle item of observations, or the mean of the values of the two middle items, when the items in a set are sorted into ascending or descending order.
When is the median more useful than the mean?
Since the median is not influenced by extreme values, it is most useful in the case of skewed distributions.
How do you calculate the mode?
The mode is the most frequently observed value and is the only measure of central tendency that can be used with nominal or categorical data. A distribution may be unimodal (one mode), bimodal (two modes), trimodal (three modes), or have even more modes.
What are quartiles?
Quantiles, as the median, quartiles, quintiles, deciles, and percentiles, are location parameters that divide a distribution into halves, quarters, fifths, tenths, and hundredths, respectively.
How do you calculate the first, second, and third quartiles given a data set?
To calculate the quartiles, you first arrange your data in ascending order.
Q1 can be calculated by finding the median of the first half of your dataset.
Q2 is the median of the dataset.
Q3 is found by calculating the median of the second half of your dataset.
If the dataset has an odd number of observations, the median itself is typically excluded from the halves when finding Q1 and Q3.
What does the “Box” in a “Box and Whiskers Plot” represent?
A box and whiskers plot illustrates the distribution of a set of observations. The “box” depicts the interquartile range, the difference between the first and the third quartile. The “whiskers” outside of the “box” indicate the others measures of dispersion.
What is “dispersion”?
In statistics, dispersion (or variability) refers to how spread out or scattered the values in a data set are. It’s a measure of how much the data points differ from each other and from the central tendency (mean, median, or mode) of the distribution.
What are some measures of dispersion?
range
variance
standard deviation
coefficient of variation
What is “range” and how do you calculate it?
Range: The simplest measure of dispersion, which is the difference between the maximum and minimum values in a data set. It gives a quick sense of the breadth of the values but doesn’t account for how the data is distributed between these extremes.
What is “inter quartile range” and how do you calculate it?
Interquartile Range (IQR): This measure focuses on the middle 50% of the data, calculated as the difference between the third quartile (Q3) and the first quartile (Q1). It helps mitigate the effect of outliers and provides a clearer picture of the central spread of data.
What is “variance” and how do you calculate it?
Variance: A more comprehensive measure that describes the average squared deviations from the mean. By squaring the differences, variance weighs outliers more heavily, making it sensitive to extreme values.
What is the benefit of “variance” and what is the issue with “variance’?
Benefit: can handle negative numbers.
Issue: by squaring the differences, variance weighs outliers more heavily, making it sensitive to extreme values.
What is “standard deviation” and how is it calculated?
Standard Deviation: The square root of the variance, which brings the measure of dispersion back into the units of the original data.
Which is more commonly used and why?
- Standard Deviation
- Mean
Standard Deviation: It’s one of the most widely used statistics for measuring dispersion because it is interpretable in the context of the mean.