(8) Statistical Concepts and Market Returns Flashcards

You may prefer our related Brainscape-certified flashcards:
1
Q

LOS 8. a: Distinguish between descriptive statistics and inferential statistics, between a population and a sample, and among the types of measurement scales.

A

Descriptive statistics summarize the characteristics of a data set;

inferential statistics are used to make probabilistic statements about a population based on a sample.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

LOS 8. a: Distinguish between descriptive statistics and inferential statistics, between a population and a sample, and among the types of measurement scales.

A

A population includes all members of a specified group,

while a sample is a subset of the population used to draw inference about the population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

LOS 8. a: Distinguish between descriptive statistics and inferential statistics, between a population and a sample, and among the types of measurement scales.

A

Data may be measured using different scales:

Nominal scale – data is put into categories that have no particular order

Ordinal scale – data is put into categories that can be ordered with respect to some characteristic.

Interval scale – differences in data values are meaningful, but ratios, such as twice as much or twice as large, are not meaningful

Ratio scale – ratios of values, such as twice as much or half as large, are meaningful, and zero represents the complete absence of the characteristic being measured.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

LOS 8. b: Define a parameter, a sample statistic, and a frequency distribution.

A

Any measurable characteristic of a population is called a parameter

A characteristic of a sample is given by a sample statistic

A frequency distribution groups observation into classes, or intervals. An interval is a range of values.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

LOS 8. c. Calculate and interpret relative frequencies and cumulative relative frequencies, given a frequency distribution.

A

Relative frequency is the percentage of total observations falling within an interval.

Cumulative relative frequency for an interval is the sum of the relative frequencies for all values les than or equal to the interval’s maximum value.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

LOS 8. d: Describe the properties of a data set presented as a histogram or a frequency polygon.

A

A histogram is a bar chart of data that has been grouped into a frequency distribution.

A frequency polygon plots the midpoint of each interval on the horizontal axis and the absolute frequency for that interval on the vertical axis, and connects the midpoints with straight lines.

The advantage of histograms and frequency polygons is that we can quickly see where most of the observations lie.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

LOS 8. e: Calculate and interpret measures of central tendency, including the population mean, sample mean, arithmetic mean, weighted average or mean, geometric mean, harmonic mean, median and mode.

A

The arithmetic mean is the average. Population mean and sample mean are examples of arithmetic means.

The geometric mean is used to find a compound growth rate.

The weighted mean weights each value according to its influence

The harmonic mean can be used to find an average purchase price, such as dollars per share for eual periodic investments.

The median is the midpoint of a data set when the data is arranged from largest to smallest.

The mode of a data set is the value that occurs most frequently.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

LOS 8. f: Calculate and interpret quartiles, quintiles, deciles, and percentiles.

A

Quantile is the general term for a value at or below which is stated proportion of the data in a distribution lies. Examples of quantiles include:

Quartiles – the distribution is divided into quarters.

Quintile – the distribution is divided into fifths.

Decile – the distribution is divided into tenths.

Percentile – the distribution is divided into hundredths (percent).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

LOS 8. g: Calculate and interpret 1) a range and a mean absolute deviation and 2) the variance and standard deviation of a population and of a sample.

A

The range is the difference between the largest and smallest values in a data set.

Mean absolute deviation (MAD) is the average of the absolute values of the deviations from the arithmetic mean:

Variance is defined as the mean of the squared deviations from the arithmetic mean or from the expected value of a distribution.

Standard deviation is the positive square root of a variance and is frequently used as a quantitative measure of risk.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Biased estimator (give example)

A

In statistics, the bias (or bias function) of an estimator is the difference between this estimator’s expected value and the true value of the parameter being estimated. An estimator or decision rule with zero bias is called unbiased. Otherwise the estimator is said to be biased. In statistics, “bias” is an objective property of an estimator, and while not a desired property, it is not pejorative, unlike the ordinary English use of the term “bias”.

When we divide by (n1) when calculating the sample variance, then it turns out that the average of the sample variances for all possible samples is equal the population variance. … Dividing by n does not give an “unbiased” estimate of the population standard deviation.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

LOS 8. h: Calculate and interpret the proportion of observations falling within a specified number of standard deviations of the mean using Chebyshev’s inequality.

A

Chebyshev’s inequality states that the proportion of the observations within k standard deviations of the mean is at least 1 – 1/k^2 for all k > 1. It states that for any distribution, at least.

36% of observations lie within +/- 1.25 standard deviations of the mean.

56% of observations lie within +/- 1.5 standard deviations of the mean.

75% of observations lie within +/- 2 standard deviations of the mean.

89% of observations lie within +/- 3 standard deviations of the mean.

94% of observations lie within +/- 4 standard deviations of the mean.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

LOS 8. i: Calculate and interpret the coefficient of variation and the Sharpe ratio.

A

The Coefficient of variation for sample data, CV = s / Xbar, is the ratio of the standard deviation of the sample to its mean (expected value of the underlying distribution).

The Sharpe ratio measures excess return per unit of risk:

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Relative dispersion

A

The relative dispersion of a data set, more commonly referred to as its coefficient of variation, is the ratio of its standard deviation to its arithmetic mean. In effect, it is a measurement of the degree by which an observed variable deviates from its average value. It is a useful measurement in applications such as comparing stocks and other investment vehicles because it is a way to determine the risk involved with the holdings in your portfolio.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

LOS 8. j: Explain skewness and the meaning of the positively or negatively skewed return distribution.

A

Skewness describes the degree to which a distribution is not symmetric about its mean. A right-skewed distribution has positive skewness. A left-skewed distribution has negative skewness.

Sample skew with an absolute value greater than 0.5 is considered significantly different from zero.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

LOS 8. k: Describe the relative locations of the mean, median, and mode for a unimodal, nonsymmetrical distribution.

A

For a positively skewed, unimodal distribution, the mean is greater than the median, which is greater than the mode.

For a negatively skewed, unimodal distribution, the mean is less than the median, which is less than the mode.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

LOS 8. l: Explain measures of sample skewness and kurtosis.

A

Kurtosis measures the peakedness of a distribution and the probability of extreme outcomes (thickness of tails).

  • Excess kurtosis is measured relative to a normal distribution, which has a kurtosis of 3.
  • Positive values of excess kurtosis indicate a distribution that is leptokurtic (fat tails, more peaked) so that the probability of extreme outcomes is greater than for a normal distribution.
  • Negative values of excess kurtosis indicate a platykurtic distribution (thin tails, less peaked).
  • Excess kurtosis with an absolute value greater than 1 is considered significant.
17
Q

Interpret kurtosis

A

To interpret kurtosis, note that it is measured relative to the kurtosis of a normal distribution, which is 3. Positive values of excess kurtosis indicate a distribution that is leptokurtic (more peaked, fat tails), whereas negative values indicate a platykurtic distribution (less peaked, thin tails). Excess kurtosis values that exceed 1.0 in absolute value are considered large. We can calculate kurtosis relative o that of a normal distribution as:

Excess kurtosis = Sample kurtosis - 3

18
Q

LOS 8. m: Compare the use of arithmetic and geometric means when analyzing investment returns.

A

The arithmetic mean return is appropriate for forecasting single period returns in future period, while the geometric mean is appropriate for forecasting future compound returns over multiple periods.