Reading 10 - Common Probability Distribution Flashcards

1
Q

Reading 10 – Common Probability Distributions – Learning outcomes

A

The candidate should be able to:

define a probability distribution and distinguish between discrete and continuous random variables and their probability functions;

describe the set of possible outcomes of a specified discrete random variable;

interpret a cumulative distribution function;

calculate and interpret probabilities for a random variable, given its cumulative distribution function;

define a discrete uniform random variable, a Bernoulli random variable, and a binomial random variable;

calculate and interpret probabilities given the discrete uniform and the binomial distribution functions;

construct a binomial tree to describe stock price movement;

calculate and interpret tracking error;

define the continuous uniform distribution and calculate and interpret probabilities, given a continuous uniform distribution;

explain the key properties of the normal distribution;

distinguish between a univariate and a multivariate distribution and explain the role of correlation in the multivariate normal distribution;

determine the probability that a normally distributed random variable lies inside a given interval;

define the standard normal distribution, explain how to standardize a random variable, and calculate and interpret probabilities using the standard normal distribution;

define shortfall risk, calculate the safety-first ratio, and select an optimal portfolio using Roy’s safety-first criterion;

explain the relationship between normal and lognormal distributions and why the lognormal distribution is used to model asset prices;

distinguish between discretely and continuously compounded rates of return and calculate and interpret a continuously compounded rate of return, given a specific holding period return;

explain Monte Carlo simulation and describe its applications and limitations;

compare Monte Carlo simulation and historical simulation.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Probability distribution

A

A distribution that specifies the probabilities of a random variable’s possible outcomes.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Random variable

A

A quantity whose future outcomes are uncertain

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

The two basic types of random variables

A

Are discrete random variables and continuous random variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Discrete random variable

A

A random variable that can take on at most a countable number of possible values.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Continuous random variable

A

A random variable for which the range of possible outcomes is the real line (all real numbers between −∞ and +∞ or some subset of the real line).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Probability function

A

A function that specifies the probability that the random variable takes on a specific value.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

We can view a probability distribution in two ways:

A

Probability function

Probability density function

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Probability function

A

A function that specifies the probability that the random variable takes on a specific value.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Probability density function

A

A function with non-negative values such that probability can be described by areas under the curve graphing the function.

For continuous random variables, the probability function is denoted f(x) and called the probability density function (pdf), or just the density.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

A probability function has two key properties (which we state, without loss of generality, using the notation for a discrete random variable):

A
  • 0 ≤ p(x) ≤ 1, because probability is a number between 0 and 1.
  • The sum of the probabilities p(x) over all values of X equals 1. If we add up the probabilities of all the distinct possible outcomes of a random variable, that sum must equal 1.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Cumulative distribution function

A

A function giving the probability that a random variable is less than or equal to a specified value.

For both discrete and continuous random variables, the shorthand notation is F(x) = P(Xx).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

The cdf has two other characteristic properties:

A
  • The cdf lies between 0 and 1 for any x: 0 ≤ F(x) ≤ 1.
  • As we increase x, the cdf either increases or remains constant.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Bernoulli random variable

A

A random variable having the outcomes 0 and 1.

named after the Swiss probabilist Jakob Bernoulli (1654–1704)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Bernoulli trial

A

An experiment that can produce one of two outcomes.

If we let Y equal 1 when the outcome is success and Y equal 0 when the outcome is failure, then the probability function of the Bernoulli random variable Y is

  • p(1) = P(Y* = 1) = p
  • p(0) = P(Y* = 0) = 1 – p

where p is the probability that the trial is a success.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What does the hat on the p indicated for bernoullis trial?

A

The “hat” over p indicates that it is an estimate of p.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Binomial random variable

A

The number of successes in n Bernoulli trials for which the probability of success is constant for all trials and the trials are independent.

In n Bernoulli trials, we can have 0 to n successes. If the outcome of an individual trial is random, the total number of successes in n trials is also random. A binomial random variable X is defined as the number of successes in n Bernoulli trials. A binomial random variable is the sum of Bernoulli random variables Y**i, i = 1, 2, …, n:

X = Y1 + Y2 + … + Y**n

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

The binomial distribution makes these assumptions:

A
  • The probability, p, of success is constant for all trials.
  • The trials are independent.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

Binomial probability function

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

If the price you trade at is fair, then?

A

50 percent of the trades you do with a broker should be profitable.

Of course, you need to adjust for the direction of the overall market after the trade (any broker’s record will be helped by a bull market) and perhaps make other risk adjustments. Assume that these adjustments have been made.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

Tracking error

A

The standard deviation of the differences between a portfolio’s returns and its benchmark’s returns; a synonym of active risk.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

Mean and variance of binomial random variables equations

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

Mean and variance of binomial random variables explained

A

Because a single Bernoulli random variable, Y ~ B(1, p), takes on the value 1 with probability p and the value 0 with probability 1 − p, its mean or weighted-average outcome is p. Its variance is p(1 − p).10 A general binomial random variable, B(n, p), is the sum of n Bernoulli random variables, and so the mean of a B(n, p) random variable is np. Given that a B(1, p) variable has variance p(1 − p), the variance of a B(n, p) random variable is n times that value, or np(1 − p), assuming that all the trials (Bernoulli random variables) are independent.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

Binomial model

A

A model for pricing options in which the underlying price can move to only one of two possible new prices.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

Up transition probability

A

The probability that an asset’s value moves up.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
26
Q

Down transition probability

A

The probability that an asset’s value moves down in a model of asset price dynamics.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
27
Q

Binomial tree

A

The graphical representation of a model of asset price dynamics in which, at each period, the asset moves up with probability p or down with probability (1 – p).

28
Q

Continuous uniform distribution

A

The continuous uniform distribution is the simplest continuous probability distribution. The uniform distribution has two main uses. As the basis of techniques for generating random numbers, the uniform distribution plays a role in Monte Carlo simulation. As the probability distribution that describes equally likely outcomes, the uniform distribution is an appropriate probability model to represent a particular kind of uncertainty in beliefs in which all outcomes appear equally likely.

29
Q

The pdf for a uniform random variable is:

A
30
Q

The mathematical operation that corresponds to finding the area under the curve of a pdf f(x) from a to b is the integral of f(x) from a to b:

A
31
Q

For a continuous uniform random variable, what is the mean and variance?

A

The mean is given by μ = (a + b)/2 and the variance is given by σ2 = (ba)2/12.

32
Q

Central limit theorem

A

The central limit theorem states that the sum (and mean) of a large number of independent random variables is approximately normally distributed.

33
Q

The defining characteristics of a normal distribution are as follows:

A
  • The normal distribution is completely described by two parameters—its mean, μ, and variance, σ2. We indicate this as X ~ N(μ, σ2) (read “X follows a normal distribution with mean μ and variance σ2”). We can also define a normal distribution in terms of the mean and the standard deviation, σ (this is often convenient because σ is measured in the same units as X and μ). As a consequence, we can answer any probability question about a normal random variable if we know its mean and variance (or standard deviation).
  • The normal distribution has a skewness of 0 (it is symmetric). The normal distribution has a kurtosis (measure of peakedness) of 3; its excess kurtosis (kurtosis − 3.0) equals 0.17 As a consequence of symmetry, the mean, median, and the mode are all equal for a normal random variable.
  • A linear combination of two or more normal random variables is also normally distributed.
34
Q

Univariate distribution

A

A distribution that specifies the probabilities for a single random variable.

35
Q

Multivariate distribution

A

A probability distribution that specifies the probabilities for a group of related random variables.

36
Q

Multivariate normal distribution

A

A probability distribution for a group of random variables that is completely defined by the means and variances of the variables plus all the correlations between pairs of the variables.

37
Q

A multivariate normal distribution for the returns on n stocks is completely defined by three lists of parameters:

A
  • the list of the mean returns on the individual securities (n means in total);
  • the list of the securities’ variances of return (n variances in total); and
  • the list of all the distinct pairwise return correlations: n(n − 1)/2 distinct correlations in total.

For example, a distribution with two stocks (a bivariate normal distribution) has two means, two variances, and one correlation: 2(2 − 1)/2. A distribution with 30 stocks has 30 means, 30 variances, and 435 distinct correlations: 30(30 − 1)/2. The return correlation of Dow Chemical with American Express stock is the same as the correlation of American Express with Dow Chemical stock, so these are counted as one distinct correlation.

38
Q

In order to specify the normal distribution for portfolio return, we need:

A

we need the means, variances, and the distinct pairwise correlations of the component securities.

39
Q

The normal density function

A
40
Q

Standard normal distribution

A

The normal density with mean (μ) equal to 0 and standard deviation (σ) equal to 1.

41
Q

Also called Unit normal distribution, how does this relate to options?

A

Option returns are skewed. Because the normal is a symmetrical distribution, we should be cautious in using the normal distribution to model the returns on portfolios containing significant positions in options.

42
Q

Having established that the normal distribution is the appropriate model for a variable of interest, we can use it to make the following probability statements:

A
  • Approximately 50 percent of all observations fall in the interval μ ± (2/3)σ.
  • Approximately 68 percent of all observations fall in the interval μ ± σ.
  • Approximately 95 percent of all observations fall in the interval μ ± 2σ.
  • Approximately 99 percent of all observations fall in the interval μ ± 3σ.
43
Q

Standardizing

A

A transformation that involves subtracting the mean and dividing the result by the standard deviation.

44
Q

Standard normal random variable

A

Z = (X – μ)/σ  

REMEMBER: N(−x) = 1 − N(x)

45
Q

The following are some of the most frequently referenced values in the standard normal table:

A
  • The 90th percentile point is 1.282: P(Z ≤ 1.282) = N(1.282) = 0.90 or 90 percent, and 10 percent of values remain in the right tail.
  • The 95th percentile point is 1.65: P(Z ≤ 1.65) = N(1.65) = 0.95 or 95 percent, and 5 percent of values remain in the right tail. Note the difference between the use of a percentile point when dealing with one tail rather than two tails. Earlier, we used 1.65 standard deviations for the 90 percent confidence interval, where 5 percent of values lie outside that interval on each of the two sides. Here we use 1.65 because we are concerned with the 5 percent of values that lie only on one side, the right tail.
  • The 99th percentile point is 2.327: P(Z ≤ 2.327) = N(2.327) = 0.99 or 99 percent, and 1 percent of values remain in the right tail.
46
Q

Mean-variance analysis and when is it applicable

A

An approach to portfolio analysis using expected means, variances, and covariances of asset returns.

In economic theory, mean–variance analysis holds exactly when investors are risk averse; when they choose investments so as to maximize expected utility, or satisfaction; and when either 1) returns are normally distributed, or 2) investors have quadratic utility functions.

47
Q

Utility functions

A

Utility functions are mathematical representations of attitudes toward risk and return.

48
Q

Safety-first rules

A

Rules for portfolio selection that focus on the risk that portfolio value will fall below some minimum acceptable level over some time horizon.

49
Q

Shortfall risk

A

The risk that portfolio value will fall below some minimum acceptable level over some time horizon.

50
Q

Explain Roy’s safety-first criterion

A

Suppose an investor views any return below a level of RL* as unacceptable. Roy’s safety-first criterion states that the optimal portfolio minimizes the probability that portfolio return, *RP, falls below the threshold level, RL*.25 In symbols, the investor’s objective is to choose a portfolio that minimizes *P*(*RP < RL*). When portfolio returns are normally distributed, we can calculate *P*(*RP < RL*) using the number of standard deviations that *RL lies below the expected portfolio return, E(RP*). The portfolio for which *E*(*RP) − RL* is largest relative to standard deviation minimizes *P*(*RP < R**L). Therefore, if returns are normally distributed, the safety-first optimal portfolio maximizes the safety-first ratio (SFRatio):

51
Q

Roy’s safety-first criterion

A

SFRatio = [E(RP*) – *RL]/σP

52
Q

There are two steps in choosing among portfolios using Roy’s criterion (assuming normality):

A

1) Calculate each portfolio’s SFRatio.
2) Choose the portfolio with the highest SFRatio.

53
Q

Scenario analysis

A

Analysis that shows the changes in key financial quantities that result from given (economic) events, such as the loss of customers, the loss of a supply source, or a catastrophic event; a risk management technique involving examination of the performance of a portfolio under specified situations. Closely related to stress testing.

54
Q

Value at risk

A

(VaR) A money measure of the minimum value of losses expected during a specified time period at a given level of probability.

55
Q

The lognormal distribution

A

Closely related to the normal distribution, the lognormal distribution is widely used for modeling the probability distribution of share and other asset prices. For example, the lognormal appears in the Black–Scholes–Merton option pricing model. The Black–Scholes–Merton model assumes that the price of the asset underlying the option is lognormally distributed.

A random variable Y follows a lognormal distribution if its natural logarithm, ln Y, is normally distributed. The reverse is also true: If the natural logarithm of random variable Y, ln Y, is normally distributed, then Y follows a lognormal distribution. If you think of the term lognormal as “the log is normal,” you will have no trouble remembering this relationship.

56
Q

Two must noteworthy observations about the lognormal distribution:

A

The two most noteworthy observations about the lognormal distribution are that it is bounded below by 0 and it is skewed to the right (it has a long right tail). Note these two properties in the graphs of the pdfs of two lognormal distributions in Figure 7. Asset prices are bounded from below by 0. In practice, the lognormal distribution has been found to be a usefully accurate description of the distribution of prices for many financial assets. On the other hand, the normal distribution is often a good approximation for returns. For this reason, both distributions are very important for finance professionals.

57
Q

The expressions for the mean and variance of a lognormal variable are? Where μ and σ2are the mean and variance of the associated normal distribution (refer to these expressions as needed, rather than memorizing them):

A
  • Mean (μL) of a lognormal random variable = exp(μ + 0.50σ2)
  • Variance (σL2) of a lognormal random variable = exp(2μ + σ2) × [exp(σ2) − 1]
58
Q

Continuous vs. discrete compounding

A

Continuous compounding treats time as essentially continuous or unbroken, in contrast to discrete compounding, which treats time as advancing in discrete finite intervals. Continuously compounded returns are the model for returns in so-called continuous time finance models such as the Black–Scholes–Merton option pricing model.

59
Q

Continuous time

A

Time thought of as advancing in extremely small increments.

60
Q

Price relative

A

A ratio of an ending price over a beginning price; it is equal to 1 plus the holding period return on the asset.

  • S1/S*0, is an ending price, S1, over a beginning price, S0; it is equal to 1 plus the holding period return on the stock from t = 0 to t = 1;
  • St+1/St* = 1 + R**t,<em>t</em>+1
61
Q

Continuously compounded return

A

The natural logarithm of 1 plus the holding period return, or equivalently, the natural logarithm of the ending price over the beginning price.

In this reading we use lowercase r to refer specifically to continuously compounded returns.

62
Q

The continuously compounded return from t to t + 1 is:

A

rt,t+1 = ln(St+1/St) = ln(1 + Rt,t+1)  

63
Q

Independently and identically distributed (IID)

A

With respect to random variables, the property of random variables that are independent of each other but follow the identical probability distribution.

A key assumption in many investment applications is that returns are independently and identically distributed (IID).

Independence captures the proposition that investors cannot predict future returns using past returns (i.e., weak-form market efficiency). Identical distribution captures the assumption of stationarity.

64
Q

Stationary

A

Stationarity implies that the mean and variance of return do not change from period to period.

Assume that the one-period continuously compounded returns (such as r0,1) are IID random variables with mean μ and variance σ2 (but making no normality or other distributional assumption). Then the equation would be:

E(r0,<em>T</em>) = E(rT*–1,<em>T</em>) + *E*(*rT–2,<em>T</em>–1) + … + E(r0,1) = μT  

(we add up μ for a total of T times) and

σ2(r0,<em>T</em>) = σ2T  

Standard deviation as well.

65
Q

Volatility

A

As used in option pricing, the standard deviation of the continuously compounded returns on the underlying asset.

Volatility is also called the instantaneous standard deviation, and as such is denoted σ. The underlying asset, or simply the underlying, is the asset underlying the option.

66
Q

Annualizing is often done on the basis of how many days per year?

A

Annualizing is often done on the basis of 250 days in a year, the approximate number of days markets are open for trading. The 250-day number may lead to a better estimate of volatility than the 365-day number. Thus if daily volatility were 0.01, we would state volatility (on an annual basis) as

0.01*SQRT(250)=0.1581

67
Q
A