Quantitative Methods Flashcards
Numerical Data (e.g. Discrete, Continuous)
Values that can be counted.
We can perform mathematical operations only on numerical data.
Categorical Data (e.g. Nominal, Ordinal)
consist of labels that can be used to classify a set of data into groups. Categorical data may be nominal or ordinal.
Discrete Data
Countable data , such as the months, days, or hours in a year
Continuous Data
Can take any fractional value (e.g., the annual percentage return on an investment).
Nominal Data
Data that cannot be placed in a logical order
Ordinal Data
Can be ranked in logical order
Structured Data
Data that can be organised in a defined way
Time series
A set of observations taken periodically e.g. at equal intervals over time
Cross-sectional data
Refers to a set of comparable observations all taken at one specific point in time.
Panel Data
Time series and cross-sectional data combined
Unstructured Data
A mix of data with no defined structure
One-dimensional array
represents a single variable (e.g. a time series)
Two-dimensional array
Represents two variables (e.g. panel data)
Contingency table
A two-dimensional array that displays the joint frequencies of two variables
Confusion matrix
A contingency table (two variables) that displays predicted and actual occurrences of an event
Relationship between geometric and arithmetic mean
The geometric mean is always less than or equal to the arithmetic mean, and the difference increases as the dispersion of the observations increases
Trimmed mean
Estimate the mean without the effects of a given percentage of outliers.
Winsorized mean
Decrease the effect of outliers on the mean.
Harmonic mean
Calculate the average share cost from periodic purchases in a fixed dollar amount.
Empirical probability
established by analysing past data (outcomes)
Priori probability
determined using reasoning and inspection process (not data) e.g. looking at a coin and deciding there is a 50/50 chance of each outcome.
Subjective probability
Established using personal judgement
Unconditional probability (marginal probability)
the probability of an event regardless of the past or future occurrence of other events.
Conditional probability
where the occurrence of one event affects the probability of the occurrence of another event. e.g. Prob (A I B)
Multiplication rule of probability
P (AB) = P (A I B) * P (B)
Addition rule of probability
P (A or B) = P (A) + P (B) - P (AB)
Probability distribution
The probabilities of all the possible outcomes for a random variable
A discrete random variable
when the number of possible outcomes in a probability can be counted and there is a measurable/positive probability. e.g. the number of days it may rain in a month.
A continuous random variable
When the number of possible outcomes is infinite, even if upper and lower bands exist. e.g. the amount of rainfall per month.
The probability function , p(x)
gives the probability that a discrete random variable will equal X
A cumulative probability function (cdf) , F(x)
gives the probability that a random variable will be less than or equal to a given value.
Binomial Random Variable - E(X) = np
Binomial Random Variable - Var(X) = np(1-p)
For a continuous random variable X, the probability of any single value of X is
0
The normal distribution has the following key properties:
- It is completely described by its mean, μ, and variance, σ2, stated as X ~ N(μ, σ2).
In words, this says that “X is normally distributed with mean μ and variance σ2.” - Skewness = 0 (symmetrical),
meaning that the normal distribution is symmetric about its mean, so that P(X ≤ μ) = P(μ ≤ X) = 0.5, and mean = median = mode. - Kurtosis = 3;
this is a measure of how flat the distribution is. Recall that excess kurtosis is measured relative to 3, the kurtosis of the normal distribution. - A linear combination of normally distributed random variables is also normally distributed.
- The probabilities of outcomes further above and below the mean get smaller and smaller but do not go to zero (the tails get very thin but extend infinitely).
univariate distribution
the distribution of a single random variable
A multivariate distribution
the distribution of two or more random variables (takes into account correlation coefficients)
- specifies the probabilities associated with a group of random variables and is meaningful only when the behavior of each random variable in the group is in some way dependent on the behavior of the others.
Number of correlations in a portfolio
0.5n*(n-1)
n = no. of assets in portfolio / variables
Normal distribution: +/-1 s.d. from the mean
68% confidence interval
Normal distribution: +/-1.65 s.d. from the mean
90% confidence interval
Normal distribution: +/-1.96 s.d. from the mean
95% confidence interval
Normal distribution: +/-2.58 s.d. from the mean
99% confidence interval
“standardizing a random variable” (finding z)
measuring how far it lies from the arithmetic mean
z = the no. of standard deviations the variable is from the mean
How to calculate z
how many standard deviations a variable is from the mean
z = ( x - pop. mean ) / s.d.
shortfall risk
probability that a portfolio return or value will be below a target return or value
Roy’s safety first ratio (SF ratio)
no. of standard deviations the target return is from the expected return/value
The larger the SF ratio, the lower the probability of falling below the minimum threshold.
For a standard normal distribution, F(0) is:
0.5
By the symmetry of the z-distribution and F(0) = 0.5. Half the distribution lies on each side of the mean. (LOS 4.j)
Holding period of return –> Continuously compounded rate
ln ( 1 + holding period of return)
ln = natural log
Continuously compounded rate –> Holding period of return
e^ continuously compounded rate -1