Ch 4 Resampling via Monte Carlo Flashcards
WHat is resampling
Resampling: iteratively generate random data and analyse the stats
eg Monte Carlo used to run multiple simulations
- Traditional approach to computing statistics
2. Example: estimate probability S&P returns 5% in a week:
- Assume a probability distribution and algebraically solve for the statistic in question.
- Example: estimate probability S&P returns 5% in a week: π ^ππππ=(1+π_ππππππ¦ ) (1+π_ππ’ππ πππ¦ )β―(1+π_πΉπππππ¦ )β1
Why resample?
Use in what circumstances: (3)
- Particularly powerful when closed form solutions are difficult to compute, such as
a) compute more troublesome βstatisticsβ: products, ratios, VaR e.g. geometric returns, sharpe ratios
b) There is time variation in the dataset, e.g. volatility regimes
c) There are data features that are difficult to capture in a model, e.g. serial correlation, heteroskedasticity
Advances in computing power makes sampling more tractable every day
- Define serial correlation
- Also known asβ¦
- Serial correlation = 0β¦
- Serial correlation = (approaches) 1β¦
- Definition Serial correlation: relationship between given variable and itself over various time intervals. Serial correlations are often found in repeating patterns, when the level of a variable effects its future level.
- AKA βautocorrelationβ or βlagged correlation.β
- serial correlation = 0: there is no correlation, each of the observations are independent of one another.
- serial correlation skews toward one, it means that the observations are serially correlated
- Define heteroskedasticity
- two forms
- mathematically..
- Define Heteroskedasticity: when the standard deviations of a variable, monitored over a specific amount of time, are nonconstant.
- 2 forms: conditional and unconditional.
a) Conditional heteroskedasticity identifies nonconstant volatility when future periods of high and low volatility cannot be identified. Not predictable
b) Unconditional heteroskedasticity is used when futures periods of high and low volatility can be identified. Predictable in nature - eg seasonality - refers to the error variance, or dependence of scatter, within a minimum of one independent variable within a particular sample. These variations can be used to calculate the margin of error between data sets, such as expected results and actual results, as it provides a measure for the deviation of data points from the mean value.
Define Resampling
Resampling: repeatedly calling samples from a probability distribution, using tens or hundreds of thousands. The probability distribution may be analytical (e.g. Normal, Studentβs t), or it may be empirical (realised)
Define Bootstrapping
Bootstrapping β sampling with replacement
Define Jackknife
Jackknife: randomly excludes a small number of observations on each iteration. Applies only to a random set of observations. Cannot be used with an assumed analytic distribution
Define Monte Carlo
Monte Carlo: Resampling with a known analytic distribution
Portfolio Opimisation in Practice: Phillipe Jorion
- Jorian created 1000 alternative histories using a multivariate Normal distribution using the realised returns and covariances
- Using each alternative history, he constructed a mean-variance optimal asset-allocation, rebalancing each month for 10 years
- He tested how each of these alternative asset allocation porftolios performed on the real history
Bootstrapping (sampling with replacement)
- SImple method
- Another example
β Eg simple method
β Take original data set
β Resample from that set to form a new sample
β Repeat many times, then for each sample calculate the required stats.
β Create histogram of results
β Another example
β Loop 1000 times. Simulate (10 yrs of monthly returns using these numbers
β Calculate average returns, volatilities and correlations
β Derive optimum portfolioβs AA using MVO and parameters of previous step
β Determine how portfolio wouold have performed using actual historical data
β Plot the means and volatilities
β Plot asset allocations
Issues with bootstrapping (4)
β Level of returns
β Heteroskedasticity
β (Standard deviations are non constant)
β Changing Correlations (may need to adjust correlations to better fit expectations of changing correlations in the future)
β Autocorrelation
β Similarity of a variable with itself over a separate block of time. Consider momentum.
β Periods where markets generally go up, followed by periods where they are generally decreasing (serial correlation)
Block bootstrapping - why use
β By randomly choosing individual months in the sample we destroy any autocorrelation in original data set β ie it wont be seen in the sample
β Mitigate this by BLOCK BOOTSTRAPPING: rather than choosing 1 month at a time; select a block of months (ie n = number of months selected in sequence)