Quantitative methods Flashcards
cross sectional data
> many observations across different units (a subset of the population)
same time period
time series data
> many observations
different time periods
panel data
> different time periods
many observations for each time period
combo of cross sectional and time series data
strong positive corr
steep positive line
most appropriate functional form of regression by inspecting the residuals
want residuals to be random
permissionless distributed ledger technology (DLT) networks
> No centralised place of authority exists
all users (nodes) within the network have a matching copy of the blockchain
DLT that could facilitate the ownership of physical assets
Tokenization
Tokenization
> representing ownership rights to physical assets e.g. real estate
creating a single digital record of ownership to verify ownership title and authenticity
applications of DLT in investment management
> cryptocurrencies
tokenization
compliance
post-trade clearing
settlement
type of asset manager making use of fintech in investment decision making
> quants
fundamental asset mngrs
data processing methods
- capture
- curate
- storage
- search
- transfer
fintech
technological innovation in the design and delivery of financial services and products
what is fintech
> analysis of large databases (traditional , non-traditional data)
analytical tools (AI for complex non-linear relationships)
automated trading (algorithms - lower costs, anonymity, liquidity)
automated advice (robo-advisers - may not incorporate whole information in their recommendations)
financial record keeping (DLT)
Big data characteristics
volume
velocity (real-time)
variety (structured, semi-structured and unstructured data)
veracity (important for inference or prediction, credibility and reliability of various data sources)
sources of big data
financial markets
businesses
governments
individuals
sensors
internet of things
main sources of alternative data
businesses
individuals
sensors
types of machine learning
- supervised learning (inputs and outputs labelled, local market performance)
- unsupervised learning (no data labelled, grouping of firms into peer groups based on characteristics)
- deep learning (multi-stage, non-linear processing to identify patterns; draws on supervised + unsupervised ML approaches)
Determinants of Interest Rates
r = Real risk-free interest rate + Inflation premium + Default risk premium + Liquidity premium + Maturity premium.
1 + nominal risk-free rate
(1 + real risk-free rate)(1 + inflation premium)
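A minimal Python sketch of this multiplicative relation (the 3% real rate and 2% inflation premium are hypothetical figures, and the function name is my own):

```python
def nominal_risk_free(real_rate: float, inflation_premium: float) -> float:
    """(1 + nominal) = (1 + real)(1 + inflation premium), solved for nominal."""
    return (1 + real_rate) * (1 + inflation_premium) - 1

# hypothetical: 3% real risk-free rate, 2% inflation premium
nominal = nominal_risk_free(0.03, 0.02)  # about 5.06%, slightly above the 5% sum
```

Note the product term: simply adding the two rates understates the nominal rate by the cross term.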
increased sensitivity of the market value of debt to a change in market interest rates as maturity is extended
maturity premium
defined benefit pension plans and retirement annuities
over the life of a beneficiary
MWRR & TWRR
1) cash flows where inflows = outflows
2) HPR : (change in value of share + dividend)/initial value
annualised compounding rate of growth
r annual
(1+r weekly)^52 -1
gross return
excl : mngmnt , taxes , custodial fees
incl : trading expenses
net return large vs small fund
small fund at disadvantage due to fixed administration costs
return on leverage portfolio
R_p + (V_d/V_e)(R_p - r_d)
cash flows associated with fixed income
> discount e.g. zero coupon bond (FV-PV)
periodic interest e.g. bonds w coupons
level payments : pay price + pay cash flows at intervals both interest and principal ( amortizing loans)
ordinary annuity
r(PV) / (1-(1+r)^(-t))
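The annuity-payment formula above can be sketched in Python; the 30-year, 6% mortgage figures are a hypothetical example:

```python
def annuity_payment(pv: float, r: float, t: int) -> float:
    """Ordinary annuity payment: A = r * PV / (1 - (1 + r)^(-t)),
    with r the periodic rate and t the number of periods."""
    return r * pv / (1 - (1 + r) ** (-t))

# hypothetical: $100,000 loan, 6% annual rate compounded monthly, 30 years
payment = annuity_payment(100_000, 0.06 / 12, 360)  # roughly $599.55 per month
```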
forward P/E
payout / (r-g)
trailing P/E
(p*(1+g))/(r-g)
(1 + spot rate_n)^n
(1 + spot rate_i)^i * (1 + forward rate)^(n-i)
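Solving that no-arbitrage relation for the implied forward rate, as a small sketch (the 3%/4% spot rates are hypothetical):

```python
def implied_forward(z_i: float, z_n: float, i: int, n: int) -> float:
    """Forward rate from period i to n implied by spot rates:
    (1 + z_n)^n = (1 + z_i)^i * (1 + f)^(n - i), solved for f."""
    return ((1 + z_n) ** n / (1 + z_i) ** i) ** (1 / (n - i)) - 1

# hypothetical: 1y spot 3%, 2y spot 4% -> 1y rate one year forward of about 5.01%
f = implied_forward(0.03, 0.04, 1, 2)
```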
IRP
> spot FX * IR = forward FX
continuous compounding
> FV = PV * e^(r*t)
percentile
(n+1)*(y/100)
mean absolute deviation
> dispersion
(sum abs(x-xavg))/n
sample target semi-deviation formula
((SUM_(x<=B)(X-B)^2)/(n-1))^(1/2)
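The formula above, as a direct Python translation (the return series and 0% target are hypothetical):

```python
import math

def target_semideviation(returns, target):
    """Sample target semi-deviation: sqrt of the sum of squared deviations
    below (or at) a target B, divided by n - 1, where n is the full sample size."""
    n = len(returns)
    ss = sum((x - target) ** 2 for x in returns if x <= target)
    return math.sqrt(ss / (n - 1))

# hypothetical sample of four returns, target B = 0
sd_below = target_semideviation([0.05, -0.02, 0.03, -0.01], 0.0)
```

Only observations at or below B enter the numerator, but the denominator still uses the full n − 1.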
coefficient of variation
sample st dev / sample mean
skewness
positive:
> losses small and likely
> profits large and unlikely
> investors prefer distributions with a large frequency of unusually large payoffs
kurtosis
> how much of the distribution sits in its tails relative to the normal distribution
> platykurtic (thin tails, flat peak)
> mesokurtic (normal distr)
> leptokurtic (fat tails, tall peak)
high kurtosis
higher chance of extremes in the tails
- kurtosis < 3, excess kurtosis negative
- kurtosis = 3, excess kurtosis 0
- kurtosis > 3, excess kurtosis positive
spurious correl
> correlation arising purely by chance
correlation induced when two variables are each divided by a third
correlation between two variables that each have a relation to a third variable
updated probability
(prob of new info given event / unconditional prob of new info) * prior prob of event
p(event|info)
[P(info|event)/P(info)]*P(event)
P(F|E)
P(F)P(E|F)/[P(F)P(E|F)+P(Fnot)*P(E|Fnot)]
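Bayes' formula in that total-probability form, as a short sketch (the 0.4 prior and the two likelihoods are hypothetical numbers):

```python
def bayes_update(prior, p_info_given_event, p_info_given_not_event):
    """P(event | info) = P(info | event) P(event) /
    [P(info | event) P(event) + P(info | not event) P(not event)]."""
    p_info = (p_info_given_event * prior
              + p_info_given_not_event * (1 - prior))
    return p_info_given_event * prior / p_info

# hypothetical: prior 0.4, P(info|event) = 0.75, P(info|not event) = 0.25
posterior = bayes_update(0.4, 0.75, 0.25)  # prior revised upward to 2/3
```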
odds for event
P(E) / [1 - P(E)]
odds against event
[1 - P(E)] / P(E)
Empirical
> Probability - relative frequency
historical data
Does not vary from person to person
objective probabilities
A priori
> Probability - logical analysis or reasoning
Does not vary from person to person
Objective probabilities
Subjective
> Probability - personal or subjective judgment
No particular reference to historical data
used in investment decisions
A&B mutually exclusive and exhaustive events
P(C) = P(C and A) + P(C and B) = P(C|A)P(A) + P(C|B)P(B)
P(B or C) (non-mutually exclusive events)
P(B or C) = P(B) + P(C) – P(B and C)
P(B and C), dependent events
P(B and C) = P(B) x P(C|B)
P(C) unconditional probability
P(C) = P(B) x P(C given B) + P(Bnot) x P(C given Bnot) = P(C and B) + P(C and Bnot)
No. of ways the k tasks can be done
= (n1)(n2)...(nk)
Combination (binomial) formula
sequence does not matter: nCr = n! / ((n-r)! r!)
cov
sum over states of P(state) * (r_a - E(r_a))(r_b - E(r_b))
shortfall risk
return below min level
(E(R_p)- R_l) / sigma_p
Roy’s safety-first criterion
- Optimal portfolio: minimizes the probability that portfolio returns fall below a specified level
- If returns are normally distributed, optimal portfolio maximizes safety-first ratio
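Roy's criterion in code, using two hypothetical portfolios and a 3% threshold return:

```python
def safety_first_ratio(expected_return, threshold, stdev):
    """Roy's safety-first ratio: (E(Rp) - RL) / sigma_p.
    Under normality, the portfolio with the highest ratio minimizes
    the probability that returns fall below the threshold RL."""
    return (expected_return - threshold) / stdev

# hypothetical: A offers 10% expected return at 15% st dev,
# B offers 8% at 10% st dev; threshold RL = 3%
sf_a = safety_first_ratio(0.10, 0.03, 0.15)  # ~0.467
sf_b = safety_first_ratio(0.08, 0.03, 0.10)  # 0.500 -> B is preferred
```

Despite its lower expected return, B wins because each unit of risk buys more cushion above RL.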
Measuring and controlling financial risk
- Stress testing and scenario analysis
- Value-at-Risk (VaR) - value of losses expected over a specified time period at a given level of probability
Bootstrapping
> no knowledge of population
sample of size n
Unlike the CLT, which considers all samples of size n from the population, bootstrapping draws samples of size n from the known sample (which also has size n)
Each data item in our known sample can appear once or more or not at all in each
resample (due to replacement)
computer simulation to mimic the process of CLT : randomly drawn sample as if population
Easy to perform but only provides statistical estimates not exact results
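A minimal bootstrap of the standard error of the mean, using only the standard library (the ten-point sample, seed, and resample count are arbitrary choices for illustration):

```python
import random
import statistics

def bootstrap_se_of_mean(sample, n_resamples=2000, seed=42):
    """Bootstrap SE of the sample mean: draw n_resamples samples of size n
    WITH replacement from the observed sample, then take the st dev
    of the resampled means."""
    random.seed(seed)
    n = len(sample)
    means = [statistics.mean(random.choices(sample, k=n))
             for _ in range(n_resamples)]
    return statistics.stdev(means)

se = bootstrap_se_of_mean(list(range(1, 11)))  # close to s / sqrt(n)
```

Because draws are with replacement, any data item can appear once, many times, or not at all in a given resample.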
Resampling
repeatedly draws samples from one observed sample to make statistical inferences about
population parameters.
Monte Carlo Simulation
> large number of random samples : represent the role of risk in the system
> specified probability distribution
e.g. pension assets with reference to pension liabilities
> Produces a frequency distribution for changes in portfolio value
> Tool for valuing complex securities
Limitations of Monte Carlo simulation
- Complement to analytical methods
- Only provides statistical estimates, not exact results
- Analytical methods provide more insight to cause-and-effect relationships
Historical simulation
- Sample from a historical record of returns or other underlying variables
- Underlying rationale is that the historic record provides the best evidence
of distributions - Limited by the actual events in the historic record used
- Does not lend itself to ‘what if’ analysis like Monte Carlo simulation
sampling error
difference between the sample statistic and the population parameter it estimates
Stratified random sampling
- divided into strata
- simple random samples taken from each
e.g. bond indices - Guarantees population subdivisions are represented
Cluster sampling
- divided into clusters – mini-representation of the entire population
- certain clusters chosen as a whole using simple random sampling
- if all members in each sampled cluster are sampled: one-stage cluster sampling
- if a subsample is randomly selected from each selected cluster: two-stage cluster sampling
- time-efficient and cost-efficient but the cluster might be less representative of the population
Convenience sampling
Might be used for a pilot study before testing a large-scale and more representative
sample
Judgmental sampling
Sample could be affected by the bias of the researcher
Properties of Central Limit Theorem
- Assuming any type of distribution and a large sample
- Distribution of sample mean is approximately normal
- Mean of the distribution of sample mean will be equal to population mean
- Variance of distribution of sample mean equals population variance divided by the
sample size
Jackknife
> no knowledge of what the population looks like
sample of size n which is assumed to be a good representation of the population
unlike bootstrapping items are not replaced
bootstrapping gives B resamples, but the jackknife gives n resamples, each of size n-1 (each resample leaves out one observation)
For a sample of size n, jackknife resampling usually requires n repetitions. In contrast, with bootstrap resampling, we are left to determine how many repetitions are appropriate
used to reduce the bias of an estimator and to find the standard error and confidence interval of an estimator
Bootstrapping and Jackknife
- Jackknife tends to produce similar results for each run whereas bootstrapping usually gives different results because resamples are drawn randomly
- Both can be used to find the standard error or construct confidence intervals for
the statistic of other population parameters
> such as the median which could not be done using the Central Limit Theorem.
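The leave-one-out jackknife from the cards above, sketched in Python (the five-point sample is hypothetical; for the mean, the jackknife SE reduces exactly to s / sqrt(n)):

```python
import math
import statistics

def jackknife_se_of_mean(sample):
    """Jackknife SE of the mean: n resamples, each of size n - 1
    (each leaves out exactly one observation, with no replacement)."""
    n = len(sample)
    loo_means = [statistics.mean(sample[:i] + sample[i + 1:])
                 for i in range(n)]
    grand = statistics.mean(loo_means)
    return math.sqrt((n - 1) / n * sum((m - grand) ** 2 for m in loo_means))

se = jackknife_se_of_mean([2, 4, 6, 8, 10])
```

Unlike the bootstrap, this produces the same answer on every run, since the n resamples are fixed by the data.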
Bernoulli and Binomial properties
mean : p , var: p(1-p)
mean : np , var: np(1-p)
Discrete and continuous uniform distribution (random # for Monte Carlo sim)
discrete: f(x) = 1/k for k possible outcomes
continuous: f(x) = 1/(b-a)
multivariate distribution pairwise corr
> n*(n-1)/2
feature for the multivariate normal distr
99% : +-2.58
95% : +-1.96
68% : +-1
90% : +-1.65
t-distr
n-1 df
as n becomes large (n > 30), t approaches the normal distribution
> fatter tails and less peaked than the normal curve
chi-squared and F distr
> asymmetrical and bounded below by 0
family of distributions
chi-square: 1 parameter (df)
F: 2 parameters (numerator and denominator df)
as df tends to infinity the probability density functions become more bell-shaped
properties of an estimator
unbiased - sample mean = population mean
efficient - no other unbiased estimator has a sampling distribution with smaller variance
consistent - improves w sample size increase
Point estimate is not likely to equal population parameter in any given sample
CI
Confidence intervals
Point estimate +/- (Reliability factor (z_(a/2))x Standard error (sigma/(n)^(1/2))
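The interval formula as a small sketch (mean 25, sigma 10, n = 100, and the 95% z reliability factor of 1.96 are a hypothetical worked example):

```python
import math

def confidence_interval(point_estimate, reliability, stdev, n):
    """Point estimate +/- reliability factor * standard error,
    where the standard error is sigma / sqrt(n)."""
    se = stdev / math.sqrt(n)
    return (point_estimate - reliability * se,
            point_estimate + reliability * se)

# hypothetical: sample mean 25, known sigma 10, n = 100, 95% level (z = 1.96)
lo, hi = confidence_interval(25, 1.96, 10, 100)  # (23.04, 26.96)
```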
increase in reliability e.g. from 90% - 95%
wider CI
- If the population's standard deviation is not known, use the t-statistic with the sample standard deviation s
Normal distribution with a
known variance
sample < 30 - z-stat
sample > 30 - z-stat
Normal distribution with
an unknown variance
sample < 30 - t-stat
sample > 30 - t-stat or z-stat
Non-normal distribution
with a known variance
sample < 30 - N/A
sample > 30 - z-stat
Non-normal distribution
with unknown variance
sample < 30 - N/A
sample > 30 - t-stat or z-stat
What affects the width of the confidence interval
- Choice of statistic (z or t)
- Choice of degree of confidence
- Choice of sample size
- Larger sample size decreases width
- Larger sample size reduces standard error
- Big sample means t-calcs closer to z-calcs
- Same for at least 30 observations
Problems with larger sample size
cost
cross-population data
Two-sided (or two-tailed) hypothesis test
Not equal to alternative hypothesis
* H0 : ϴ = ϴ0 versus Ha : ϴ ≠ ϴ0
One-sided hypothesis test
- A greater than alternative hypothesis
- H0 : ϴ ≤ ϴ0 versus Ha : ϴ > ϴ0
- A less than alternative hypothesis
- H0 : ϴ ≥ ϴ0 versus Ha : ϴ < ϴ0
t-stat z-score
(sample mean - hypothesized mean) / standard error
2-tail or 1-tail significance level
for a two-tailed test, place alpha/2 in each tail when finding the z critical value
Type II error (β) + Type I error (α)
accept false null + reject true null
Decrease in significance level (incr in confidence levels)
Reduces Type I error, but increases chances of Type II error
Reduce both Type I and Type II errors
- Increase sample size
Power of a test
- Probability of correctly rejecting H0 when it is false
- 1-β
Type I error
false discovery rate
BH adjusted critical value = alpha * (rank of i / number of tests); compare each p-value with its BH value and reject the null if the p-value is less
t-stat > critical value
rej H0
Test the difference between two population means
- State the hypotheses
- Null hypothesis is stated as
H0: μd = 0
- i.e. there is no difference in the populations' mean daily returns (variances unknown but assumed equal)
- Identify the appropriate test statistic and its probability distribution
- t-test statistic and t-distribution
- Specify the significance level
- 5% significance level
- State the decision rule
- If the test statistic > critical value, reject the null hypothesis
Test of a single variance (if sample var known can test for population var)
chi-squared distributed with n-1 degrees of
freedom
two-tailed because distrib not symmetrical
chi^2_(n-1)=((n-1)s^2)/sigma^2_0
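Computing that test statistic in Python (the sample variance of 0.0009 against a hypothesized 0.0004 with n = 25 is a hypothetical example; the critical value would still come from a chi-square table with n − 1 df):

```python
def chi2_variance_stat(sample_var, hypothesized_var, n):
    """Test statistic for a single variance: (n - 1) s^2 / sigma0^2,
    chi-square distributed with n - 1 degrees of freedom."""
    return (n - 1) * sample_var / hypothesized_var

# hypothetical: s^2 = 0.0009, sigma0^2 = 0.0004, n = 25
stat = chi2_variance_stat(0.0009, 0.0004, 25)  # 54, df = 24
```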
Hypothesis Tests Concerning the Variance
Assumptions (chi square)
- Normally distributed population
- Random sample
- Chi-square test is sensitive to violations of its assumptions
Testing the equality of the variances of two populations
- Using sample variances to determine whether the population var are equal
- F-distribution
- Asymmetrical and bounded by zero
- one-tailed
- Calculation of F test statistic
F = s1^2 / s2^2
>= 1 as the larger sample variance is the numerator
> df: n1-1 / n2-1
Parametric tests
- assumptions about the distribution of the population
- E.g., z-test, t-test, chi-square test, or F-test
Non-parametric tests are used in four situations
- Data does not meet distributional assumptions
- not normally distributed + small sample
- Outliers that affect a parametric statistic (the mean) but not a nonparametric statistic (the median)
- Data is given in ranks
- Characteristics being tested is not a population parameter
Tests concerning a single mean
Parametric:
t-distributed test
z-distributed test
Non-Parametric:
Wilcoxon signed-rank
test
Tests concerning
differences between
means
Parametric:
t-distributed test
Non-Parametric:
Mann-Whitney U test
(Wilcoxon rank sum test)
Tests concerning mean differences (paired
comparison tests)
A paired comparisons test is appropriate to test the mean differences of two samples believed to be dependent.
Parametric:
t-distributed test
Non-Parametric:
Wilcoxon signed-rank test
Sign test
Testing the significance of a correlation coefficient
both variables are distributed normally
parametric test
t tables (two-tailed p/2) and n-2 degrees of freedom:
t= r(n-2)^(1/2) / (1-r^2)^(1/2)
As n increases we are more likely to reject a false null: testing the significance of a correlation coefficient
- Degrees of freedom increases and critical statistic falls
- Numerator increases and test statistic rises
The Spearman Rank Correlation Coefficient
nonnormal distribution
nonparametric test
1. Rank observations on X from largest to smallest assigning 1 to the largest, 2 to the second, etc. Do the same for Y.
2. Calculate the difference, di, between the ranks for each pair of observations and square answer
=1 - (6*sum(d^2)/n(n^2-1))
sample size is large (n>30) we can conduct a t-test : df: n-2
= r((n-2)^(1/2))/(1-r^2)^(1/2)
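The two steps and the formula above, sketched for tie-free data (the helper and its ranking convention of 1 = largest follow the card; the sample values are hypothetical):

```python
def spearman_rho(x, y):
    """Spearman rank correlation: 1 - 6 * sum(d^2) / (n (n^2 - 1)),
    where d is the rank difference per pair. Assumes no ties;
    rank 1 is assigned to the largest value, as on the card."""
    def ranks(v):
        order = sorted(range(len(v)), key=lambda i: v[i], reverse=True)
        r = [0] * len(v)
        for rank, i in enumerate(order, start=1):
            r[i] = rank
        return r
    rx, ry = ranks(x), ranks(y)
    n = len(x)
    d2 = sum((a - b) ** 2 for a, b in zip(rx, ry))
    return 1 - 6 * d2 / (n * (n ** 2 - 1))

# hypothetical: perfectly monotone pairs give rho = +1, reversed pairs give -1
rho = spearman_rho([1, 2, 3, 4], [10, 20, 30, 40])
```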
Ordinary Least Squares Regression
The estimated intercept, b0, and slope, b1, are such that the sum of the squared vertical distances from the observations to the fitted line is minimized.
covariance
sum((x-xbar)(y-ybar))/(n-1)
slope coefficient
covariance(x,y)/var(x)
intercept
b0bar = Ybar - b1Xbar
Assumptions of the Simple Linear Regression Model
- Linear relationship – might need transformation to make linear
- Independent variable is not random – assume expected values of independent
variable are correct - Variance of error term is same across all observations (homoskedasticity)
- Independence – The observations, pairs of Y’s and X’s, are independent of one another. error terms are uncorrelated (no serial correlation) across observations
- Error terms normally distributed
SST =
SSR + SSE
Total Variation
sum(y-ybar)^2
Sum of the squared differences between the actual value of the dependent variable and the mean value of the dependent variable
Explained Variation
sum(yhat-ybar)^2
Sum of the squared differences between the predicted value of the dependent variable based on the regression line and the mean value of the dependent variable.
Unexplained Variation
sum(y-yhat)^2
Sum of the squared differences between the actual value of the dependent variable and the predicted value of the dependent variable based on the regression line
Coefficient of Determination – R2
> SSE =0 and RSS = TSS - perfect fit
percentage variation in the dependent variable explained by movements in the independent variable
R^2 = RSS / TSS or (1-(SSE/TSS))
r = sign of b1*(R^2)^(1/2)
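The slope, intercept, and R-squared cards above can be combined into one sketch (the four data points lie exactly on y = 2x + 1, a hypothetical perfect fit):

```python
def simple_ols(x, y):
    """Simple linear regression from the card formulas:
    b1 = cov(x, y) / var(x), b0 = ybar - b1 * xbar, R^2 = RSS / TSS."""
    n = len(x)
    xbar = sum(x) / n
    ybar = sum(y) / n
    sxy = sum((xi - xbar) * (yi - ybar) for xi, yi in zip(x, y))
    sxx = sum((xi - xbar) ** 2 for xi in x)
    b1 = sxy / sxx
    b0 = ybar - b1 * xbar
    tss = sum((yi - ybar) ** 2 for yi in y)          # total variation
    rss = sum((b0 + b1 * xi - ybar) ** 2 for xi in x)  # explained variation
    return b0, b1, rss / tss

# hypothetical data exactly on y = 2x + 1
b0, b1, r2 = simple_ols([1, 2, 3, 4], [3, 5, 7, 9])
```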
Regression : DF, SS, MS
k =1 indep var (measures the number of independent var)
sum(yhat-ybar)^2
MSR = SSR / DF
Residual : DF, SS, MS
n-k-1
sum(y-yhat)^2
MSE = SSE/ DF
standard error of the estimate (SEE)
MSE^(1/2)
= (SSE/(n-2))^(1/2)
ANOVA
F-distributed Test Statistic
H0: b0=b1=…=0
F-test = (SSR/k) / (SSE/(n-k-1)) = MSR / MSE
> df = k , df = n-k-1
Hypothesis Test of the Slope Coefficient
- H0: b1 = 0
tcalc = (bhat - b)/SE
SE = (MSE)^(1/2) / (SUM(X-Xbar)^2)^(1/2)
or H0: b1 <= 0
or H0: b1 = 1
Hypothesis Test of the Intercept
H0: b0 = specified value
tcalc = (bhat0- b0) / SE
df= n-k-1
SE = SEE * (1/N + Xbar^2/sum(x-xbar)^2)^(1/2)
Level of Significance and p-Values
- Smallest level of significance at which the null hypothesis can be rejected
- Smaller the p-value, stronger the evidence against the null hypothesis
- The smaller the p-value, the smaller the chance of making a Type I error (rejecting the null when, in fact, it is true), but increases the chance of making a Type II error (failing to reject the null when, in fact, it is false)
Prediction (confidence) intervals on the dependent variable
Y = Yhat_f +/- t_c * s_f
s_f^2 = SEE^2 [1 + 1/n + (X_f - Xbar)^2 / ((n-1) s_x^2)]
for y predicted need to plug value into linear equation
Log-lin model
Slope coefficient represents the relative change in the dependent variable for an absolute change in the independent variable
Lin-log model
Slope coefficient gives the absolute change in the dependent variable for a relative change in the independent variable
Log-log (double-log) model
Slope coefficient gives the relative change in the dependent variable for a relative change in the independent variable and is useful for calculating elasticities
hedged portfolio using
long underlying and short calls
to find c0
V0 = hS0 - c0
V1 +/- = hS1+/- -c1+/-
because we are hedged
V1+ = V1-
h (hedge ratio) = (c1+ - c1-) / (S1+ - S1-)
return = V1+ / V0 = V1- / V0 = 1+ R
hS0 - c0 = V1+ / (1+R)
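The hedge argument above, worked end to end (the 100/110/90 lattice, 100 strike, and 5% rate are hypothetical numbers):

```python
def binomial_call_price(s0, s_up, s_down, strike, r):
    """One-period binomial call value via the riskless hedge:
    h = (c+ - c-) / (S+ - S-); since the hedged portfolio pays the
    same in both states, hS0 - c0 = V1 / (1 + R)."""
    c_up = max(s_up - strike, 0.0)
    c_down = max(s_down - strike, 0.0)
    h = (c_up - c_down) / (s_up - s_down)   # hedge ratio
    v1 = h * s_up - c_up                    # equals h * s_down - c_down
    return h * s0 - v1 / (1 + r)

# hypothetical: S0 = 100, up to 110 or down to 90, K = 100, R = 5%
c0 = binomial_call_price(100, 110, 90, 100, 0.05)  # about 7.14
```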
a parameter
refers to any descriptive measure of a population characteristic
normal distribution z-score rejection points
Two-sided
One-sided
10%
1.645
1.28
5%
1.96
1.645
1%
2.58
2.33
quintiles
1st
2nd
3rd
4th
5th
1st = 1/5
2nd = 2/5 etc
e.g. want the 4th quintile
(4/5)*(n+1); with n = 10 this gives 8.8
so the answer lies between positions 8 and 9
set numbers into ascending order
and interpolate e.g.
X8 + (8.8 − 8) × (X9 − X8)
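The location-and-interpolate recipe above, sketched in Python (the function name and the data 1..10 are my own illustration):

```python
def quantile_position(data, pct):
    """Percentile by the L = (n + 1) * y/100 location rule with linear
    interpolation between the bracketing order statistics."""
    s = sorted(data)                 # ascending order first
    n = len(s)
    loc = (n + 1) * pct / 100
    k = int(loc)                     # integer part: 1-indexed position
    if k < 1:
        return s[0]
    if k >= n:
        return s[-1]
    frac = loc - k
    return s[k - 1] + frac * (s[k] - s[k - 1])

# hypothetical: data 1..10, 80th percentile -> L = 8.8 -> X8 + 0.8 (X9 - X8)
q = quantile_position(list(range(1, 11)), 80)
```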
When working backward from the nodes on a binomial tree diagram, the analyst is most likely attempting to calculate:
In a tree diagram, a problem is worked backward to formulate an expected value as of today
when the test statistic > critical statistic
reject H0
> the correl coefficient is statistically significant
remember: for a two-tailed test, use the significance level divided by 2 in each tail
Ln(1+ discrete return)
= continuous return
null hypothesis must always include the
equal sign
a test of independence using a nonparametric test statistic that is chi-square distributed
χ2 = ∑ from i=1 to m of (Oij − Eij)^2 / Eij
> m = the number of cells in the table, which is the number of groups in the first class multiplied by the number of groups in the second class;
Oij = the number of observations in each cell of row i and column j (i.e., observed frequency); and
Eij = the expected number of observations in each cell of row i and column j, assuming independence (i.e., expected frequency).
(r − 1)(c − 1) degrees of freedom, where r is the number of rows and c is the number of columns
Eij=(Total row i)×(Total column j)/ Overall total
Standardized residual = (Oij − Eij) / √Eij
ML model that has been overfitted is not able
to accurately predict outcomes using a different dataset and might be too complex
‘overtrained’
> treating true parameters as if they are noise is most likely a result of underfitting the model
correlation coeff
(sign of b1) * sqrt(R^2)
cash return on assets
= (Cash flow from operations/Average total assets)
F-distributed test statistic to test whether the ? in a regression are equal to zero,
slopes
H0: b1 = 0. Ha: b1 ≠ 0
Arithmetic mean x Harmonic mean =
Geometric mean^2 (holds for two values)
Arithmetic mean
≥ Geometric mean ≥ Harmonic mean
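Both relations can be checked numerically; the two-value sample [4, 16] is a hypothetical example for which AM = 10, GM = 8, HM = 6.4:

```python
import math
import statistics

def mean_relations(data):
    """Arithmetic, geometric, and harmonic means of positive data.
    AM >= GM >= HM always; AM * HM = GM^2 holds exactly for two values."""
    am = statistics.mean(data)
    gm = math.prod(data) ** (1 / len(data))
    hm = len(data) / sum(1 / x for x in data)
    return am, gm, hm

am, gm, hm = mean_relations([4, 16])
```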