Quant Flashcards

1
Q

An interest rate, r, can have three interpretations:

A

(1) a required rate of return, (2) a discount rate, or (3) an opportunity cost. An interest rate reflects the relationship between differently dated cash flows.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

The time value of money establishes WHAT?

A

the equivalence between cash flows occurring on different dates. As cash received today is preferred to cash promised in the future, we must establish a consistent basis for this trade-off to compare financial instruments in cases in which cash is paid or received at different times.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

required rates of return—

A

that is, the minimum rate of return an investor must receive to accept an investment.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

True or False : we use the terms “interest rate” and “discount rate” almost interchangeably.

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

opportunity costs

A

An opportunity cost is the value that investors forgo by choosing a course of action.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

r = Real risk-free interest rate + … +

A

r = Real risk-free interest rate + Inflation premium + Default risk premium + Liquidity premium + Maturity premium.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

The real risk-free interest rate is :

A

the single-period interest rate for a completely risk-free security if no inflation were expected.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

The inflation premium :

A

The inflation premium compensates investors for expected inflation and reflects the average inflation rate expected over the maturity of the debt. Inflation reduces the purchasing power of a unit of currency—the amount of goods and services one can buy with it.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What does the time value of money establish?

A

The equivalence between cash flows occurring on different dates.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Why is cash received today preferred to cash promised in the future?

A

Because there is a preference for immediate receipt, requiring a basis to compare cash flows at different times.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Define an interest rate (or yield).

A

It is a rate of return that reflects the relationship between differently dated cash flows.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

If USD 9,500 today is equivalent to USD 10,000 in one year, what is the implied interest rate?

A

5.26% (USD 500/USD 9,500).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What are the three ways interest rates can be thought of?

A

As required rates of return, discount rates, and opportunity costs.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is the required rate of return?

A

The minimum rate an investor must receive to accept an investment.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

How do interest rates function as discount rates?

A

They equate the value of future cash flows to their present value.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is the opportunity cost in the context of interest rates?

A

The value forgone by choosing one action over another, such as consuming instead of saving.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

How are interest rates determined in the market?

A

By the forces of supply and demand for funds.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

What is the formula for interest rates incorporating risk factors?

A

A:
𝑟
=
Realrisk-freerate
+
Inflationpremium
+
Defaultriskpremium
+
Liquiditypremium
+
Maturitypremium
r=Realrisk-freerate+Inflationpremium+Defaultriskpremium+Liquiditypremium+Maturitypremium.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

What does the real risk-free interest rate represent?

A

The single-period interest rate for a completely risk-free security with no expected inflation.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

What does the inflation premium compensate investors for?

A

Expected inflation over the maturity of the debt.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

What is the default risk premium?

A

Compensation for the possibility of a borrower failing to make a promised payment.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

What does the liquidity premium reflect?

A

The risk of loss if an investment needs to be quickly converted to cash.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

What is the nominal risk-free interest rate?

A

The sum of the real risk-free rate and the inflation premium.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

What are the two primary ways financial assets generate returns?

A

A: Through periodic income (e.g., dividends, interest) and capital gains or losses from price changes.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

Q: Why is the geometric mean return often preferred for multi-period return calculations?

A

A: It accounts for compounding, providing a more accurate representation of portfolio growth.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
26
Q

Q: What is the relationship between arithmetic mean and geometric mean returns?

A

A: The geometric mean is always less than or equal to the arithmetic mean unless all returns are identical.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
27
Q

Q: What is the harmonic mean, and when is it used?

A

A: The harmonic mean averages rates or ratios and is used when data involves quantities like P/E ratios or costs per unit.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
28
Q

Q: What is the key difference between trimmed and winsorized means?

A

A: Trimmed means exclude extreme values, while winsorized means replace them with the nearest non-extreme values.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
29
Q

Q: Why might the arithmetic mean be biased upward in return calculations?

A

A: It does not account for compounding or variability in returns.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
30
Q

Q: What is the main application of the geometric mean in investment?

A

A: To measure the compound annual growth rate of an investment over multiple periods.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
31
Q

Q: In which situation is the harmonic mean particularly useful?

A

A: When averaging prices or ratios in cost averaging strategies.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
32
Q

Q: How is the bias of the arithmetic mean affected by return variability?

A

A: The greater the variability in returns, the larger the difference between the arithmetic and geometric means.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
33
Q

Q: What formula connects arithmetic, geometric, and harmonic means?

A

A: Arithmetic Mean
×
× Harmonic Mean = (Geometric Mean)
2
2
.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
34
Q

In fact, the geometric mean is always less than or equal to the arithmetic mean with one exception:

A

the two means will be equal is when there is no variability in the observations—that is, when all the observations in the series are the same.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
35
Q

the trimmed mean

A

removes a small defined percentage of the largest and smallest values from a dataset containing our observation before calculating the mean by averaging the remaining observations.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
36
Q

The winsorized mean is :

A

calculated after replacing extreme values at both ends with the values of their nearest observations, and then calculating the mean by averaging the remaining observations.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
37
Q

Flashcard 1
Q: Why don’t arithmetic and geometric return computations account for portfolio cash flow timing?

A

A: They don’t consider the timing of cash inflows and outflows, which can significantly impact returns depending on when investments are made or withdrawn.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
38
Q

What is the money-weighted return?

A

A: It accounts for the timing and amount of money invested, similar to the internal rate of return (IRR), showing the actual return earned by the investor.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
39
Q

Q: How is the internal rate of return (IRR) calculated in relation to cash flows?

A

A: It is the discount rate that equates the present value of all cash inflows and outflows to zero.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
40
Q

How does a positive cash flow differ from a negative one?

A

A: Positive cash flow is money received (inflows), while negative cash flow is money spent (outflows).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
41
Q

Q: What is the IRR for an investment with the cash flows

100
,

950
,
+
350
,
+
1270
−100,−950,+350,+1270 over 3 years?

A

A: 26.11%.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
42
Q

How does the timing of cash flows affect the money-weighted return?

A

A: It gives greater weight to periods with larger investments, impacting the overall return calculation.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
43
Q

Q: How is the time-weighted return calculated?

A

A: By breaking the period into subperiods, calculating each subperiod’s return, and linking them geometrically.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
44
Q

Q: What is the main advantage of the time-weighted return?

A

A: It neutralizes the effects of cash inflows and outflows, making it suitable for comparing portfolio managers.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
45
Q

Q: How does daily portfolio valuation improve time-weighted return accuracy?

A

A: Frequent valuation minimizes the impact of cash flow timing on return approximation.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
46
Q

Q: In a dividend-paying stock example, what is the IRR when cash flows are

200
,

220
,
+
480
−200,−220,+480?

A

A: 9.39%.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
47
Q

Q: What was the mean holding period return for a portfolio with yearly returns of 15% and 6.67%?

A

A: 10.84%.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
48
Q

Q: Why might two investors in the same fund have different money-weighted returns?

A

A: Differences in the timing and amounts of their investments lead to varying individual returns.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
49
Q

Q: What distinguishes the money-weighted return from the time-weighted return?

A

A: The money-weighted return considers the timing and amount of cash flows, while the time-weighted return eliminates their effects.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
50
Q

True or False : The arithmetic and geometric return computations do account for the timing of cash flows into and out of a portfolio

A

False : The arithmetic and geometric return computations do not account for the timing of cash flows into and out of a portfolio

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
51
Q

Q: Why is it important to annualize returns?

A

A: Annualizing returns facilitates comparison by converting daily, weekly, monthly, or quarterly returns into a standard annualized rate.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
52
Q

What formula is commonly used for option pricing that requires annualized returns?

A

A: The Black–Scholes option-pricing model.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
53
Q

What is a limitation of annualizing short-term returns?
.

A

A: It assumes returns can be reinvested repeatedly at the same rate, which may not be realistic

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
54
Q

How do continuously compounded returns differ from holding period returns?
.

A

A: Continuously compounded returns are slightly smaller than holding period returns but allow for additive properties in calculations

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
55
Q

That is the significance of
𝑐 in the annualizing formula?

A

A:
𝑐
c represents the number of compounding periods in a year, such as 12 for monthly, 52 for weekly, or 365 for daily returns.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
56
Q

What does gross return measure?

A

A: Gross return is the return on assets managed, excluding management fees, custody fees, and taxes, but including trading expenses and commissions.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
57
Q

Q: How is net return different from gross return?

A

A: Net return accounts for all managerial and administrative expenses, reflecting what the investor actually receives.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
58
Q

Q: What does pre-tax nominal return represent?

A

A: It is the return without adjustments for taxes or inflation.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
59
Q

Q: How is after-tax nominal return calculated?

A

A: By deducting taxes on dividends, interest, and realized gains from the total return.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
60
Q

How does leverage impact returns?

A

A: Leverage amplifies both gains and losses; if portfolio returns exceed borrowing costs, it increases returns, otherwise it decreases them.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
61
Q

Q: What is the risk premium for an asset?

A

A: The return earned above the risk-free rate for taking on additional risk.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
62
Q

Q: Why might small mutual funds waive part of their expenses?

A

A: To remain competitive due to their limited ability to spread fixed administrative costs over a large asset base.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
63
Q

Q: Why are real returns useful for international comparisons?

A

A: They account for inflation differences, allowing for consistent comparisons across currencies and time periods.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
64
Q

Q: What are the three general cash flow patterns associated with fixed-income instruments?

A

A: 1. Discount instruments (single principal cash flow at maturity).
2. Periodic interest instruments (periodic payments and principal at maturity).
3. Level payments (uniform periodic payments combining interest and principal).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
65
Q

How do interest rate changes affect bond prices?

A

A: Bond prices move inversely to changes in interest rates; an increase in rates lowers bond prices, while a decrease raises them.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
66
Q

Q: What is Yield-to-Maturity (YTM) and how is it related to bond valuation?

A

A: YTM is the discount rate at which the present value of all future cash flows equals the bond’s current price. It represents the bond’s expected annual return if held to maturity.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
67
Q

Q: What is the implied return for a fixed-income instrument?

A

A: The implied return is the discount rate (YTM) that equates the present value of an instrument’s cash flows to its price.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
68
Q

How is YTM for a coupon bond calculated?

A

A: YTM is the discount rate that equates the present value of all coupon and principal payments to the bond’s price.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
69
Q

Q: What does the price-to-earnings (P/E) ratio indicate?

A

A: The P/E ratio measures the market price per share relative to earnings per share, indicating how much investors are willing to pay for each dollar of earnings.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
70
Q

Q: How does the dividend payout ratio affect the forward P/E ratio?

A

A: A higher dividend payout ratio increases the forward P/E ratio, assuming constant required return and growth rate.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
71
Q

Q: What should an investor do if the required return exceeds the expected return?

A

A: The stock may be overvalued, and the investor should consider avoiding or selling the position.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
72
Q

Q: What is the cash flow additivity principle?

A

A: It states that the present value of any future cash flow stream, indexed at the same point in time, equals the sum of the present values of its individual cash flows.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
73
Q

Q: How does the cash flow additivity principle ensure no-arbitrage?

A

A: By ensuring market prices reflect the true value of combined cash flows, it prevents the possibility of earning riskless profits without transaction costs.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
74
Q

Q: How is the cash flow additivity principle used to calculate implied forward interest rates?

A

A: It equates the present value of investing at a long-term rate to the compounded returns of sequential short-term rates, ensuring no arbitrage.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
75
Q

Q: How does cash flow additivity apply to forward exchange rates?
.

A

A: It ensures no arbitrage between different currencies by equating the returns from domestic and foreign investments adjusted for the forward rate

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
76
Q

Q: How is the cash flow additivity principle used in option pricing?

A

A: By constructing replicating portfolios that match the option’s cash flows in all future scenarios, ensuring no-arbitrage pricing.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
77
Q

deleteQ: What is the hedge ratio for a put option in the example?

A

A: The hedge ratio is 0.75, representing the number of asset units needed to replicate the option’s payoff.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
78
Q

Q: In a one-period binomial model, how are option prices determined?

A

A: By equating the present value of a replicating portfolio’s cash flows to its payoff under different scenarios.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
79
Q

Q: What happens if the forward rate deviates from the no-arbitrage level?

A

A: Investors can exploit the difference by arbitrage, earning riskless profits until equilibrium is restored.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
80
Q

What is a measure of central tendency?

A

A: It specifies where the data are centered and shows the “expected” value based on the observed sample, such as the mean, median, or mode.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
81
Q

: What is the arithmetic mean?

A

A: The arithmetic mean is the sum of all observations in a dataset divided by the number of observations.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
82
Q

What is the median, and why might it be preferred over the mean?

A

A: The median is the middle value of a dataset sorted in order. It is less affected by outliers compared to the mean.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
83
Q

Q: What is the mode in a dataset?

A

A: The mode is the most frequently occurring value. A dataset can be unimodal, bimodal, or have no mode.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
84
Q

Q: What are the three methods for dealing with outliers?

A

A: 1) Do nothing, 2) Delete outliers (e.g., trimmed mean), 3) Replace outliers (e.g., winsorized mean).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
85
Q

: What is a trimmed mean?

A

A: It is an arithmetic mean calculated after excluding a small percentage of the lowest and highest values in a dataset.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
86
Q

Q: What is a winsorized mean?

A

A: It replaces extreme values with the nearest specified percentile values, such as the 2.5th and 97.5th percentiles for a 95% winsorized mean.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
87
Q

Q: What are quantiles, and how are they categorized?

A

A: Quantiles divide a dataset into equal parts. Categories include quartiles (4 parts), quintiles (5 parts), deciles (10 parts), and percentiles (100 parts).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
88
Q

Q: What is the interquartile range (IQR)?

A

A: The IQR is the difference between the third quartile (Q3) and the first quartile (Q1):
IQR=𝑄3 −𝑄1
IQR=Q3−Q1.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
89
Q

How is the median identified in a histogram?

A

A: The median corresponds to the bin that contains the 50th percentile of the observations.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
90
Q

Q: How can quantiles assist in investment practice?

A

A: Quantiles help rank performance (e.g., fund rankings) and analyze characteristics like asset returns across different subsets of data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
91
Q

What does the mean return in investments represent?

A

A: The mean return represents the reward of an investment.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
92
Q

Q: What does dispersion in investment returns address?

A

A: Dispersion addresses risk and uncertainty by showing how returns are distributed around the mean.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
93
Q

Q: How is the range of a dataset calculated?

A

A: Range = Maximum value − Minimum value.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
94
Q

Q: What is a key advantage and disadvantage of the range as a measure of dispersion?

A

A: Advantage: Easy to compute.
Disadvantage: Only uses the maximum and minimum values, ignoring the rest of the dataset

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
95
Q

What does the Mean Absolute Deviation (MAD) measure?
.

A

A: MAD measures the average of the absolute deviations from the mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
96
Q

Why is sample variance a preferred measure over MAD?

A

A: Sample variance is easier to manipulate mathematically because it uses squared deviations instead of absolute deviations.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
97
Q

What is the key difference between variance and standard deviation?

A

A: Variance is in squared units, while standard deviation is the square root of variance and shares the same units as the original data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
98
Q

What is the target downside deviation?

A

A: It measures the dispersion of observations below a specified target, often used to assess downside risk.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
99
Q

Why is the coefficient of variation (CV) useful?

A

A: It measures relative dispersion, allowing comparison across datasets with different means or units of measurement.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
100
Q

How does standard deviation relate to downside deviation?

A

A: Standard deviation considers both upside and downside variations, whereas downside deviation focuses only on variations below a specified target.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
101
Q

Q: What does the mean return represent in investments?

A

A: The mean return represents the expected reward of an investment

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
102
Q

What is dispersion in the context of investment returns?

A

A: Dispersion refers to the variability or spread of returns around the mean, addressing risk and uncertainty.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
103
Q

What do mean and variance fail to describe in an investment’s return distribution?

A

A: They do not indicate whether large deviations are likely to be positive or negative.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
104
Q

Q: What is a symmetrical distribution?

A

A: A distribution where each side is a mirror image of the other, with equal loss and gain intervals having the same frequencies.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
105
Q

Q: What are the characteristics of a normal distribution?

A

A: The mean, median, and mode are equal, and it is completely described by its mean and variance.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
106
Q

Q: What does a positively skewed distribution indicate?

A

A: Frequent small losses and a few extreme gains, with a long tail on the right sid

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
107
Q

Q: What does a negatively skewed distribution indicate?

A

A: Frequent small gains and a few extreme losses, with a long tail on the left side.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
108
Q

Q: How is skewness calculated?

A

A: By averaging the cubed deviations from the mean, standardized by dividing by the standard deviation cubed.

109
Q

Q: What is kurtosis, and what does it measure?

A

A: Kurtosis measures the combined weight of the tails of a distribution relative to the rest of the distribution.

110
Q

Q: What are the characteristics of leptokurtic distributions?

A

A: They are fat-tailed, generating more frequent extreme deviations from the mean than normal distributions.

111
Q

Q: What does a kurtosis value of 3.0 indicate?

A

A: A mesokurtic distribution with tails similar to the normal distribution.

112
Q

Q: What is excess kurtosis?

A

A: The kurtosis relative to the normal distribution, where a normal distribution has excess kurtosis of 0.

113
Q

Q: What conclusion can be drawn if a stock’s returns have an excess kurtosis of -0.75?

A

A: The distribution is thin-tailed compared to the normal distribution.

114
Q

Q: What is indicated by a positive skewness in a cross-section of annual returns?

A

A: The returns are positively skewed, meaning the distribution has more extreme positive deviations than negative ones.

115
Q

Q: What is the primary purpose of a scatter plot?

A

A: To display and understand potential relationships between two variables.

116
Q

Q: How does tight clustering in a scatter plot indicate the strength of a relationship?

A

A: Tight clustering signals a stronger relationship, while loose clustering signals a weaker relationship.

117
Q

Q: What does a scatter plot with data points tightly clustered along a positively sloped line suggest?

A

A: A strong positive linear relationship between two variables.

118
Q

Q: What is sample covariance?

A

A: A measure of how two variables in a sample move together, calculated as the average product of their deviations from the mean.

119
Q

Q: Why is covariance difficult to interpret?

A

A: It involves squared units, making its magnitude dependent on the variables’ scale.

120
Q

Q: What is the sample correlation coefficient?

A

A: A standardized measure of how two variables move together, calculated as the ratio of sample covariance to the product of standard deviations.

121
Q

Q: What are the possible values for the correlation coefficient?

A

A: Between −1 and +1.

122
Q

Q: What does a correlation coefficient of 0 indicate?

A

A: No linear relationship between the two variables.

123
Q

Q: What are potential problems with interpreting a correlation coefficient?

A

A: Outliers, spurious correlation, and non-linear relationships.

124
Q

Q: What is spurious correlation?

A

A: A correlation that arises by chance, from a calculation involving a third variable, or from relationships with a third variable.

125
Q

Q: Why is it essential to avoid assuming causation from correlation?

A

A: Correlation does not imply that one variable causes changes in another; the relationship could be coincidental or influenced by a third variable.

126
Q

ront: How does the expected value differ from the historical or sample mean?
Back:

A

Expected value: A forecast of the “true” mean (population mean), used to predict future outcomes.
Sample mean: A historical measure summarizing observed data, calculated as an equally weighted average of past observations.

127
Q

Front: How do variance and standard deviation differ?

A

Back:

Variance
𝜎^2(𝑋)σ^2
(X): The dispersion of outcomes, expressed in squared units of the random variable.
Standard deviation
𝜎(𝑋)
σ(X): The square root of variance, expressed in the same units as the random variable.

128
Q

Front: How does a probability tree help in investment analysis?

A

Back:
A probability tree represents scenarios and their associated probabilities visually. It aids in:

Structuring mutually exclusive and exhaustive outcomes.
Calculating conditional expectations for different scenarios.
Working backward to compute the unconditional expected value.

129
Q

Front: What is the purpose of conditional variance?

A

Back:
Conditional variance measures the variability or risk of outcomes given a specific scenario. It provides insight into the dispersion of a random variable around its conditional mean under that scenario.

130
Q

Front: Why is consistency between unconditional and conditional expectations important?

A

Back:
Consistency ensures that all probabilities and expected values align logically. Inconsistent calculations can lead to flawed investment decisions and missed opportunities.

131
Q

Front: How are mutually exclusive and exhaustive events used in probability trees?

A

Back:

Mutually exclusive: Events cannot occur simultaneously.
Exhaustive: The set of events covers all possible outcomes.
These properties ensure all probabilities in a tree diagram sum to 1.

132
Q

Front: What is the role of scenarios in investment decision-making?

A

Back:
Scenarios represent key factors or events that influence outcomes. By analyzing probabilities and expectations under different scenarios, investors can make informed decisions about risk and return.

133
Q

Front: Why is Bayes’ formula important in investment decision-making?

A

Back:
It helps update beliefs and probabilities as new information becomes available, allowing investors to make rational decisions under uncertainty.

134
Q

Front: How does Bayes’ formula reverse the “given that” information?

A

Back:
Instead of calculating the likelihood of an observation given a scenario, it calculates the likelihood of a scenario given the observation.

135
Q

Front: What are prior probabilities in Bayes’ formula?

A

Back:
Prior probabilities represent beliefs about the likelihood of an event before receiving new information.

136
Q

Front: What are conditional probabilities or likelihoods?

A

Back:
These are the probabilities of observing specific information given a particular event or scenario has occurred.

137
Q

Front: What is the posterior probability in Bayes’ formula?

A

Back:
The posterior probability is the updated likelihood of an event after considering new information. It reflects the combined impact of priors and observations.

138
Q

Front: What does it mean if the posterior probability is higher than the prior probability?

A

Back:
It means the new information makes the event more likely than initially thought, reflecting a positive revision in beliefs.

139
Q

Front: Why is Bayes’ formula described as a learning mechanism in probability theory?

A

Back:
It allows for continuous updating of beliefs as new data arrives, adapting predictions and decisions dynamically.

140
Q

Q: For a portfolio with weights (0.50, 0.25, 0.25) and expected returns (13%, 6%, 15%), what is
𝐸(𝑅𝑝)?

A

A:
𝐸(𝑅𝑝)=0.50(13)+0.25(6)+0.25(15)=11.75%

141
Q

For weights (0.50, 0.25, 0.25) and a covariance matrix, what is
𝜎2(𝑅𝑝)?

A

A: Using variances and covariances:
𝜎2(𝑅𝑝)=195.875

Standard deviation:
𝜎(𝑅𝑝)=14%

142
Q

Q: What is the impact of decreasing covariance on portfolio risk?

A

A: Decreasing covariance reduces portfolio variance, thus lowering portfolio risk.

143
Q

Q: What are the bounds of the correlation coefficient?

A

A: The correlation coefficient ranges from −1 to +1.

144
Q

Given a portfolio of five stocks, how many unique covariance terms, excluding variances, are required to calculate the portfolio return variance?

a 10
b 20
c 25

A

A is correct. A covariance matrix for five stocks has 5 × 5 = 25 entries. Subtracting the 5 diagonal variance terms results in 20 off-diagonal entries. Because a covariance matrix is symmetrical, only 10 entries are unique (20/2 = 10).

145
Q

Which of the following statements is most accurate? If the covariance of returns between two assets is 0.0023, then the:

a assets’ risk is near zero.
b asset returns are unrelated.
c asset returns have a positive relationship.

A

C is correct. The covariance of returns is positive when the returns on both assets tend to be on the same side (above or below) their expected values at the same time.

146
Q

Q: What is covariance in portfolio returns?

A

A: Covariance measures the degree to which two securities’ returns move together. Positive covariance indicates that returns move in the same direction, while negative covariance shows they move in opposite directions.

147
Q

Flashcard 3
Q: What is the relationship between independence and the joint probability function?

A

A: Two random variables are independent if P(X,Y)=P(X)P(Y). This means the joint probability is the product of individual probabilities.

148
Q

deleteWhat insights did Isabel Vasquez gain from the equity and bond class data?

A

A: Vasquez observed that equity classes had higher variances and covariances than bond classes, but their correlation was lower. Long-term bonds had higher volatility than intermediate-term bonds but still showed high correlation.

149
Q

Q: Why might historical covariance and correlation estimates be used?

A

A: Historical data provides a basis for forecasting future relationships between asset returns, though alternative methods like market model regressions can also be used.

150
Q

Q: How are deviations used in covariance calculations?

A

A: Deviations are the differences between actual returns and their expected values. These deviations are multiplied and weighted by joint probabilities to compute covariance.

151
Q

Q: What is shortfall risk?

A

A: Shortfall risk is the risk that a portfolio’s value or return will fall below a minimum acceptable level over a specified time horizon.

152
Q

What is Roy’s safety-first criterion?

A

A: Roy’s safety-first criterion states that the optimal portfolio minimizes the probability that portfolio return will fall below a specified threshold return RL .

153
Q

How do you determine the optimal portfolio using the SFRatio?

A

A: Calculate the SFRatio for each portfolio and select the portfolio with the highest SFRatio, as it has the lowest probability of falling below the shortfall level.

154
Q

How does the SFRatio compare to the Sharpe ratio?

A

A: The SFRatio is similar to the Sharpe ratio, but substitutes the shortfall level
𝑅𝐿 for the risk-free rate 𝑅𝐹. Maximizing the Sharpe ratio minimizes the probability of returns falling below 𝑅𝐹.

155
Q

Why does the normal distribution play a role in portfolio risk analysis?

A

A: Normal distribution simplifies calculations for probabilities, including shortfall risks and measures like VaR, under the assumption of normally distributed returns.

156
Q

What are VaR and stress testing used for in financial risk management?

A

A: VaR measures the minimum expected losses over a specified time and probability level, while stress testing estimates losses under extreme scenarios.

157
Q

Q: What is the relationship between a lognormal distribution and a normal distribution?

A

A: A random variable
𝑌 follows a lognormal distribution if ln(𝑌)ln(Y) is normally distributed. Conversely, if ln(𝑌)Ln(Y) is normal, Y is lognormal.

158
Q

Why is the lognormal distribution used to model asset prices?

A

A: Asset prices are bounded below by 0 and exhibit right skewness, characteristics well captured by the lognormal distribution.

159
Q

What are the two key properties of the lognormal distribution?

A

A: It is bounded below by 0 and has a long right tail, making it suitable for modeling asset prices.

160
Q

Q: How are the parameters of a lognormal distribution determined?

A

A: They are derived from the mean (𝜇) and variance (𝜎2) of the associated normal distribution of ln(𝑌)ln(Y).

161
Q

How does the assumption of normality in continuously compounded returns affect the modeling of asset prices?

A

A: If continuously compounded returns are normally distributed, future asset prices are lognormally distributed.

162
Q

Q: What assumptions are critical when using the lognormal distribution to model asset prices?

A

A: Returns are assumed to be independently and identically distributed (i.i.d.), with a constant mean and variance.

163
Q

Q: Why is the lognormal distribution a good approximation for asset prices even if returns are not perfectly normal?

A

A: Due to the central limit theorem, sums of returns tend to normality, supporting the lognormal model for prices.

164
Q

Q: What is the relationship between volatility and the standard deviation of continuously compounded returns?

A

A: Volatility is the annualized standard deviation of continuously compounded returns, often estimated using historical data.

165
Q

The two most noteworthy observations about the lognormal distribution are :

A

Are that it is bounded below by 0 and it is skewed to the right (it has a long right tail).

166
Q

What is Monte Carlo simulation?

A

A: It is a technique that generates a large number of random samples from specified probability distributions to obtain the likelihood of a range of results.

167
Q

: How is Monte Carlo simulation used in investment applications?

A

A: It estimates portfolio risk and return by simulating profit and loss performance over a specified time horizon and derives performance and risk measures from a frequency distribution of portfolio returns.

168
Q

Q: Why is Monte Carlo simulation useful for valuing complex securities?

A

A: It is helpful for pricing securities without an analytic formula, such as mortgage-backed securities, by modeling their sensitivity to changes in assumptions.

169
Q

What are the key steps involved in Monte Carlo simulation?

A

A:

Specify the quantity of interest and underlying variables.
Set a time grid and divide the time horizon into subperiods.
Define the method for generating data based on distributional assumptions.
Generate random values for key risk factors.
Use simulated values to calculate outcomes.
Repeat trials to produce summary statistics.

170
Q

How does Monte Carlo simulation value contingent claims, such as an Asian option?

A

A: By simulating stock prices over time, calculating the average price during the claim’s life, and determining the payoff as the difference between the final and average stock prices, if positive.

171
Q

Q: What are the strengths of Monte Carlo simulation compared to analytical methods?

A

A: It allows testing model sensitivity to assumptions and can handle complex securities and scenarios where no analytic solutions exist.

172
Q

Q: What are the weaknesses of Monte Carlo simulation?

A

A: It provides statistical estimates rather than exact results and cannot independently establish cause-and-effect relationships.

173
Q

Q: How can Monte Carlo simulation test the sensitivity of models in investment applications?

A

A: By modifying key assumptions, such as distributions of risk factors, and observing how changes affect outcomes.

174
Q

Compared with analytical methods, what are the strengths and weaknesses of using Monte Carlo simulation for valuing securities?

A

Solution:

Strengths: Monte Carlo simulation can be used to price complex securities for which no analytic expression is available, particularly European-style options.

Weaknesses: Monte Carlo simulation provides only statistical estimates, not exact results. Analytic methods, when available, provide more insight into cause-and-effect relationships than does Monte Carlo simulation.

175
Q

A Monte Carlo simulation can be used to:

directly provide precise valuations of call options.
simulate a process from historical records of returns.
test the sensitivity of a model to changes in assumptions—for example, on distributions of key variables.

A

Solution:

C is correct. A characteristic feature of Monte Carlo simulation is the generation of a large number of random samples from a specified probability distribution or distributions to represent the role of risk in the system. Therefore, it is very useful for investigating the sensitivity of a model to changes in assumptions—for example, on distributions of key variables.

176
Q

A limitation of Monte Carlo simulation is:

its failure to do “what if” analysis.
that it requires historical records of returns.
its inability to independently specify cause-and-effect relationships.

A

Solution:

C is correct. Monte Carlo simulation is a complement to analytical methods. Monte Carlo simulation provides statistical estimates and not exact results. Analytical methods, when available, provide more insight into cause-and-effect relationships.

177
Q

Q: What is bootstrap resampling?

A

A: A computational method that repeatedly draws samples from an observed dataset (with replacement) to estimate population parameters like mean, variance, skewness, and kurtosis.

178
Q

How does bootstrap mimic sampling from a population?

A

A: By treating the observed sample as a proxy for the population, it simulates sampling by resampling with replacement from the original data.

179
Q

: What are the steps in bootstrap resampling for investment simulations?

A

A:

Specify the quantity of interest and underlying variables.
Set a time grid consistent with the data’s periodicity.
Use observed data as the empirical distribution for simulations.
Generate random values via resampling and simulate outcomes.
Calculate outcomes like contingent claim value.
Repeat trials and derive summary statistics.

180
Q

Q: How is bootstrap resampling different from Monte Carlo simulation?

A

A: Bootstrap uses observed data to create an empirical distribution, while Monte Carlo simulation generates random data from predefined probability distributions.

181
Q

What are the strengths of bootstrap resampling?

A

A:

Simple to perform.
Provides a good representation of population features based on observed data.
Avoids reliance on analytical formulas like z- or t-statistics.

182
Q

What are the weaknesses of bootstrap resampling?

A

A:

Provides statistical estimates, not exact results.
Results depend on the quality and representativeness of the observed sample.

183
Q

What are the main strengths and weaknesses of bootstrapping?

A

Solution:

Strengths:

Bootstrapping is simple to perform.

Bootstrapping offers a good representation of the statistical features of the population and can simulate sampling from the population by sampling from the observed sample.

Weaknesses:

Bootstrapping provides only statistical estimates, not exact results.

184
Q

Front: Why do analysts use sampling instead of examining the entire population?

A

Back: Analysts use sampling to save time and money when examining the entire population is infeasible or inefficient.

185
Q

Front: What is the key distinction between probability and non-probability sampling?

A

Back:

Probability Sampling: Every population member has an equal chance of being selected, producing representative samples with lower sampling error.
Non-Probability Sampling: Relies on convenience or judgment, with a higher risk of non-representative samples and potential bias.

186
Q

Front: What is simple random sampling, and when is it appropriate?

A

Back:

Definition: A subset where every population member has an equal probability of selection.
Use Case: Best for homogeneous populations with broadly similar characteristics.

187
Q

Front: How does stratified random sampling differ from simple random sampling?

A

Back:

Stratified Random Sampling: Divides the population into strata (subpopulations) and samples proportionally from each stratum.
Advantages: Ensures representation of all subpopulations and yields more precise parameter estimates.

188
Q

Front: What is cluster sampling, and what are its pros and cons?

A

Back:

Definition: Divides the population into clusters, then randomly selects entire clusters or subsamples within clusters.
Pros: Cost-efficient for large, dispersed populations.
Cons: Yields less accurate results than other probability sampling methods due to intra-cluster similarity.

189
Q

Front: What is convenience sampling, and what are its limitations?

A

Back:

Definition: Selects samples based on ease of access.
Limitations: May produce biased, non-representative samples, limiting accuracy.

190
Q

Front: What is judgmental sampling, and when might it be used?

A

Back:

Definition: Relies on the researcher’s expertise to select samples.
Use Case: When expert knowledge is critical to target specific, representative samples quickly (e.g., auditing).

191
Q

Front: How does sampling error arise, and what is its impact?

A

Back:

Cause: Arises when only a subset of the population is sampled, leading to differences between sample statistics and population parameters.
Impact: Sampling error decreases with larger, well-chosen samples, especially in probability sampling methods.

192
Q

Q: What does the Central Limit Theorem (CLT) state about the distribution of the sample mean?

A

A: For a population with mean 𝜇 and finite variance 𝜎2, the sampling distribution of the sample mean 𝑋ˉwill be approximately normal with mean 𝜇 and variance 𝜎2/𝑛 when the sample size
𝑛 is large, regardless of the population’s distribution.

193
Q

Q: Why is the Central Limit Theorem important in statistics?

A

A: The CLT enables us to make probability statements about the population mean using the sample mean, regardless of the population’s distribution, as long as the sample size is sufficiently large. This is crucial for constructing confidence intervals and hypothesis testing.

194
Q

Q: Are the following statements about sampling correct?

Sample means from large samples are approximately normally distributed.
The population must be normally distributed for the sample mean to be normally distributed.

A

A: Statement 1 is true; Statement 2 is false. The CLT ensures the sample mean is approximately normal for large sample sizes, regardless of the population’s distribution.

195
Q

Which of the following best describes the validity of the analyst’s statements?

Statement 1 is false; Statement 2 is true.
Both statements are true.
Statement 1 is true; Statement 2 is false.

A

Solution:

C is correct. According to the central limit theorem, Statement 1 is true. Statement 2 is false because the underlying population does not need to be normally distributed in order for the sample mean to be normally distributed.

196
Q

: Are the following statements about sampling correct?

Sample means from large samples are approximately normally distributed.
The population must be normally distributed for the sample mean to be normally distributed.

A

A: Statement 1 is true; Statement 2 is false. The CLT ensures the sample mean is approximately normal for large sample sizes, regardless of the population’s distribution.

197
Q

Q: What is the standard error of the sample mean?
.

A

A: The standard error of the sample mean is a measure of the variability of sample means around the true population mean, often estimated using resampling methods like bootstrap

198
Q

What is bootstrap resampling?

A

A: Bootstrap is a resampling method where samples are repeatedly drawn with replacement from the original data to approximate the sampling distribution of a statistic, such as the mean.

199
Q

How are bootstrap samples generated?

A

A: Each resample is the same size as the original sample and is created by randomly drawing items with replacement, allowing some items to appear multiple times and others to be excluded.

200
Q

: What is the purpose of the bootstrap sampling distribution?

A

A: It approximates the true sampling distribution and is used to estimate the standard error and other statistics without relying on analytical formulas.

201
Q

: How does bootstrap compare to jackknife resampling?

A

A: Bootstrap resamples with replacement, producing variability across runs, while jackknife excludes one observation at a time, providing consistent results across runs.

202
Q

Q: When is bootstrap especially useful?

A

A: Bootstrap is beneficial for estimating statistics of complex estimators or when no analytical formula is available. It is widely used in finance for tasks like historical simulations and performance benchmarking.

203
Q

Q: What is hypothesis testing?

A

A: Hypothesis testing is a statistical method used to make decisions about a population based on sample data. It determines whether the sample data supports a stated hypothesis about a population parameter.

204
Q

What are the two hypotheses in hypothesis testing?

A

A: The null hypothesis (H₀) is the hypothesis being tested, typically suggesting no effect or no difference. The alternative hypothesis (Hₐ) is what we accept if we reject H₀, indicating an effect or difference.

205
Q

Q: What are the six steps in the hypothesis testing process?

A

A:

State the hypotheses (H₀ and Hₐ).
Identify the test statistic and its distribution.
Specify the significance level (α).
State the decision rule.
Collect data and calculate the test statistic.
Make a decision (reject or fail to reject H₀).

206
Q

Q: What is the level of significance (α)?

A

A: The level of significance is the probability of rejecting a true null hypothesis (Type I error). Common values are 0.01, 0.05, and 0.10, corresponding to 99%, 95%, and 90% confidence levels.

207
Q

Q: What is a Type I error in hypothesis testing?

A

A: A Type I error occurs when the null hypothesis (H₀) is true, but we mistakenly reject it. Its probability is equal to the significance level (α).

208
Q

What is a Type II error in hypothesis testing?

A

A: A Type II error occurs when the null hypothesis (H₀) is false, but we fail to reject it. Its probability is denoted by β.

209
Q

Q: What is the power of a test?

A

A: The power of a test is the probability of correctly rejecting the null hypothesis when it is false. It is calculated as 1−𝛽

210
Q

How does specifying a smaller significance level (α) affect Type I and Type II errors?

A

A: A smaller α reduces the likelihood of a Type I error but increases the probability of a Type II error, assuming the sample size remains constant.

211
Q

Q: In hypothesis testing, what determines whether to reject or fail to reject H₀?

A

A: Compare the test statistic to critical values or compare the p-value to the significance level (α). Reject H₀ if the test statistic is more extreme than the critical value or if the p-value is less than α.

212
Q

Q: What is the p-value in hypothesis testing?

A

A: The p-value is the smallest level of significance at which the null hypothesis can be rejected. A smaller p-value indicates stronger evidence against H₀.

213
Q

Q: For the test H₀: μ = 10 versus Hₐ: μ ≠ 10 with a calculated t-statistic of 2.05 and critical t-values of ±1.984, what is the conclusion?

A

A: Reject H₀ because the test statistic (2.05) exceeds the critical value (+1.984), indicating statistical significance.

214
Q

Q: What are the null (H₀) and alternative (Hₐ) hypotheses in hypothesis testing?

A

A:

The null hypothesis (H₀) is the default assumption that there is no effect or no difference.
The alternative hypothesis (Hₐ) is the statement that contradicts H₀, suggesting an effect or difference exists.

215
Q

Q: What is the difference between a Type I error and a Type II error in hypothesis testing?

A

A:

Type I error: Rejecting the null hypothesis (H₀) when it is actually true (false positive).
Type II error: Failing to reject the null hypothesis when the alternative hypothesis (Hₐ) is true (false negative).

216
Q

What is the significance level, and how does it relate to the power of a test?

A

A:

The significance level (α) is the probability of making a Type I error. Commonly, α = 0.05.
The power of a test is the probability of correctly rejecting H₀ when Hₐ is true (1 − β). It increases with a larger sample size or effect size.

217
Q

Q: When is the t-distribution used instead of the normal (z) distribution in hypothesis testing?

A

A:
The t-distribution is used when the population standard deviation is unknown, and the sample size is relatively small. It accounts for additional variability due to estimating the standard deviation.

218
Q

Q: How are confidence intervals related to hypothesis tests?

A

A:

A confidence interval provides a range of plausible values for a population parameter.
If the hypothesized value (e.g., mean or variance) falls outside the confidence interval, H₀ is rejected at the corresponding significance level.

219
Q

Why might an analyst test variances, and what test statistic is commonly used?

A

A:

Analysts test variances to assess the consistency or volatility of data (e.g., returns before and after an event).
The chi-square statistic is used for single population variance tests, while the F-test is used to compare two variances.

220
Q

What is a parametric test?

A

A: A parametric test is concerned with parameters of distributions (e.g., mean, variance) and depends on specific assumptions about the population distribution.

221
Q

Q: What is a nonparametric test?

A

A: A nonparametric test is either not concerned with parameters or makes minimal assumptions about the population distribution, often used with ranked or ordinal data.

222
Q

When is it appropriate to use a nonparametric test?

A

A: Nonparametric tests are appropriate when:

Data do not meet distributional assumptions.
There are outliers.
Data are given in ranks or use an ordinal scale.
The hypothesis does not concern a parameter.

223
Q

Q: What are examples of nonparametric alternatives to parametric tests?

A

A: Examples include:

Wilcoxon signed-rank test (for a single mean or paired comparisons).
Mann–Whitney U test (for differences between means).
Sign test (for mean differences in paired data).

224
Q

Q: Why are parametric tests preferred over nonparametric tests when assumptions are met?

A

A: Parametric tests are preferred because they generally have more statistical power, meaning they are better at rejecting a false null hypothesis when assumptions are satisfied.

225
Q

Nonparametric procedures are primarily used in four situations:

A

(1) when the data do not meet distributional assumptions, (2) when there are outliers, (3) when the data are given in ranks or use an ordinal scale, or (4) when the relevant hypotheses do not concern a parameter.

226
Q

An analyst suspects that, in the most recent year, excess returns on stocks have fallen below 5%. She wants to study whether the excess returns are less than 5%. Designating the population mean as μ, which hypotheses are most appropriate for her analysis?

A.H0: µ = 5% versus Ha: µ ≠ 5%
B.H0: µ> 5% versus Ha: µ < 5%
C.H0: µ< 5% versus Ha: µ > 5%

A

B is correct. The null hypothesis is what she wants to reject in favor of the alternative, which is that population mean excess return is less than 5%. This is a one-sided (left-side) alternative hypothesis.

227
Q

Front: How does increasing the sample size
𝑛
n affect the t-test for correlation?

A

Back:

As
𝑛 increases, the t-statistic’s magnitude increases.
A larger sample size makes it more likely to reject the null hypothesis if it is false.

228
Q

Flashcard 1: Q: What is the null hypothesis (𝐻0) for a test of independence using a contingency table?

A

A:
𝐻0 : The two classifications are independent (e.g., dividend and financial leverage groups are not related).

229
Q

Q: What is the alternative hypothesis (𝐻𝑎) for a test of independence using a contingency table?

A

A: 𝐻𝑎: The two classifications are not independent (e.g., dividend and financial leverage groups are related).

230
Q

Q: What are dependent and independent variables in regression analysis?

A

A: The dependent variable (Y) is the one being explained, while the independent variable (X) is used to explain the variation in Y.

231
Q

Q: What does a scatter plot show in linear regression?

A

A: It shows the relationship between the dependent variable (Y) and independent variable (X), with each point representing a paired observation.

232
Q

Q: What is the goal of linear regression?

A

A: To fit a line to data that minimizes the sum of squared residuals, representing the best fit to the observed data.

233
Q

Q: How are slope and correlation related?

A

A: Slope uses variance of X, while correlation uses the product of standard deviations of X and Y. Both depend on the covariance.

234
Q

Q: What is the purpose of Ordinary Least Squares (OLS)?

A

A: OLS minimizes the sum of squared residuals to estimate the best-fitting regression line.

235
Q

Q: When might the intercept (𝑏0) lack practical meaning?

A

A: If the independent variable (X) cannot realistically take a value of zero, the intercept may not have a meaningful interpretation.

236
Q

What are the four key assumptions of a simple linear regression model?

A

A: Linearity, homoskedasticity, independence, and normality of residuals.

237
Q

What does the assumption of linearity mean in simple linear regression?

A

A: The relationship between the dependent variable
𝑌and the independent variable
𝑋 is linear, and
𝑋 is non-random.

238
Q

How can residual plots indicate a violation of the linearity assumption?

A

A: If residuals show a non-random pattern (e.g., curved or systematic trends), the linearity assumption may be violated.

239
Q

What is homoskedasticity in the context of regression?

A

A: The variance of the residuals is constant for all values of the independent variable
𝑋.

240
Q

What does a residual plot look like when the homoskedasticity assumption is violated?

A

A: Residuals may fan out or cluster at different levels of
𝑋
X, indicating heteroskedasticity.

241
Q

What does the independence assumption mean in regression?

A

A: Observations (pairs of
𝑋
X and
𝑌
Y) are uncorrelated, meaning residuals are not autocorrelated.

242
Q

How can you detect a violation of the independence assumption?

A

A: If residuals exhibit patterns (e.g., cycles or trends over time), the independence assumption may be violated.

243
Q

What is the role of the normality assumption in regression?

A

A: Residuals must be normally distributed to perform valid hypothesis testing on regression coefficient

244
Q

What is the purpose of ANOVA in regression analysis?

A

A: ANOVA decomposes the total variability of the dependent variable into variability explained by the regression (SSR) and the error (SSE). It evaluates the goodness of fit of the model using measures like the F-statistic.

245
Q

Q: How do you interpret the F-statistic in ANOVA for regression analysis?

A

A: The F-statistic tests if the slope coefficient is significantly different from zero, indicating that the independent variable explains variability in the dependent variable.

246
Q

Q: What is fintech in its broadest sense?

A

A: Fintech refers to technology-driven innovation in the financial services industry, including the design and delivery of financial products, companies developing these technologies, and the sector comprising such companies.

247
Q

What are the key characteristics of Big Data?

A

A: Big Data is characterized by the “3Vs”:

Volume: Large quantities of data.
Velocity: Rapid speed of data generation and transmission.
Variety: Data in diverse formats, such as structured, semi-structured, and unstructured data.

248
Q

Q: What is alternative data, and why is it significant in investment analysis?

A

A: Alternative data is non-traditional data (e.g., social media, sensor data) used to gain insights into consumer behavior, firm performance, and trends. It enhances investment models by providing new factors to assess security prices and trade execution.

249
Q

What are the three main sources of alternative data?

A

A:

Individuals: Data from social media, web searches, and e-commerce activities.
Business Processes: Data from transactions, corporate financials, and supply chains.
Sensors: Data from devices like smartphones, satellites, and the Internet of Things (IoT).

250
Q

What are some challenges associated with Big Data in investment analysis?

A

A: Challenges include ensuring data quality, addressing selection bias, handling missing data, and processing unstructured or large datasets, often requiring AI and machine learning techniques.

251
Q

What is the “fourth V” of Big Data, and why is it important?

A

A: The fourth V is Veracity, which relates to the credibility and reliability of data sources. Veracity is critical to ensuring accurate and trustworthy investment analysis.

252
Q

Q: What is artificial intelligence (AI)?

A

A: AI refers to computer systems capable of performing tasks that traditionally require human intelligence, such as cognitive and decision-making abilities.

253
Q

What is the main goal of machine learning (ML)?

A

A: The goal of ML is to automate decision-making processes by identifying patterns in data without human intervention or assumptions about the data’s underlying probability distribution.

254
Q

Q: What is the difference between supervised and unsupervised learning?

A

A: In supervised learning, inputs and outputs are labeled, allowing the algorithm to predict outcomes. In unsupervised learning, no labels are provided, and the algorithm identifies data structures or patterns independently.

255
Q

Q: What is deep learning, and how does it relate to neural networks?

A

A: Deep learning involves using neural networks with multiple hidden layers to perform multistage, non-linear data processing, enabling advanced pattern and relationship recognition.

256
Q

Q: What are overfitting and underfitting in machine learning?

A

A: Overfitting occurs when a model learns the training data too precisely, including noise, reducing its predictive power on new data. Underfitting happens when a model is too simplistic and fails to identify underlying data patterns.

257
Q

Q: How is AI applied in finance and other fields?

A

A: In finance, AI is used for fraud detection, Big Data analysis, and predicting market trends. Outside finance, AI powers virtual assistants, product recommendations, and victories in games like Go and poker.

258
Q

What is data science?

A

A: Data science is an interdisciplinary field that combines advances in computer science, statistics, and other disciplines to extract information from Big Data or data in general.

259
Q

What are the five key data processing methods for managing Big Data?

A

A: The five key data processing methods are:
Capture: Collecting and transforming data into a usable format.
Curation: Ensuring data quality and accuracy through cleaning.
Storage: Recording, archiving, and accessing data based on its structure.
Search: Querying and reviewing large datasets.
Transfer: Moving data to analytical tools via feeds or other methods.

260
Q

Q: Why is data visualization important for Big Data?

A

A: Data visualization helps format, display, and summarize Big Data in graphical forms, enabling users to identify trends, uncover relationships, and analyze data more effectively, even when dealing with unstructured datasets.

261
Q

Q: What is text analytics, and how is it applied in investment management?

A

A: Text analytics involves analyzing large, unstructured text datasets to derive meaning, such as identifying future performance indicators from company filings, earnings calls, and social media.

262
Q

Q: What is NLP, and what are its key applications in finance?

A

A: NLP is a field combining computer science, AI, and linguistics to analyze human language. Applications include sentiment analysis, topic analysis, fraud detection, and compliance monitoring in finance.

263
Q

Q: Which programming languages are commonly used in data science?

A

A: Common programming languages include Python (user-friendly for fintech), R (statistical analysis), Java (internet applications), C/C++ (high-frequency trading), and Excel VBA (customized automation).

264
Q

Q: How can NLP and ML improve investment decision-making?

A

A: NLP and ML can analyze large datasets like earnings reports, analyst commentary, and social media posts to identify sentiment, detect trends, and forecast market events, enhancing investment performance.

265
Q

The value of an investment increases 5% before commissions and fees. This 5% increase represents:

A)
the investment’s net return.

B)
the investment’s gross return.

C)
neither the investment’s gross return nor its net return.

A

Explanation
C) Gross return is the total return after deducting commissions on trades and other costs necessary to generate the returns, but before deducting fees for the management and administration of the investment account. Net return is the return after management and administration fees have been deducted. (Module 1.3, LOS 1.e)

266
Q

For an equity share with a constant growth rate of dividends, we can estimate its:

A)
value as the next dividend discounted at the required rate of return.

B)
growth rate as the sum of its required rate of return and its dividend yield.

C)
required return as the sum of its constant growth rate and its dividend yield.

A

Correct Answer
Explanation
Using the constant growth dividend discount model, we can estimate the required rate of return as
k
e
=
D
1
V
0
+ g
c
. The estimated value of a share is all of its future dividends discounted at the required rate of return, which simplifies to
V
0
=
D
1
k
e
- g
c
if we assume a constant growth rate. We can estimate the constant growth rate as the required rate of return minus the dividend yield. (Module 2.2, LOS 2.b)

267
Q

A dataset has 100 observations. Which of the following measures of central tendency will be calculated using a denominator of 100?

A)
The winsorized mean, but not the trimmed mean.
r
B)

C)
Neither the trimmed mean nor the winsorized mean.

A

Incorrect Answer
Explanation
A The winsorized mean substitutes a value for some of the largest and smallest observations. The trimmed mean removes some of the largest and smallest observations. (Module 3.1, LOS 3.a)

268
Q
A