Quantitative Methods Flashcards

Acquire knowledge of time value of money, data organization and visualization, probability concepts, common probability distributions, sampling and estimation, hypothesis testing, and linear regression.

1
Q

Define:

A priori probability

A

Probability based on logical analysis rather than on observation/experience or personal judgment.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Define:

Absolute dispersion

A

The amount of variability, without comparison to any reference point or benchmark.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Define:

Absolute frequency

A

The number of times an observation occurs for a particular variable or interval.

(for grouped data)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Define:

Addition rule

for probabilities

A

The probability that A or B occurs equals the probability that A occurs, plus the probability that B occurs, minus the probability that both A and B occur.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Define:

Annuity

A

A finite set of level sequential cash flows.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Define:

Annuity due

A

An annuity where the first cash flow is paid immediately.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Define:

Bernoulli random variable

A

A random variable with outcomes of either 0 or 1.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Define:

Bernoulli trial

A

An experiment which produces one of two outcomes.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Define:

Binomial model

A

An options pricing model where the underlying price can move to one of two possible new prices.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Define:

Binomial random variable

A

The number of successes in n Bernoulli trials

probability of success is constant for all, and trials are independent.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Define:

Binomial tree

A

Model of asset price dynamics where the asset moves up with probability p or down with probability (1 – p).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Define:

Coefficient of variation

A

(CV) The relationship between the standard deviation of a set of observations and their mean value.

Makes it easier to compare “risk/reward” across datasets.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Define:

Combination

A

Ways to choose r objects from n total objects. Order does not matter.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Define:

Conditional probability

A

The probability of an event occurring, given another event has occurred.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Define:

Continuous random variable

A

A random variable whose range of possible outcomes is the real line (all real numbers between −∞ and +∞) or some subset of the real line.

uncountable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Define:

Continuously compounded return

A

ln(1+HPR)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Define:

Correlation

A

A measure of the comovement (linear relationship) between two random variables.

A number between −1 and +1

A standardized version of covariance, demonstrating direction and strength of linear relationship.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Define:

Correlation coefficient

A

A number between −1 and +1 measuring the linear relationship between two variables.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

Define:

Covariance

A

A measure of the co-movement (linear association) between two random variables.

Measure of how the variation in two variables changes together. Covariability of the two variables around their respective means.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

Define:

Covariance matrix

A

A matrix whose entries are covariances.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

Define:

Cross-sectional analysis

A

Analysis that involves comparisons across individuals in a group at a point in time.

Differs from time-series analysis, which analyzes over time.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

Define:

Cross-sectional data

A

Observations over individual units at a point in time.

Differs from time-series data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

Define:

Cumulative distribution function

A

A function giving the probability that a random variable is less than or equal to a specified value.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

Define:

Cumulative relative frequency

A

The fraction of total observations less than the upper limit of a stated interval.

For data grouped into intervals.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

Define:

Data mining

A

Determining a model by repeatedly searching through a dataset for statistically significant patterns.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
26
Q

Define:

Degree of confidence

A

The probability that a confidence interval includes the unknown population parameter.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
27
Q

Define:

Degrees of freedom (df)

A

The number of independent observations used.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
28
Q

Define:

Dependent Events

A

When one event occurring depends on the occurrence of another event.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
29
Q

Define:

Descriptive statistics

A

Summarizing and communicating the characteristics of the dataset.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
30
Q

Define:

Discount

A

1)Reducing value of a future payment for how far away it is in time (calculating present value of a future amount).
2) the amount an instrument is priced below its face (par) value.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
31
Q

Define:

Discount rates

A

The interest rate used to calculate a present value.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
32
Q

Define:

Discounted cash flow models

A

Valuation models that estimate the intrinsic value of a security as the present value of the future benefits expected to be received (dividends).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
33
Q

Define:

Discrete random variable

A

A random variable which can be a countable number of possible values.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
34
Q

Define:

Dispersion

A

Variability around the central tendency.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
35
Q

Define:

Effective annual rate

EAR

A

The actual rate of return in a year, when interest throughout the year is compounded.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
36
Q

Define:

Effective annual yield

EAY

A

An annualized return that accounts for the effect of compounding interest.

aka EAR

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
37
Q

Define:

Empirical probability

A

The probability of an event, estimated as a relative frequency of occurrence, based on experimentation or historical data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
38
Q

Define:

Estimate

A

The value calculated, using an estimator, from sample observations.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
39
Q

Define:

Estimator

A

An estimation formula

Formula to calculate sample mean is an example of an estimator

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
40
Q

Define:

Estimation

A

Estimating the value of a population parameter.

Statistical Inference

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
41
Q

Define:

Event

A

An outcome, or set of outcomes, of a random variable.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
42
Q

Define:

Excess kurtosis

A

Degree to which the peakedness exceeds that of the normal distribution.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
43
Q

Define:

Exhaustive

A

Covering all possible outcomes.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
44
Q

Define:

Expected value

A

The probability-weighted average of all possible outcomes for a random variable.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
45
Q

Define:

Frequency distribution

A

Display of data summarized into a small number of intervals.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
46
Q

Define:

Future value (FV)

A

The amount a payment, or series of payments, will grow to by a future date.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
47
Q

Define:

Geometric mean

A

A measure of central tendency computed by taking the nth root of the product of n values.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
48
Q

Define:

Harmonic mean

A

A type of weighted mean computed by averaging the reciprocals of the observations, then taking the reciprocal of that average.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
49
Q

Define:

Histogram

A

A bar chart of data grouped into a frequency distribution.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
50
Q

Define:

Holding period return

A

The return earned during a specified holding period.

aka total return for the period

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
51
Q

Define:

Hypothesis testing

A

The testing of hypotheses about one or more populations.

Major part of inferential statistics. Using a sample to better understand the population.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
52
Q

Define:

Independent

A

When the occurrence of one event does not change the probability of another event.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
53
Q

Define:

Internal rate of return

A

The discount rate that results in NPV = 0; said differently, the discount rate where PV( investment’s costs)= PV(investment’s benefits).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
54
Q

Define:

Interest rate

A

A rate of return that reflects the relationship between cashflows at different points in time; a discount rate.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
55
Q

IDefine:

IRR rule

A

Investment decision rule, accept a project or investment if IRR > opportunity cost of capital.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
56
Q

Define:

Joint probability

A

The probability that both stated events occur.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
57
Q

Define:

Kurtosis

A

The statistical measure of peakedness of a distribution.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
58
Q

Define:

Leptokurtic

A

When a distribution is more peaked than a normal distribution.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
59
Q

Define:

Level of significance

A

The probability of a Type I error in testing a hypothesis.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
60
Q

Define:

Linear interpolation

A

Estimating an unknown value by using two known values which fall on either side of it.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
61
Q

Define:

Logarithmic scale

A

A scale in which equal distances represent equal proportional changes in the underlying quantity.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
62
Q

Define:

Linear scale

A

A scale in which equal distances correspond to equal absolute amounts.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
63
Q

Define:

Mean absolute deviation

A

The mean of the absolute values of deviations from the sample mean.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
64
Q

Define:

Measure of central tendency

A

A quantitative measure of where data are centered.

Mean, median, mode

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
65
Q

Define:

Median

A

The value of the middle item when sorted into ascending or descending order.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
66
Q

Define:

Mesokurtic

A

When a distribution has the same kurtosis as the normal distribution.

meso- means middle or moderate

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
67
Q

Define:

Mode

A

The most frequently occurring value in a set of observations.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
68
Q

Define:

Money-weighted return

A

The IRR of a portfolio, considering all cash flows. It measures the rate of return over time while considering the impact of withdrawals, deposits, and transfers.

The money weighted return = IRR

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
69
Q

Define:

Monte Carlo simulation

A

A simulation with repeated random sampling, given a set of variable inputs, which is used to estimate a probability distribution of outcomes.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
70
Q

Define:

Multivariate distribution

A

A probability distribution for a group of related random variables.

multivariate = multiple variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
71
Q

Define:

Multiplication rule for probabilities

A

The joint probability of events A and B equals the probability of A given B multiplied by the probability of B.

P(A and B) = P(A|B) x P(B)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
72
Q

Define:

Multivariate normal distribution

A

A probability distribution for a group of random variables; completely defined by the means, variances, and correlations of the variables.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
73
Q

Define:

Mutually exclusive projects

A

When the projects or choices compete directly with each other. It is not possible to choose/complete both, it is a binary decision.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
74
Q

Define:

n Factorial

n!

A

For a positive integer n, the product of the first n positive integers

0 factorial equals 1 by definition.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
75
Q

Define:

Nash equilibrium

A

When two or more participants in a non-cooperative game have no incentive to deviate from their own equilibrium strategies given their opponent’s strategies.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
76
Q

Define:

Net present value

NPV

A

The present value of an investment’s cash inflows (benefits) minus the present value of its cash outflows (costs).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
77
Q

Define:

Nonconventional cash flow

A

A cashflow pattern where the initial outflow is not followed by inflows only. Cash flows can flip multiple times from positive (inflows) to negative (outflows) and back to positive again.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
78
Q

Define:

Normal distribution

A

A continuous, symmetric (no skew) probability distribution that is completely described by its mean and variance.

79
Q

Define:

NPV rule

A

An investment decision rule: An investment should be undertaken if its NPV is positive.

80
Q

Define:

One-tailed hypothesis test

A

Unlike a two-tailed test, where the null is rejeted if the evidence indicates the parameter is either greater than or less than the hypothesized value, the null hypothesis of a one-tailed test is only rejected if the evidence indicates that the population parameter falls on one side of the hypothesized value (either greater, or less, depending on the test.

The alternative hypothesis only has one side.

81
Q

Define:

Opportunity cost

A

The value that investors forgo by choosing a particular course of action.

82
Q

Define:

Ordinary annuity

A

An annuity with a first cash flow that is paid one period from the present.

83
Q

Define:

Parameter

A

A descriptive measure computed from or used to describe a population of data.

84
Q

Define:

Perpetuity

A

A set of never-ending level sequential cash flows, with the first cash flow occurring one period from now.

A perpetual annuity

85
Q

Define:

Parametric test

A

Any test concerned with parameters, or whose validity depends on the assumption that the population is normally distributed.

86
Q

Define:

Platykurtic

A

Less peaked than the normal distribution.

87
Q

Define:

Population

A

All members of a specified group.

88
Q

Define:

Population mean

A

The arithmetic mean value of a population

89
Q

Define:

Population variance

A

A measure of dispersion relating to a population, calculated as the mean of the squared deviations around the population mean.

90
Q

Define:

Population standard deviation

A

A measure of a population’s dispersion, in the same unit of measurement as the observations.

the positive square root of the population variance.

91
Q

Define:

Power of a test

A

The probability of correctly rejecting the null (rejecting the null hypothesis when it is false).

1-𝛽

92
Q

Define:

Present value (PV)

A

The current value of future cash flows. This requires discounting the future cash flows back through time.

Can be used to determine current value of cash inflows from assets, or outflows from liabilities.

93
Q

Define:

Probability

A

The chance that an event will occur, represented by a number between 0 and 1.

94
Q

Define:

Probability distribution

A

A distribution which specifies the probabilities of a random variable’s possible outcomes.

95
Q

Define:

Probability density function

pdf

A

A function with non-negative values such that probability can be described by area under the curve.

96
Q

Define:

Probability function

A

A function that specifies the probability that the random variable takes on a specific value.

97
Q

Define:

Quantile

A

A value at which a stated fraction of the data lies below.

aka fractile

98
Q

Define:

Random variable

A

A quantity whose future outcomes are uncertain.

99
Q

Define:

Range

A

The difference between the maximum and minimum values in a dataset.

100
Q

Define:

Risk

A

The chance of a loss or adverse outcome.

Exposure to uncertainty

101
Q

Define:

Risk premium

A

Extra return required for bearing a specified risk.

102
Q

Define:

Sample

A

A subset of a population.

103
Q

Define:

Sample Statistic

A

A quantity computed from or used to describe a sample.

104
Q

Define:

Sample selection bias

A

Bias introduced by systematically excluding some of the population according to a particular attribute

105
Q

Define:

Sampling error

A

The difference between the observed value of a statistic and the quantity it is intended to estimate.

106
Q

Define:

Scenario analysis

A

Analysis that shows the changes in key financial quantities which would result from specific events. It is a risk management technique considering what the performance of a portfolio would be under specified situations.

107
Q

Define:

Sensitivity analysis

A

Analysis of the range of possible outcomes as specific variables are changed.

108
Q

Define:

Sharpe ratio

A

The average return in excess of the risk-free rate divided by the standard deviation of returns.

average excess return earned per unit of standard deviation

109
Q

Define:

Skewness

A

A quantitative measure of skew (lack of symmetry).

110
Q

Define:

Spearman rank correlation coefficient

A

A measure of correlation applied to ranked data.

111
Q

Define:

Standard deviation

A

Measure of dispersion, in the same units as the original data.

The positive square root of the variance

112
Q

Define:

Standardizing

A

Transforming a set of data by subtracting the mean and dividing the result by the standard deviation. This is done to make the different data comparable.

113
Q

Define:

Standard normal distribution

A

The normal density with mean (μ) equal to 0 and standard deviation (σ) equal to 1.

114
Q

Define:

Statistic

A

A quantity computed from or used to describe a sample of data.

115
Q

Define:

Statistically significant

A

A result indicating that the null hypothesis can be rejected.

116
Q

Define:

Systematic sampling

A

A procedure of selecting every nth member from the population until reaching desired sample size.

Resulting sample should be random

117
Q

Define:

Time-series data

A

Observations of a variable at multiple points in time.

This differs from cross sectional data, which is at a given point in time.

118
Q

Define:

t-Test

A

A hypothesis test using a t-statistic, which follows a t-distribution.

119
Q

Define:

Time-weighted rate of return

A

A measure of investment performance (compounded return) which is not sensitive to the timing or size of cashflows (in or out).

In contrast with money weighted rate of return.

120
Q

Define:

Time value of money

A

The principles which determine the relationships between cashflows at different points in time.

121
Q

Define:

Total probability rule

A

A rule explaining the unconditional probability of an event in terms of probabilities of the event conditional on mutually exclusive and exhaustive scenarios.

122
Q

Define:

Two-tailed hypothesis test

A

A test in which the null hypothesis (that the hypothesized value = a specific value) is rejected if indication is that the population parameter is either smaller or larger than a hypothesized value.

This differs from a one tailed test which only considers one direction, not both.

123
Q

Define:

Type I error

A

Rejecting a true null hypothesis.

124
Q

Define:

Type II error

A

Not rejecting a false null hypothesis.

125
Q

Define:

Unconditional probability

A

The probability of an event not affected by another event.

126
Q

Define:

Variance

A

Squared deviations from a random variable’s mean (expected value).

127
Q

Define:

Weighted mean

A

An average in which each observation is weighted by its relative importance.

128
Q

Define:

Alternative hypothesis

A

The hypothesis that is accepted if the null hypothesis is rejected.

129
Q

Define:

Analysis of variance

ANOVA

A

Analysis of the sources of variation in a dataset.

130
Q

Define:

Bayes’ formula

A

Formula used to update the probability of an event, when new information has been received.

131
Q

Define:

Bimodal

A

Distribution with two modes (frequently occuring data)

132
Q

Define:

Bootstrap

A

Resampling method which repeatedly draws samples and then replaces them into the original population.

133
Q

Define:

Box and whisker plot

A

Graph representing the dispersion of data across quartiles.

134
Q

Define:

Bubble line chart

A

Line chart which uses different sized bubbles to represent a third feature of the data.

135
Q

Define:

categorical data

A

Values which describe a group of observations and therefore can be used to divide dataset into groups.

136
Q

Define:

Central limit theorem

A

The sampling distribution of a variable’s mean approaches a normal distribution as the sample size grows.

137
Q

Define:

Chi-square test

A

Test for potential association between categorical variables.

138
Q

Define:

Confidence level

A

The degree of confidence that the sample distribution represents the population distribution

Confidence that the population parameter would occur within the confidence interval.

139
Q

Define:

Continuous data

A

Data which is measureable and can take on any numerical value within a range of values.

Can be measured, not counted.

140
Q

Define:

Critical values

A

The specific test statistic value where the decision changes from fail to reject to reject the null hypothesis.

141
Q

Define:

Cumulative frequency distribution chart

A

Either cumulative relative or cumulative absolute frequency plotted on y axis against the upper limit of the interval.

Easy to visualize what percent of observations is below a certain value.

142
Q

Data

A

Information

Can be many formats: words, numbers, images, audio

143
Q

Define:

Dependent variable

A

Variable whose variation is explained by the regression.

Variation results from variation in the independent variable.

144
Q

Define:

Error term

A

Difference between an observation and the expected value.

145
Q

Define:

Estimated parameters

A

In regression analysis, estimated values of the population intercept and slope.

146
Q

Define:

False discovery rate

A

Rate of type 1 errors when testing a null hypothesis repeatedly for a specific level of significance.

147
Q

Define:

False discovery approach

A

Adjustment to p-values when test is performed multiple times.

148
Q

Define:

Fat tailed

A

When a distribution has a higher probability of extreme outcomes (fatter tails) than a normal distribution.

leptokurtic

149
Q

Define:

Grouped bar chart

A

Bar chart showing joint frequencies for two categorical variables.

150
Q

Define:

Heteroscedasticity

A

When variance of the error term differs across observations.

When the variance varies

151
Q

Define:

Independent variable

A

The explanatory variable. (Explains the dependent variable in a regression),

152
Q

Define:

Indicator Variable

A

Variable with value of either 0 or 1, based on a binary condition.

“Dummy Variable”

153
Q

Define:

Intercept

A

Expected value of the dependent variable in a simple linear regression if the independent is zero.

Where the regression line intercepts the y axis, as the x-axis is 0.

154
Q

Define:

Inverse transformation method

A

Using randomly generated numbers from a continuous uniform distribution in order to generate random observations from any distributions.

155
Q

Define:

Jackknife

A

Resampling method which takes from original observed data sample and leaves out one observation at a time (without replacement).

156
Q

Define:

Judgmental sampling

A

Selecting from a population based on own knowledge and judgement.

157
Q

Define:

Lin-log model

A

Regression model with independent variable in logarithmic form.

158
Q

Define:

Linear regression

A

regression modeling the straight line relationship between the dependent and independent variables.

159
Q

Define:

Log-log model

A

Regression model where both dependent and independent variables are in logarithmic form.

160
Q

Define:

Log-lin model

A

Regression model where dependent variable is in logarithmic form.

161
Q

Define:

Mean square error (MSE)

A

Sum of squares error divided by the degrees of freedom

df =(n - k - 1)

162
Q

Define:

Mean square regression (MSR)

A

Sum of squares regression divided by the number of independent variables, k

163
Q

Define:

Multiplication rule for independent events

A

When two events are independent, the joint probability of the two is the product of the individual probabilities of each.

164
Q

Define:

Multiple testing problem

A

Risk of getting statistically significant result when performing test multiple times.

165
Q

Define:

Non-probability sampling

A

Sampling based on factors other than probability, like judgement or convenience.

166
Q

Define:

Null hypothesis

A

The hypothesis being tested.

Either reject, or fail to reject (result is not statistically significant).

167
Q

Define:

Observation

A

Value of a specific variable

Can becollected at a point in time or over a period.

168
Q

Define:

Overfitting

A

When too many independent variables are used to try and explain the dependent. This results in the coefficients being noise rather than a true relationship.

Can cause false sense of predictive power.

169
Q

Define:

p-value

A

Smallest level of significance at which the null can be rejected.

170
Q

Define:

Pearson correlation

A

Measure of the relationship between two variables.

Correlation

differs from spearman’s rank correlation

171
Q

Define:

Probability sampling

A

Sampling method where every member of a population has an equal chance of being selected.

172
Q

Define:

Probability tree diagram

A

Diagram where branches, with the probabilities, stem from nodes, representing events.

The branches are mutually exclusive

173
Q

Define:

Regression analysis

A

Used to examine if a variable is useful in explaining another variable.

174
Q

Define:

Regression coefficients

A

Intercept and slope coefficients of a regression.

175
Q

Define:

Resampling

A

Repeatedly drawing samples from original data sample in order to statistically infer population parameters.

176
Q

Define:

Residual

A

Difference between the observation and the predicted value.

177
Q

Define:

Sample correlation coefficient

A

Standardized measure of how two variables move together.

standardizing the covariance

done by dividing the covariance by the product of the two variables’ standard deviation

178
Q

Define:

Scatter plot

A

When two variables are plotted along the axis and points represent pairs of the two variables.

179
Q

Define:

Slope coefficient

A

Coefficient of an independent variable.

Average change in the dependent variable per one-unit change in the independent.

180
Q

Define:

Spurious correlation

A

When a correlation between variables is not a causal relationship. It may actually be due to chance or a relationship with a third variable.

181
Q

Define:

Stacked bar chart

A

Bars represent sub-groups and are placed on top of each other to form a single bar. Each sub section is a different color to represent its contribution, and overall height represents the marginal frequency for the category.

182
Q

Define:

Standard error of the estimate

A

Measure of the fit of a regression line.

Square root of the Mean Squared Error

183
Q

Define:

Standard error of the forecast

A

Measure of the uncertainty associated with a forecasted value of the dependent variable.

184
Q

Define:

Standard error of the slope coefficient

A

The ratio of the SEE:square root of the variation of the independent variable.

185
Q

Define:

Stratified random sampling

A

Sampling method in which the population is divided into subpopulations (strata) based on criteria, then random samples are drawn from each.

Sample proportional to size of stratum relative to overall population.

186
Q

Define:

Structured data

A

Data that are highly organized in a pre-defined manner.

187
Q

Define:

Sum of squares error

SSE

A

Sum of the squared deviations of the observed dependent variable and the estimated value (based on the estimated regression line).

Aka the residual sum of squares.

188
Q

Define:

Sum of squares regression

SSR

A

Sum of the squared deviations of the estimated value of the dependent variable (based on the regression line), and the mean of the dependent variable.

189
Q

Define:

Sum of squares total

SST

A

Sum of the squared deviations of the observed dependent variable from its mean.

SST=SSR+SSE

190
Q

Define:

Thin tailed

A

A distribution with a lower number of extreme outcomes than the normal distribution.

platykurtic

191
Q

Define:

Tree map

A

Set of colored rectangles representing groups, the area of each is proportional to the value of the group.

192
Q

Define:

Two dimensional rectangular array

A

A data table. Columns and rows hold multiple variables and observations.

193
Q

Define:

Unimodal

A

Distribution with one mode (most fequent value).

194
Q

Define:

Variable

A

Quantity or characteristic which can be measured, counted, or categorized and is subject to change.