F3 Mathematical Foundations / Probability Theory II Flashcards

1
Q

What is the expected value?

A

The mean. Each outcome is weighted by its probability.

Theoretical: E(X)=μ
Empirical: E(X)=x-bar

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Is the expected value a linear operator?

A

Yes.

Additivity: E(x+y)=E(x)+E(y)
Homogeneity / scalar multiplication: E(ax)=a*E(x)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is the expected value of a constant?

A

The constant: E(a)=a

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is the difference between the theoretical and empirical expected value?

A

Theoretical: The sum of all outcomes (x) times their probability P(X=x)

Empirical: The sum of all observations divided by the number of observations.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is the first moment about zero

A

The mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is the first central moment?

A

Always zero as central moments describe the shape of a probability distribution relative the mean of the distribution. Irrelevant

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is the second central moment?

A

Var(x) = [(x - μ_x )^2]

The variance of the distribution, measuring the spread or variability around the mean. We are “punished” exponentially for very big outliers because of the square.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is the lower bound of the variance and what is the variance of a constant?

A

Var(x) > 0.
Var(a) = 0.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is the third central moment?

A

Measures the skewness in absolute terms. Normally it’s standardized by the standard deviation:

= 0: symmetrical distribution
>0: Right skewed
<0: Left skewed

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is the fourth central moment?

A

Measures the “tailedness” or “peakedness” of the distribution. Again, we use the standardized kurtosis.

=3: Normal distribution (mesokurtic)
<3: Flatter peak
>3: Sharper peak

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is the covariance?

A

Describes the relation between two variables. means. It measures the joint variability/linear relationship.

The bounds are -∞ to ∞.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

How is the covariance interpreted?

A

How variables move around their mean on average.

> 0: When x is above its mean value, we would expect y to be above its mean value as well.

<0: When x is above its mean value, we would expect y to be below its mean value.

=0: Independent

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is the equation for the covariance

A

Cov(X,Y) = E([x - μ_x ] *[y - μ_y ])

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is the covariance of Cov(x,x)?

A

Var(x)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What happens if you add a constant to the covariance?

A

Adding a constant to the variance of something doesn’t change it at all.

Cov(β+x,α+y)=Cov(x,y)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What are the bounds of correlation?

A

-1 to 1. It’s the normalized measure (by standard deviation) of the covariance between to variables.

17
Q

What is a PDF?

A

Probability density function: The relative frequency or likelihood of drawing observations in and interval (≠ probability).

The PDF allows me to compare the relative likelihood of the random variable falling within different intervals.

If the PDF is higher for one interval compared to another, then the probability of drawing a value from that interval is higher

18
Q

What is the probability of a point estimate in a PDF?

A

Zero as probability is always defined over an interval in PDF

19
Q

What is the CDF?

A

What is the probability up to a certain point? The integral of the PDF from upper to lower bound.

The CDF will always be smooth for continuous variables because there is no definitive probabilities for point estimates.

20
Q

Mention five continuous distributions

A

Uniform
Normal distribution
Standard normal distribution
Logistic
Student t distribution

21
Q

How is the PDF and CDF denoted?

A

PDF = f(x)
CDF = F(x)

22
Q

What does the CDF for uniform distribution look like?

A

It’s linear as an increase in x has the same effect anywhere in the distribution.

23
Q

Describe the normal distribution

A

It’s bell shaped and has two parameters:

μ (location)
σ^2 (scale)

24
Q

Describe the standard normal distribution. Draw the PDF and the CDF.

A

All normal distributions can be transformed into a standard normal distribution. Denoted ϕ

μ = 0 (location)
σ^2 = 1 (scale)

CDF: Φ(0)=0,5 because the distribution is symmetrical. The highest increase in probability is just around zero.

PDF: Bellshaped
CDF: A hill

25
Q

What is the student t distribution used for?

A

Used for testing the statistical significance of our beta coefficient. Critical value ±1,96.

26
Q

How do we obtain a large t-value?

A

Either by a large coefficient or by a small standard error.

What makes the standard error smaller?
(1) Big sample size
(2) Low variance in regression line (SSR)
(3) High variance in x-values