F4 Mathematical Foundations / Probability Theory III Flashcards

Question 1

Q

What is the difference between the Bernoulli and the binomial distribution?

Answer

A

Bernoulli represents the outcome of a single trial.

Binomial represents the total number of successes in a fixed number of independent Bernoulli trials.

Question 2

Q

How do you pronounce this: e^x

Answer

A

E raised to the power of x

Question 3

Q

ln⁡(e^x ) = ?
ln⁡(e^1 ) = ?
ln⁡(2,71) ≈ ?

Answer

A

ln⁡(e^x )=x
ln⁡(e^1 )=1
ln⁡(2,71)≈1

Question 4

Q

Why can the log transformation be very helpful?

Answer

A

Log-transformation can be a way of dealing with extreme outliers – like income.

Question 5

Q

How does the exponential function look?

Answer

A

It’s never negative. It approaches zero never completely. The intercept=1.

Question 6

Q

What is the difference between a logit and probit model?

Answer

A

Probit Model: Assumes the error terms follow a standard normal distribution. The normal distribution is used as the link function.

Logit Model: Assumes the error terms follow a standard logistic distribution. Logit link function.

Question 7

Q

What is the standard logistic distribution used for? How is compared to the normal distribution?

Answer

A

Model binary outcomes.

It has slightly heavier tails than the normal distribution, which means it assumes there may be more extreme values.

Question 8

Q

Write up the simple binary regression and name the coefficients

Answer

A

y_i=β_0+β_1 x_i+u_i

Independent, dependent, intercept and error term.

Question 9

Q

What do we call a coefficient, when it’s an estimate?

Question 10

Q

What do we assume about the error term? Two things.

Answer

A

The expected value of the error term is zero (no systematic error). E(u)=0

X and u are independent. We cannot predict the value of u based on x. E(u|x)=E(u)=0. This is for all independent variables.

u_i ∼ N(0,σ^2)

Question 11

Q

What is a synonym for the regression line?

Answer

A

The conditional expectation function

Question 12

Q

What is the estimate for beta-1 and beta-0?

Answer

A

Beta-1: Cov(x,y)/Var(x)

Beta-0: Y-bar - beta-1x-bar

Question 13

Q

What does the following indicate? Draw it
Cov(x,y)>0:
Cov(x,y)<0:
Cov(x,y)=0:

Answer

A

Cov(x,y)>0: Indicates a positive linear relationship. As X increases, Y tends to increase.

Cov(x,y)<0: Indicates a negative linear relationship. As X increases, Y tends to decrease.

Cov(x,y)=0: Suggests no linear relationship between X and Y.

Question 14

Q

What is the support of covariance and variance?

Answer

A

Covariance: -∞ to ∞
Variance: 0 to ∞

Question 15

Q

Which of the two are a linear operator: Covariance and variance

Answer

A

Covariance: Yes
Variance: No

Question 16

Q

What are the three types of error in a regression? And which of them are equal each other when x and y are independent

Answer

A

TSS (all variation in y) = ESS (explained variation) + SSR (unexplained variation)

Independent: TSS=SSR

Question 17

Q

What is the total sum of squares (TSS)? Draw it.

Answer

A

Totalt variation y around the mean y-bar.
(y_i - y-bar)^2

How much error is there if I guess the mean every time.

Question 18

Q

What is explained sum of squares (ESS)? Draw it.

Answer

A

What does the model explain beyond guessing the mean (we want it to be high).

(y_i-hat - y-bar)^2

Question 19

Q

What is sum of squared residuals (SSR)? Draw it.

Answer

A

Variation not explained by the model (residuals)

(y_i - y_i-bar)^2 = u_i^2

Question 20

Q

What do we called the proportion of explained variation by the model? What is the equation?

Answer

A

R^2 = ESS/TSS

So, how much variation explained by the model as a proportion of the totalt amount of variation.

How closely is the regression line correlated with y_i?

Question 21

Q

What is the ‘true’ theoretical error term called when we try to estimate it?

Answer

A

Estimated error term or residual