CT3 - Probability & Statistics Flashcards

You may prefer our related Brainscape-certified flashcards:
1
Q

How do you derive a moment from a moment generating function (m.g.f)?

A

Continuous function MGF:
integral of etxf(x)

Discrete function MGF: sum[0,∞] of etxf(x)

Differentiate MGF i times for i’th moment and then evaluate derivative of M(t) at t=0

e.g. for second moment - E(X2), use M’‘(t) at t = 0

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is a Standard Error?

A

The standard error (SE) is given by

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

DBN of Sample Variance v. Population Variance

A

For a sample of size n with a sample variance S2 from a population with a population variance of σ2 where (n - 1) is degrees of freedom:

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is a Z-Score?

A

For a given point estimate (e.g. mean of a sample), the Z-score (z) is given by the following (σ should be the population SD, but may be substituted with the sample SD if the value for the population is not known and the sample is not too small)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is the Expectation and Variance of a Chi-Square Distribution?

A

If X has the chi-square distribution with n degrees of freedom then

E(X)=n
var(X)=2n

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What are the First and Second Moments of the Binomial Distribution?

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is the Central Limit Theorem?

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Define the Significance Level α

A

Significance Level α given by:

α = P(reject H0 when H0 is true)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Define Sample Variance

A

Sample variance S2, for a sample of size n, is given by:

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Prove that Sample Variance should be Equal to Population Variance

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is an F distribution?

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is the Cramer-Rao Lower Bound (CRLB)?

A

The Frechet-Cramer-Rao (or just Cramer-Rao) Lower Bound (CRLB) provides the lower bound for the variance of an unbiased estimator. If an estimator’s variance is equal to the CRLB, then it is described as optimal - i.e. no estimator can be more efficient.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is the Covariance Formula for Calculating the Correlation Coefficient?

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is the Raw Score Formula for Calculating the Correlation Coefficient?

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is the t-statistic for testing the sample correlation coefficient?

A

For a sample, it is the sample correlation coefficient r, rather than the population correlation coefficient ρ that is calculated. As such, r is an estimate of ρ.

If ρ=0, r ≈ 0

The distribution of r, given ρ = 0 is given by the test statistic t, which has a Student t distribution with n - 2 degrees of freedom.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is the Least Squares Line of Regression?

A

If two variables X and Y appear to have a correlation, then the approximate dependence between X and Y may be modelled using a polynomial.

For a linear dependence of the form y = ax + b the principle of least squares may be used i.e., L = E[Y - aX - b]2 aand derivatives taken w.r.t a and b and equated to zero in order to derive:

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

How are the Moments about the Mean Derived from the Cumulant Generating Function for n=2, n=3

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Compound Distribution Mean and Variance
for N=n

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

Compound Distribution - Compound Mean

A
20
Q

Compound Distribution - Compound Variance

A
21
Q

Variance of Sum of Uncorrelated Variables and their Mean

A
22
Q

What is the Method of Moments?

A

The method of moments is a method of estimation of population parameters such as mean, variance, median, etc. (which need not be moments), by equating sample moments with unobservable population moments and then solving those equations for the quantities to be estimated.

23
Q

What is the Central Tendency of a Distribution?

A

The central tendency of a distribution locates the “center” of a distribution of values.

The three major types of estimates of central tendency are

  • the mean
  • the median
  • and the mode.
24
Q

How do you Calculate the First and Third Quartiles of a Data Set

A

To calculate the lower and upper quartiles, split the data set in two (if there is an odd number of data items, exclude the median from the calculation).

First Quartile is median of first half of split data set

Second Quartile is median of second half of split data set

25
Q

What is Skewness?

A

Skewness is a measure of the asymmetry of the probability distribution. The skewness value is a real number, i.e. can be positive or negative. In some situations skewness can be undefined.

Qualitatively, a negative skew indicates that the tail on the left side of the distribution is longer than the right side.

26
Q

What is a Probability Tree?

A
27
Q

What is a Sample Space, an Event (Elementary / Compound)

A

The sample space is an exhaustive list of all the possible outcomes of an experiment.

Each possible result of such a study is represented by one and only one point in the sample space, which is usually denoted by Ω.

An event is any collection of outcomes of an experiment.

Formally, any subset of the sample space is an event.

Any event which consists of a single outcome in the sample space is called an elementary or simple event.

Events which consist of more than one outcome are called compound events.

28
Q

How to Calculate r Permutations from n

nPr

A
29
Q

Key Properties of

n Choose r

A
30
Q

Normal Approximation to the Binomial

A

If np and n(1 - p) > 5

Can approximate binomial using normal distribution with a continuity adjustment

In Binomial,

P(X ≤ x) = P(X < x + 1)

For Y, Normally Distributed random variable approximating binomial X:

P(X < x + 1) ≈ P (Y ≤ x + 1/2)

Where X is a binomially distributed random variable and Y is a normally distributed random variable with the same expectation and variance as X

  • E[Y]=E[X]=np*
  • Var(Y) = Var(X) = np(1-p)*
31
Q

What is the Quadtratic Solution
for ax2 + bx + c = 0

A
32
Q

Define Likelihood Function and Maximum Likelihood Estimate (MLE)

A

NB: When taking initial Log L(θ), any constant terms can be lumpted together as “constant”, as they will be lost in the first differentation.

33
Q

What is Integration by Parts

A

Integration by parts is a theorem that relates the integral of a product of functions to the integral of their derivative and antiderivative.

34
Q

ρ, r, R2 - coefficients

A
  • ρ* - population correlation coefficient
  • r* - sample correlation coefficient (estimator for ρ)
  • R2* - coefficient of determination = r2
  • (where intercept is included, for linear regration)*
35
Q

Assumptions of ANOVA

A

In an ANOVA model, the jth value in the ith treatment (group) - Yij, is assumed to take the value:

Yij = μ + τi + eij

μ - population (grand) mean

τi - ith treatment effect

eij - normally distributed residual error

  • Independence of observations (iid) – this is an assumption of the model that simplifies the statistical analysis.
  • Normality
    the distributions of the residuals are normal.
  • Common variance of data in the groups - “homoscedasticity”
    the variance of data in groups should be the same.
36
Q

Define Mean Squared

(MSE)

A

Mean Squared Error (MSE) is a measure of the goodness of fit of an estimator, defined as:

37
Q

nth Central Moment

A
38
Q

Fisher Transformation for Sample Correlation Coefficient

A

If r is the sample correlation coefficient for (X,Y) with a bivariate normal distribution, then the value z given by the Fisher z transformation is approximately normally distributed:

39
Q

What is the Bernoulli Distribution?

A

The Bernoulli Distribution is simply the BInomial Distribution with n=1

i.e. It is a discrete distribution for one even where the probability of it happening = p and the probability of it not happening q = 1 - p

40
Q

Point Estimation of Error Variance for Linear Regression

A
41
Q

MGF of a Sum of Random Variables

A

Let X and Y be independent random variables.

Let Z = X + Y. Then the mgf of Z is given by

MZ(t) = MX(t)MY(t)

If X1, X2, · · · , Xn are independent and identically distributed, then

MX1+X2+···+Xn(t) = [M(t)]n

where *M(t) = MXj(t) *
is the common mgf of the Xj’s.

Proof.

E[etZ] = E[et(X+Y)] = E[etXetY]

= E[etX] E[e<span>tY</span>] = MX(t)MY(t)

42
Q

What is the Dbn of a linear Combination of Χ2 (chi-square) variables

A

If there are n chi-square variables, each with ri degrees of freedom

X1 ~ Χ2 (r1)
X2 ~ Χ2 (r2)
….
Xn ~ Χ2 (rn)

  • Y = X1 + X2 + … + Xn*
  • Y ~ Χ2(r1 + r2 + … + rn)*
43
Q

What is the Dbn of a Linear Combination of Normal Variables?

A

If there is a linear combination of N normal variables:

X1 ~ (μ112)
X2 ~ (μ222)

X2 ~ (μ222)

  • Y = X1+X2+…+Xn*
  • Y~ (μ12+… +μn, σ1222+…+σn2)*
44
Q

DBN of RSS and Error Variance

A

If RSS is residual sum of squares for n data points in relation to a particular line of regression, the estimator for the variance of the error is given by:

45
Q

Prove E[E[Y|X]] = E[Y]

A
46
Q

What Happens to Correlation Coefficient Under Linear Transformation of Data?

A

If a paired data set has a sample correlation r, if the data is transformed using a series of one or more linear transformations:

Sign may change, depending on transformation

rold| = | rtransformed|