Statistics & Properties Flashcards
Sample variance formula
s^2 = Sum of (Xi - X bar)^2 / (n - 1)
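The formula above can be sketched in a few lines of Python (the sample `data` is a made-up example):

```python
def sample_variance(data):
    # Sample variance: squared deviations from the mean, divided by n - 1.
    n = len(data)
    mean = sum(data) / n
    return sum((x - mean) ** 2 for x in data) / (n - 1)

data = [2, 4, 4, 4, 5, 5, 7, 9]
print(sample_variance(data))  # 32/7, same as statistics.variance(data)
```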
Skewness formula & what does it measure?
Measure of asymmetry. Skewness = Sum of (Xi - X bar)^3 / (n s^3)
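A stdlib sketch of the card's formula (divisor conventions vary slightly between textbooks; this uses the sample standard deviation s with the n - 1 divisor):

```python
import math

def skewness(data):
    # Sum of cubed deviations divided by n * s^3; sign indicates skew direction.
    n = len(data)
    mean = sum(data) / n
    s = math.sqrt(sum((x - mean) ** 2 for x in data) / (n - 1))
    return sum((x - mean) ** 3 for x in data) / (n * s ** 3)

print(skewness([1, 2, 3, 4, 100]))  # positive: one large value pulls the right tail out
```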
Is skewness scale invariant?
Yes
Values of skewness for no/positive/negative skews
Skewness = 0: symmetric (e.g. normal distribution)
Skewness > 0: positive / right skew
Skewness < 0: negative / left skew
Relation between mean, median and mode for positive skew
Mode < median < mean
What is Kurtosis & formula
Measure of how much of the distribution lies in its tails. Always non-negative (not signed).
Kurtosis = Sum of (Xi - X bar)^4 / (n s^4)
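The kurtosis formula in the same stdlib style as the skewness sketch (again using the n - 1 divisor for s):

```python
import math

def kurtosis(data):
    # Sum of fourth-power deviations divided by n * s^4.
    # Heavy tails push the value above 3, the normal benchmark.
    n = len(data)
    mean = sum(data) / n
    s = math.sqrt(sum((x - mean) ** 2 for x in data) / (n - 1))
    return sum((x - mean) ** 4 for x in data) / (n * s ** 4)

print(kurtosis([1, 2, 3, 4, 5]))  # below 3: flat, thin-tailed (platykurtic)
```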
Values of kurtosis
Kurtosis = 3: normal distribution (mesokurtic)
Kurtosis < 3: platykurtic (flat-topped)
Kurtosis > 3: leptokurtic (peaked)
Sum of (Xi - X bar)^2 can be simplified to
Sum of Xi^2 - n (X bar)^2
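A quick numerical check of this identity on made-up data:

```python
# Verify: Sum of (Xi - X bar)^2 == Sum of Xi^2 - n * (X bar)^2
data = [3.0, 7.0, 8.0, 12.0]
n = len(data)
xbar = sum(data) / n
lhs = sum((x - xbar) ** 2 for x in data)
rhs = sum(x ** 2 for x in data) - n * xbar ** 2
print(abs(lhs - rhs) < 1e-9)  # True
```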
Is covariance scale invariant?
NO - Cov(2X, Y) = 2Cov(X, Y)
Formula for correlation in terms of other measures
rXY = Cov(X, Y) / sqrt(V(X) V(Y))
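A stdlib sketch of correlation built from covariance and the two variances (the n - 1 divisors cancel, so the convention does not matter as long as it is consistent):

```python
import math

def correlation(xs, ys):
    # rXY = Cov(X, Y) / sqrt(V(X) * V(Y))
    n = len(xs)
    xbar, ybar = sum(xs) / n, sum(ys) / n
    cov = sum((x - xbar) * (y - ybar) for x, y in zip(xs, ys)) / (n - 1)
    vx = sum((x - xbar) ** 2 for x in xs) / (n - 1)
    vy = sum((y - ybar) ** 2 for y in ys) / (n - 1)
    return cov / math.sqrt(vx * vy)

print(correlation([1, 2, 3], [2, 4, 6]))  # 1.0: perfect linear relation
```

Note that rescaling X leaves the result unchanged (scale invariance), unlike covariance.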
Is correlation scale invariant?
YES
What values does rxy lie between?
-1 and +1
Define estimator
A random variable that is a function of the data
Define estimate
The actual value the estimator takes for a given sample, e.g. sample mean = 2
What is an unbiased estimator?
E(Theta hat) = Theta
- the sampling distribution of theta hat is centred on the true parameter value theta
Two examples of unbiased estimators
Sample mean X bar: E(X bar) = mu
Sample variance S^2: E(S^2) = sigma^2
What is an efficient estimator?
Of a set of unbiased estimators, the efficient one is the one with the smallest variance.
Compare the efficiency of one individual vs sample mean
V(X bar) = sigma^2 / n < sigma^2 = V(Xi), so the sample mean is more efficient than any one individual observation.
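A small simulation sketch of this comparison (sample size 25 and the N(0, 1) draws are illustrative choices, not from the card):

```python
import random
import statistics

random.seed(0)

# Spread of single observations vs spread of means of n = 25 observations.
singles = [random.gauss(0, 1) for _ in range(2000)]
means = [statistics.fmean(random.gauss(0, 1) for _ in range(25))
         for _ in range(2000)]

print(statistics.variance(singles))  # near sigma^2 = 1
print(statistics.variance(means))    # near sigma^2 / n = 0.04
```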
What is central limit theorem?
Whatever the distribution of X, provided that sigma^2 is finite, the distribution of the sample mean X bar tends towards a normal distribution as n becomes large. Rule of thumb: n > 25-30.
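A simulation sketch of the CLT in action (the exponential distribution and n = 30 are illustrative choices): the starting distribution is heavily right-skewed, yet the sample means cluster around the true mean of 1.

```python
import random
import statistics

random.seed(1)

# Means of n = 30 draws from Exp(1), a very non-normal distribution.
means = [statistics.fmean(random.expovariate(1.0) for _ in range(30))
         for _ in range(5000)]

print(statistics.fmean(means))  # close to the true mean, 1
```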
When using CLT for discrete distributions, what MUST we remember to do?
Using the continuous normal as an approximation to a discrete distribution requires a continuity correction.
E.g. P(X <= 21) is approximated by P(X <= 21.5), then convert to Z.
P(X > 100) = P(X >= 101), approximated by P(X >= 100.5).
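A sketch of the first correction above, using a hypothetical Binomial(50, 0.4) so that mu = 20 is near the cut-off of 21:

```python
from statistics import NormalDist

# Approximate P(X <= 21) for X ~ Binomial(50, 0.4) by a normal with the
# same mean and variance, evaluated at 21.5 (continuity correction).
n, p = 50, 0.4
mu = n * p                         # 20
sigma = (n * p * (1 - p)) ** 0.5   # sqrt(12)
approx = NormalDist(mu, sigma).cdf(21.5)
print(approx)  # close to the exact binomial P(X <= 21)
```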
What is max likelihood estimation?
A method of estimation that chooses the parameter value under which the observed data are most probable. E.g. having observed 20 heads and 30 tails in 50 trials, we pick the binomial p that maximises the likelihood of that outcome, giving p hat = 20/50 = 0.4.
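The coin example can be checked numerically: a grid search over the (log-)likelihood peaks at the sample proportion.

```python
import math

heads, n = 20, 50

def log_likelihood(p):
    # Binomial log-likelihood up to a constant (the nCk term doesn't depend on p).
    return heads * math.log(p) + (n - heads) * math.log(1 - p)

# Search p over a fine grid in (0, 1); the maximum is at heads / n.
best = max((i / 1000 for i in range(1, 1000)), key=log_likelihood)
print(best)  # 0.4
```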
What is a consistent estimator?
The probability limit of theta hat = theta.
The probability that the difference between theta hat and theta exceeds any allowed error goes to zero as n gets bigger.
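A simulation sketch of consistency for the sample mean (N(0, 1) draws, error tolerance 0.1, and the sample sizes are all illustrative choices): the chance that X bar misses the true mean by more than the tolerance shrinks as n grows.

```python
import random
import statistics

random.seed(2)

def miss_rate(n, eps=0.1, trials=500):
    # Fraction of samples whose mean misses the true mean (0) by more than eps.
    return sum(abs(statistics.fmean(random.gauss(0, 1) for _ in range(n))) > eps
               for _ in range(trials)) / trials

print(miss_rate(10), miss_rate(1000))  # the second is far smaller
```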