13 - CLT, Standard Normal, Z-tables, Kurtosis Flashcards
Normal distributions
- characterized by mean (µ), and variance σ2
- Empirical Rule
- both Binomial & Poisson distributions start to look normally distributed
- Binomial = when n gets large
- Poisson = when lamda (the rate) gets large
- convergence to normality can be explained with CLT
How to calculate probabilities for a normal random variable?
calculate by finding the area under the curve (integration, use JMP calculator or z-table)
Location shift of distribution
mean, µ controls location of the center of the distribution
can take on any value between -infinity & +infinity
Scale changes of distribution
variance, σ2, controls how spread out the distribution is
- greater σ2 = greater spread
- σ is always greater than or equal to 0
difference between normal and binomial/poisson
normal: no relationship between µ & σ
binomial & poisson = σ2 is linked to µ
Central Limit Theorem (CLT)
Under appropriate conditions, as the sample size (n) gets large, the sample mean tends to a normal distribution
*we can use normal probability calculations for inferences about the sample mean*
*the CLT is true, regardless of the distribution of X itself*
Xbarn → N(µ, σ2/n)
Xbarn = sample mean
µ = E(X)
σ2 = Var(X)
Standard Normal
Z ~ N(0,1)
a random variable standardized to have a mean of 0 and a variance of 1
How to turn a normal into a standard normal
Ex. X ~ N(5, 15)
find P(X<0)
take the z-score so we can use the z-tables!
P(X<0) = P [(X-5)/sqr(15) < (0-5)/sqr(15)]
= P [Z < (0-5)/sqr(15)]
= P (Z < -1.291)
⇒ look @z-tables, round to -1.3, find 0.0968
Using the z-tables for P(-1.3 <= Z <= -.06)
area of interest can be expressed as area to left of -0.6 minus the area to the left of -1.3
P(Z <= -0.6) - P(Z<= -1.3)
= 0.2743 - 0.0968 = 0.1775
Finding the x percentile of the distribution
Ex. Identifying the value x, associated with the event P(X<= x) = 0.8
(by how many days into the pregnancy have 80% of women given birth)
(80% from the left of the distribution)
- solve the question for z, such that P(Z<=z) = 0.8
- undo the z-score using the formula x = zσ + µ
- Use right-hand z-table, find 0.8 in 3rd column, this corresponds to 0.8416 in first column
- x = (0.8416)*(10) + 268 = 276.4
- A little over a week past the average due date (268 days)
Quantiles of the distribution
pth quantile of the distribution is the value x such that P(X<=x) = p
*for the standard normal, the quantiles are exactly the quantities in the first column of the RHS z-score table*
******for quantiles less than 0.5 (50th percentile), report the negative of the number in the first column of the table******
Normal Quantile Plot (NQP)
displays observed quantiles plotted against the expected quantiles from a standard normal distribution
departures from normality = departures from 45 degree reference line (when observed = expected)
departures from normality = R/L skew, heavy tails
NQP axes
x-axis = theoretical quantiles from a standard normal
y-axis = observed quantiles
any given point:
x-coord = theoretical normal quantile
y-coord = observed (actual) quantile
When is there a departure from normality on a NQP?
if there are points outside the confidence bands (95% of normal quantile plots have points within the bands)
NQP: symptom of…
right skewed
left skewed
heavy tails
right skewed = convexity
left skewed = concavity
heavy tails = s-shape (below ref line on LHS, above ref line on RHS)