Chapter 4: Normal Distribution Flashcards
The Normal Distribution
- The normal distribution is the most important one in all of probability and statistics.
- Many numerical populations have distributions that can be fit very closely by an appropriate normal curve.
- Examples include heights, weights, and other physical characteristics, measurement errors in scientific experiments, reaction times in psychological experiments, measurements of intelligence and aptitude, scores on various tests, and numerous economic measures and indicators.
Normal Distribution definition
Parameters of the Normal Distribution
Normal Distribution Graphs with different parameters (means and variances)
Normal Distribution Graphs with different parameters (means and variances) (contd.)
σ and µ in Normal Distribution Graph
- Each density curve is symmetric about µ and bell-shaped, so the center of the bell (point of symmetry) is both the mean of the distribution and the median.
- The value of σ is the distance from µ to the inflection points of the curve (the points at which the curve changes from turning downward to turning upward).
- Large values of σ yield graphs that are quite spread out about µ, whereas small values of σ yield graphs with a high peak above m and most of the area under the graph quite close to µ.
- Thus a large σ implies that a value of X far from µ may well be observed, whereas such a value is quite unlikely when σ is small.
Every normal curve (regardless of its mean or standard deviation) conforms to the following “rule“:
- About 68% of the area under the curve falls within 1 standard deviation of the mean.
- About 95% of the area under the curve falls within 2 standard deviations of the mean.
- About 99.7% of the area under the curve falls within 3 standard deviations of the mean.
- Collectively, these points are known as the empirical rule or the 68-95-99.7 rule. Clearly, given a normal distribution, most outcomes will be within 3 standard deviations of the mean.
The Standard Normal Distribution
Parameters of a Standard Normal Distribution
Definition
The normal distribution with parameter values µ = 0 and σ = 1 is called the standard normal distribution.
A random variable having a standard normal distribution is called a standard normal random variable and will be denoted by Z.
The pdf of Z is:
Example 13
(see powerpoint slides 13-19)
Example 14: 99th Percentile
- The 99th percentile of the standard normal distribution is that value on the horizontal axis such that the area under the z curve to the left of the value is .9900.
- Appendix Table A.3 , gives, for fixed z, the area under the standard normal curve to the left of z, whereas here we have the area and want the value of z.
- This is the “inverse” problem to P(Z <= z) = ?
- so the table is used in an inverse fashion:
- Find in the middle of the table .9900; the row and column in which it lies identify the 99th z percentile.
Here .9901 lies at the intersection of the row marked 2.3 and column marked .03, so the 99th percentile is (approximately) z = 2.33.
Percentiles of the Standard Normal Distribution
- In general, the (100p)th percentile is identified by the row and column of Appendix Table A.3 in which the entry p is found (e.g., the 67th percentile is obtained by finding .6700 in the body of the table, which gives z = .44).
- If p does not appear, the number closest to it is often used, although linear interpolation gives a more accurate answer.
- For example, to find the 95th percentile, we look for .9500 inside the table.
- Although .9500 does not appear, both .9495 and .9505 do, corresponding to z = 1.64 and 1.65, respectively.
- Since .9500 is halfway between the two probabilities that do appear, we will use 1.645 as the 95th percentile and –1.645 as the 5th percentile.
za Notation for z Critical Values
In statistical inference, we will need the values on the horizontal z-axis that capture certain small tail areas under the standard normal curve.
Notation
za will denote the value on the z-axis for which a (alpha) of the area under the z curve lies to the right of za.
(See Figure 4.19.)
For example, z.10 captures upper-tail area .10, and z.01 captures upper-tail area .01.
Since a (alpha) of the area under the z curve lies to the right of za, 1 – a of the area lies to its left. Thus za is the 100(1 – a)th percentile of the standard normal distribution.
By symmetry, the area under the standard normal curve to the left of –za is also a. The za’s are usually referred to as z critical values.
Most Useful z percentiles and za values
Example 15
Non-standard Normal Distributions
Non-standard Normal Distributions (contd.)