Central Limit Theorem Flashcards
1
Q
Central Limit Theorem
A
- As you take more samples, especially large ones(more than 30), your graph of the same means (averages) will look more like a normal distribution.
- The sample mean distribution of a random variable will assume a near-normal or normal distribution if the same size is large enough.
- The theorem states that the sampling distribution of the mean
- The distribution of the same means approaches normal regardless of the shape of the parent population
- Same means (s) will be normally more distributed around (µ) than the individual readings (Xs).
- As n - the sample size - increases, then the sample averages (Xs means) will approach a normal distribution with mean(µ).
2
Q
Central Limit Theorem Formula
A
z= x̅- µ/ σ/√n
Use Z-Score
µ = population mean
σ = population standard deviation
n = sample size
x̅ = sample mean
3
Q
Significance of Central Limit Theorem
A
- The central limit theorem is one of the most profound and useful results in all statistics and probability.
- The large samples (more than 30) from any sort of distribution the same means will follow a normal distribution.
- The spread of the sample means is less (narrower) than the spread of the population you’re sampling from. So, it does not matter how the original population is skewed
- The means of the sampling distribution of the mean is equal to the population mean:
- µx̅ =µX
- The standard deviation of the sample means equals the standard deviation of the population divided by the square root of the sample size:
- σ(x̅) = σ(x) / √(n)
- The means of the sampling distribution of the mean is equal to the population mean:
4
Q
Central Limit Theorem Assumptions
A
- Samples must be independent of each other
- Samples follow random sampling
- If the population is skewed or asymmetric, the sample should be sufficiently large (for example minimum of 30 samples)
5
Q
Why Central Limit Theorem is Important
A
- The Central Limit Theorem allows the use of confidence intervals, hypothesis testing, DOG, regression analysis, and other analytical techniques.
- Many statistics have approximately normal distributions for large sample sizes, even when we are sampling from a distribution that is non-normal
- This means that we can often use well developed statistical inference procedures and probability calculations that are based on a normal distribution, even if we are sampling from a population that is not normal, provided we have a large sample size.
6
Q
Unimodal Distribution
A
- Unimodal Distribution will have only one peak or only one frequent value in the data set.
- Unimodal will have only one mode, the values are increases first and reaches to peak (e.i. is the model or the local maximum) and the decreases.
- Mode is one of the measures of central tendency. Mode is the value that appears most often in a set of data values or a frequent number
- Normal Distribution is the best example of Unimodal.
- Bimodal distribution means there are two different modes, and multimodal means more than two different modes.
7
Q
Central Limit Theorem Example
Less Than
A
- A population of 65 years male patient blood sugar was 100 mg/dL with a standard deviation of 15 mg/Dl. If a sample of 4 patients’ data were drawn, what is the probability of their mean blood sugar is less than 120 mg/dL?
- µ = 100
- x̅ = 120
- n=4
- σ =15
- z= x̅- µ / σ/√n
- =120-100/15/√4
- =20/7.75
- =2.66
- Look up on the Z table z=2.6 on left and z=0.06 on top
- =0.9961
- =99.61%
- Look up on the Z table z=2.6 on left and z=0.06 on top
8
Q
Central Limit Theorem Example
Between
A
- A population of 65 year old male patients blood sugar was 100 mg/dL with a standard deviation of 20 mg/dL. If a sample of 9 patients’ data were drawn, what is the probability of their mean blood sugar is between 85 and 105 mg/dL?
- First, compute P(x<105)
- µ = 100
- x̅ = 105
- n=9
- σ =20
- Compute the Z score z= x̅- µ/ σ/√n
- = 105-100/20/√9
- =5/6.67
- =0.75
- On Z-Table find z=0.7 on left and z=0.05 on top
- =0.7734
- =77.3%
- On Z-Table find z=0.7 on left and z=0.05 on top
- Second, compute P(x<85)
- µ = 100
- x̅ = 85
- n=9
- σ =20
- Compute the Z score z= x̅- µ/ σ/√n
- = 85-100/20/√9
- = -15/6.67
- = -2.24
- On negative Z-Table find z= - 2.2 on left and z=0.04 on top
- = 0.0125
- = 1.25%
- Compute the Z score z= x̅- µ/ σ/√n
- Third, since we are looking for blood sugar between 85 and 105 mg/dL P(85
- = 77.3-1.25
- = 76.05%
- Answer: The probability of mean blood sugar is between 85 and 105 mg/dL is 76.05%
9
Q
Central Limit Theorem Example
Greater Than
A
- A population of 65 year old made patients blood sugar was 100 mg/dL with a standard deviation of 20 mg/dL. If a sample of 16 patients’ data were drawn, what is the probability of their mean blood sugar is more than 90 mg/dL?
- µ = 100
- x̅ = 90
- n=16
- σ =20
- Compute the Z score z= x̅- µ/ σ/√n
- = 90-100/20/√16
- = -10/5
- = -2
- In Z table locate z -2.0 on left side and z 0.00 on top
- = 0.0228
- = 2.27%
- In Z table locate z -2.0 on left side and z 0.00 on top
- Since we are looking for blood sugar more than 90 mg/dL
- = 100% - 2.28%
- = 97.72%
- Answer: The probability of mean blood sugar is greater than 90 mg/dL is 97.72%
*
- Compute the Z score z= x̅- µ/ σ/√n
10
Q
Confidence Intervals
A
- Confidence intervals are used to calculate a degree of certainty that the sample group accurately represents the entire population from which they are drawn.
- Another way of thinking of it is that if you drew the same sized sample group hundreds of times and performed the same measurements, a certain percentage of confidence internals in those sample groups will contain the population mean
- A confident interval is a range of values.
- You have a percentage of certainty that the mean of the population lies within that range in any given sample from that population.
11
Q
The x-bar control charts makes use of what statistical concept?
A
- The Central Limit Theorem
- The Central Limit Theorem is the concept that states that the average values of samples drawn for any statistical universe, regardless of the shape of the distribution of the population of that universe, will tend toward a normal distribution as the sample size grows
12
Q
Which distribution describes the probability of r occurrences in n trails of an event?
A
- Binomial
13
Q
A