Sampling Distributions Confidence Intervals Flashcards
What is Statistical Inference?
- Whenever a sample is selected to either learn something about, or draw conclusions regarding a larger group of items (Population)
Study the process of Inferential Statistics
https://docs.google.com/document/d/1r_ttbYs-4jXdkBbVGPH9vk1swjRRmRJUWdllcJXdaAI/edit?usp=sharing
What needs to be considered when doing Inferential Statistics?
- If we calculate a statistic from a sample, will it exactly represent the population parameter (population value) we are interested in? (Sampling Error)
What is a sampling error?
- An error in a statistical analysis arising from the unrepresentativeness of the sample taken
What needs to be considered if the calculated statistics from an Inferential statistics study do not exactly represent the population parameter we are interested in?
If not then:
- Will the sample statistic underestimate or overestimate the population parameter?
- How large will any error be?
- Is it likely that the error will be small enough that the sample statistic will be useful?
We need to know something about the possible range of errors, and the likely size of errors.
A soft drink manufacturer sells one of its popular flavours in a 600mls bottle. Fill of soft drink it is normally distributed with a mean fill of 600mls and a standard deviation fill of 10mls. What is the probability that any one bottle will have less than 598mls, i.e. P(X < 598)
Given Information: - m = 600 s = 10 - Normally Distributed - We know each bottle is different, and we can work out the probability of getting amounts of fill. - X is a random variable representing the bottle fill - P(X<598) = P(z<598-600/10) = P(z < -0.2) = 0.4207 - This example is in the google doc
How does Inferential Statistics work when using larger samples?
- (Using the bottle example) Whilst we might ask questions about one bottle, we would never test the process using only one bottle.
- First you would select a sample
- Then calculate the sample mean fill
- For example if you had three samples and each had n=25 and a different mean, then the sample mean would become a “random variable”.
- Each sample has a different error
What is a sampling distribution?
- A sampling distribution is the distribution of possible values any sample statistic may take or spread around the population parameter of interest
- The sampling distribution also takes account of the distribution of possible sampling errors
What three points are important to understand about Sampling Distributions?
- Every sample statistic calculated is a random variable
- Every random variable will have a distribution
- If we can define the distribution then we can use it to answer questions such as that posed by the bottling process example.
How do you develop a Sampling Distribution?
- Assume there is a population; Population size N=4; Random variable, X, is age of individuals; Values of X: 18, 20, 22, 24 (years)
- First the mean age & St. Dev must be found
- Then, the second set of observations can be analysed. This will give 16 possible samples (sampling with replacement). Resulting in 16 Sample Means
- Next, the Sampling Distribution of All Sample Means must be found by finding the sample mean (in this case it would be all the 16 means added together and then being divided by 16) and by finding the St. Dev
- This can then be graphed to display the results
- Example on google doc
What is the Standard Error of the Mean used for?
- Different samples of the same size from the same population will yield different sample means
- A measure of the variability in the mean from sample to sample is given by the Standard Error of the Mean
- Example in google doc
What is assumed when using the Standard Error of the Mean?
- That sampling is done with replacement or sampling is done without replacement from a large or infinite population
- Note that the standard error of the mean decreases as the sample size increases
What is an easier way of finding the standard deviation of the sample means?
- To divide the standard deviation of the original population by the square root of the number of observations
- Google doc
What happens when the population is normal?
- A Normal Population Distribution means that there will be Normal Sampling Distribution (They have the same mean)
- This means that as n increases, the standard deviation of the sample means decreases
- In other words, If a population is normal with mean (μ) and standard deviation (σ), the sampling distribution of X-bar is also normally distributed
How would you find the Z value for the Sampling Distribution of the mean? Also, study the example for how to find the Z value for the sampling distribution of the mean.
https://docs.google.com/document/d/1r_ttbYs-4jXdkBbVGPH9vk1swjRRmRJUWdllcJXdaAI/edit?usp=sharing
What happens if the population is not Normal?
- If the Population Distribution is not normal then the sampling Distribution will become normal as n increases.